- NextBillion AI
- Hong Kong
Stars
An official implementation of VideoRoPE: What Makes for Good Video Rotary Position Embedding?
Decentralized LLM fine-tuning and inference with offloading
Video generation from text&image, 1st-gen
We introduce temporal working memory (TWM), which aims to enhance the temporal modeling capabilities of multimodal foundation models (MFMs). This plug-and-play module can be easily integrated into …
Official Repo for Paper "HealthGPT: A Medical Large Vision-Language Model for Unifying Comprehension and Generation via Heterogeneous Knowledge Adaptation"
Align Anything: Training All-modality Model with Feedback
FAST-LIVO2: Fast, Direct LiDAR-Inertial-Visual Odometry
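A Python package that integrates algorithms and various machine learning approaches to extract features (genes) effective for classification and attribute them accordingly.
Tomoyo is a Kotlin Compose Multiplatform sample app for common functionalities such as navigation, socket (for chat), video, audio, and db
The 辅学院 enterprise training system is a full-featured enterprise training system built around video on demand, training, exams, in-person instruction, and reporting. The open-source edition is an online learning system implemented as a streamlined version of the enterprise edition. It aims to be an online training system, employee training platform, internal corporate training system, online education system, and open-source training system suitable for every industry.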
BIRD-CRITIC 1.0: Can Large Language Models Solve USER SQL Issues in Real-World Database Applications?
A curated list of papers, code and resources pertaining to image composition/compositing or object insertion/addition/compositing, which aim to generate realistic composite images.
Eko (Eko Keeps Operating) - Build Production-ready Agentic Workflow with Natural Language - eko.fellou.ai
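Build multimodal language agents for fast prototyping and production
🔥🔥🔥 AI paper generation and AI paper writing: one-click paper generation; AI-written graduation theses, thesis proposals, literature reviews, and course papers; AI-written reports and proposals; AIGC-rate reduction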
MetaTrx: Comprehensive Cross-Species Transcriptome Analysis
A family of versatile and state-of-the-art video tokenizers.
[AAAI 2025] Depth-Centric Dehazing and Depth-Estimation from Real-World Hazy Driving Video
LiveMock is a comprehensive tool for API development and testing, offering mock data, request proxying, and logging, to streamline workflows and track traffic.
The first open autoregressive foundational video AI model.
Hallo2: Long-Duration and High-Resolution Audio-driven Portrait Image Animation
"LightRAG: Simple and Fast Retrieval-Augmented Generation"