Stars
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to this project.
"LightRAG: Simple and Fast Retrieval-Augmented Generation"
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
BISHENG is an open LLM devops platform for next generation Enterprise AI applications. Powerful and comprehensive features include: GenAI workflow, RAG, Agent, Unified model management, Evaluation,…
An elegant PyTorch deep reinforcement learning library.
A UI-Focused Agent for Windows OS Interaction.
[NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-sim…
Nexa SDK is a comprehensive toolkit for supporting GGML and ONNX models. It supports text generation, image generation, vision-language models (VLM), Audio Language Model, auto-speech-recognition (…
The next-generation deep reinforcement learning toolkit
Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance
Hallo2: Long-Duration and High-Resolution Audio-driven Portrait Image Animation
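Recommendation Algorithm: a large-scale recommendation algorithm library covering classic and recent recommender-system algorithms, including LR, Wide&Deep, DSSM, TDM, MIND, Word2Vec, Bert4Rec, DeepWalk, SSR, AITM, DSIN, SIGN, IPREC, GRU4Rec, Youtube_dnn, NCF, GNN, FM, FFM, DeepFM, DCN, DIN, DIEN, DLRM, MMOE, PLE, ESM…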
FEDML - The unified and scalable ML library for large-scale distributed training, model serving, and federated learning. FEDML Launch, a cross-cloud scheduler, further enables running any AI jobs o…
This inventory management system reflects the after-sales logistics, warehousing, and supply chain process currently used at Ford Asia Pacific. After I left Ford, I started this project. You can share your vacant wa…
DAMO-YOLO: a fast and accurate object detection method with several new techniques, including NAS backbones, efficient RepGFPN, ZeroHead, AlignedOTA, and distillation enhancement.
🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.
SDG is a specialized framework designed to generate high-quality structured tabular data.
Open-source multimodal large language model that can hear and talk while thinking, featuring real-time end-to-end speech input and streaming audio output conversational capabilities.
Tool Learning for Big Models, Open-Source Solutions of ChatGPT-Plugins
Your Automatic Prompt Engineering Assistant for GenAI Applications
Awesome game security [PRs welcome]
LLM based data scientist, AI native data application. AI-driven infinite thinking redefines BI.
Large-scale, Informative, and Diverse Multi-round Chat Data (and Models)
Application self-hosting and DevOps platform for running open-source software: a web-based Linux panel and lightweight PaaS
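airda (Air Data Agent) is a multi-agent system for data analysis. It can understand data development and data analysis requirements, understand the data, and generate SQL and Python code for tasks such as data querying, data visualization, and machine learning.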
【CVPR 2024 Highlight】Monkey (LMM): Image Resolution and Text Label Are Important Things for Large Multi-modal Models