-
Adafruit
- SF Bay Area
Stars
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
Create Customized Software using Natural Language Idea (through LLM-powered Multi-Agent Collaboration)
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
"LightRAG: Simple and Fast Retrieval-Augmented Generation"
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
BISHENG is an open LLM devops platform for next generation Enterprise AI applications. Powerful and comprehensive features include: GenAI workflow, RAG, Agent, Unified model management, Evaluation,…
A UI-Focused Agent for Windows OS Interaction.
The open source platform for AI-native application development.
Nexa SDK is a comprehensive toolkit for supporting GGML and ONNX models. It supports text generation, image generation, vision-language models (VLM), Audio Language Model, auto-speech-recognition (…
A True Instrumentable Binary Emulation Framework
The next generation deep reinforcement learning tookit
Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance
Hallo2: Long-Duration and High-Resolution Audio-driven Portrait Image Animation
Recommendation Algorithm大规模推荐算法库,包含推荐系统经典及最新算法LR、Wide&Deep、DSSM、TDM、MIND、Word2Vec、Bert4Rec、DeepWalk、SSR、AITM,DSIN,SIGN,IPREC、GRU4Rec、Youtube_dnn、NCF、GNN、FM、FFM、DeepFM、DCN、DIN、DIEN、DLRM、MMOE、PLE、ESM…
FEDML - The unified and scalable ML library for large-scale distributed training, model serving, and federated learning. FEDML Launch, a cross-cloud scheduler, further enables running any AI jobs o…
This Inventory management system is the currently Ford Asia Pacific after-sales logistics warehousing supply chain process . After I leave Ford , I start this project . You can share your vacant wa…
DAMO-YOLO: a fast and accurate object detection method with some new techs, including NAS backbones, efficient RepGFPN, ZeroHead, AlignedOTA, and distillation enhancement.
🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.
SDG is a specialized framework designed to generate high-quality structured tabular data.
open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.
Mobile-Agent: The Powerful Mobile Device Operation Assistant Family
Tool Learning for Big Models, Open-Source Solutions of ChatGPT-Plugins
Your Automatic Prompt Engineering Assistant for GenAI Applications
GeneFace: Generalized and High-Fidelity 3D Talking Face Synthesis; ICLR 2023; Official code
awesome game security [Welcome to PR]
LLM based data scientist, AI native data application. AI-driven infinite thinking redefines BI.