-
NTT Data
- Osaka, Japan
Stars
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
Create Customized Software using Natural Language Idea (through LLM-powered Multi-Agent Collaboration)
MiniCPM-o 2.6: A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming on Your Phone
"LightRAG: Simple and Fast Retrieval-Augmented Generation"
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
BISHENG is an open LLM devops platform for next generation Enterprise AI applications. Powerful and comprehensive features include: GenAI workflow, RAG, Agent, Unified model management, Evaluation,…
The open source platform for AI-native application development.
Nexa SDK is a comprehensive toolkit for supporting GGML and ONNX models. It supports text generation, image generation, vision-language models (VLM), Audio Language Model, auto-speech-recognition (…
Recommendation Algorithm大规模推荐算法库,包含推荐系统经典及最新算法LR、Wide&Deep、DSSM、TDM、MIND、Word2Vec、Bert4Rec、DeepWalk、SSR、AITM,DSIN,SIGN,IPREC、GRU4Rec、Youtube_dnn、NCF、GNN、FM、FFM、DeepFM、DCN、DIN、DIEN、DLRM、MMOE、PLE、ESM…
Hallo2: Long-Duration and High-Resolution Audio-driven Portrait Image Animation
The next generation deep reinforcement learning tookit
DAMO-YOLO: a fast and accurate object detection method with some new techs, including NAS backbones, efficient RepGFPN, ZeroHead, AlignedOTA, and distillation enhancement.
Align Anything: Training All-modality Model with Feedback
Large-scale, Informative, and Diverse Multi-round Chat Data (and Models)
awesome game security [Welcome to PR]
SDG is a specialized framework designed to generate high-quality structured tabular data.
Accelerating the development of large multimodal models (LMMs) with one-click evaluation module - lmms-eval.
Your Automatic Prompt Engineering Assistant for GenAI Applications
Applications self-hosting and DevOps platform for running open source, web-based linux Panel of lite PaaS
airda(Air Data Agent)是面向数据分析的多智能体,能够理解数据开发和数据分析需求、理解数据、生成面向数据查询、数据可视化、机器学习等任务的SQL和Python代码
A powerful baseline for image classification, face recognition and image retrieval with Pytorch
We introduce temporal working memory (TWM), which aims to enhance the temporal modeling capabilities of Multimodal foundation models (MFMs). This plug-and-play module can be easily integrated into …
[CVPR2023] REC-MV: REconstructing 3D Dynamic Cloth from Monocular Videos