Lists (4)
Sort Name ascending (A-Z)
Stars
🚀 DeepSeek-V3大模型逆向API【特长:良心厂商】(官方贼便宜,建议直接走官方),支持高速流式输出、多轮对话,联网搜索,r1深度思考,零配置部署,多路token支持,仅供测试,如需商用请前往官方开放平台。
llama3 implementation one matrix multiplication at a time
An open source implementation of "Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning", an all-new multi modal AI that uses just a decoder to generate both text and images
Generative Models by Stability AI
Label, clean and enrich text datasets with LLMs.
High-performance In-browser LLM Inference Engine
Real-time face swap for PC streaming or video calls
Llama中文社区,Llama3在线体验和微调模型已开放,实时汇总最新Llama3学习资料,已将所有代码更新适配Llama3,构建最好的中文Llama大模型,完全开源可商用
A PyTorch native library for large model training
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)
openpilot is an operating system for robotics. Currently, it upgrades the driver assistance system on 275+ supported cars.
This is a python API which allows you to get the transcript/subtitles for a given YouTube video. It also works for automatically generated subtitles and it does not require an API key nor a headles…
SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2…
🔉 Youtube Videos Transcription with OpenAI's Whisper
Faster Whisper transcription with CTranslate2
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
Large Language-and-Vision Assistant for Biomedicine, built towards multimodal GPT-4 level capabilities.