Stars
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite
ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/autogen-officehour
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
OpenMMLab Detection Toolbox and Benchmark
Mask R-CNN for object detection and instance segmentation on Keras and TensorFlow
deep learning for image processing including classification and object-detection etc.
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
ChatGLM2-6B: An Open Bilingual Chat LLM | 开源双语对话语言模型
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.
Fast and flexible image augmentation library. Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
Llama中文社区,Llama3在线体验和微调模型已开放,实时汇总最新Llama3学习资料,已将所有代码更新适配Llama3,构建最好的中文Llama大模型,完全开源可商用
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
End-to-End Object Detection with Transformers
ChatGLM3 series: Open Bilingual Chat LLMs | 开源双语对话语言模型
🍀 Pytorch implementation of various Attention Mechanisms, MLP, Re-parameter, Convolution, which is helpful to further understand papers.⭐⭐⭐
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, Image similarity and more.
Semantic segmentation models with 500+ pretrained convolutional and transformer-based backbones.
YOLOX is a high-performance anchor-free YOLO, exceeding yolov3~v5 with MegEngine, ONNX, TensorRT, ncnn, and OpenVINO supported. Documentation: https://yolox.readthedocs.io/
OpenMMLab Semantic Segmentation Toolbox and Benchmark.
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
Official release of InternLM series (InternLM, InternLM2, InternLM2.5, InternLM3).
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
Use PEFT or Full-parameter to finetune 400+ LLMs (Qwen2.5, InternLM3, GLM4, Llama3.3, Mistral, Yi1.5, Baichuan2, DeepSeek3, ...) and 150+ MLLMs (Qwen2-VL, Qwen2-Audio, Llama3.2-Vision, Llava, Inter…