Lists (31)
audio
Darknet
Data
data_augmentation
detect
diffusion
efficientnetv2
face
Framework
image_calibration
image_fusion
image-matching
image_stitching
Infrared
lane_detection
large_language_model
LLM
Opencv
Others
powershell
Python
segmentation
spider
Super_Resolution
tensorflow
Tools
Tracking
Yolo
Classification
Learning
Natural Language Processing Series
Stars
A CPU Realtime VLM in 500M. Surpassed Moondream2 and SmolVLM. Training from scratch with ease.
HuggingLLM, Hugging Future.
Solve Visual Understanding with Reinforced VLMs
Collection of awesome LLM apps with AI Agents and RAG using OpenAI, Anthropic, Gemini and opensource models.
[IEEE TIP] TOPIC: A Parallel Association Paradigm for Multi-Object Tracking under Complex Motions and Diverse Scenes
MambaGlue: Fast and Robust Local Feature Matching With Mamba @ ICRA'25
Code for "MatchAnything: Universal Cross-Modality Image Matching with Large-Scale Pre-Training", Arxiv 2025.
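LLM theoretical performance analysis tool supporting params, FLOPs, memory, and latency analysis.
[Three Years of Interviews, Five Years of Mock Exams] An interview handbook for AI algorithm engineers. Covers interview and written-test experience and practical knowledge across the AI industry, including AIGC, traditional deep learning, autonomous driving, machine learning, computer vision, natural language processing, reinforcement learning, embodied intelligence, the metaverse, and AGI.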
The world's simplest facial recognition api for Python and the command line
A guide to Python best practices. The Chinese translation of "Hitchhiker's Guide to Python".
Fisheye-Calib-Adapter: An Easy Tool for Fisheye Camera Model Conversion
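"A Guide to Consuming Open-Source Large Models": tutorials tailored for Chinese users on quickly fine-tuning (full-parameter/LoRA) and deploying Chinese and international open-source large language models (LLM) and multimodal large models (MLLM) in a Linux environment
"Dive into Deep Learning": aimed at Chinese readers, runnable, and open to discussion. The Chinese and English editions are used for teaching at more than 500 universities in over 70 countries.
Building large language models that understand social etiquette | Covers tutorials on prompt engineering, RAG, agents, and LLM fine-tuning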
Nexa SDK is a comprehensive toolkit for supporting GGML and ONNX models. It supports text generation, image generation, vision-language models (VLM), Audio Language Model, auto-speech-recognition (…
Each chapter of this (mini-)book guides you in programming one important software component for automated driving.
Scenic: A Jax Library for Computer Vision Research and Beyond
Python interface to PROJ (cartographic projections and coordinate transformations library)
[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ult…
[CVPR2021] PyTorch implementation for paper "Progressively Complementary Network for Fisheye Image Rectification Using Appearance Flow"
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
CLIP (Contrastive Language-Image Pretraining): predict the most relevant text snippet given an image
An open source implementation of CLIP.
[NeurIPS 2024] Classification Done Right for Vision-Language Pre-Training