Stars
We write your reusable computer vision tools. 💜
A DeepStream sample application demonstrating end-to-end retail video analytics for brick-and-mortar retail.
Plug-and-Play Custom Parsers for AI Models in NVIDIA DeepStream SDK. Supported YOLOv11 model.
NVIDIA DeepStream SDK 7.1 / 7.0 / 6.4 / 6.3 / 6.2 / 6.1.1 / 6.1 / 6.0.1 / 6.0 / 5.1 implementation for YOLO models
Python Computer Vision & Video Analytics Framework With Batteries Included
Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 2, and other large language models.
Document to Markdown OCR library with Llama 3.2 vision
State-of-the-art 2D and 3D Face Analysis Project
The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.
Chat with your documents using Vision Language Models. This repo implements an End to End RAG pipeline with both local and proprietary VLMs
[NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation
Code for robust monocular depth estimation described in "Ranftl et. al., Towards Robust Monocular Depth Estimation: Mixing Datasets for Zero-shot Cross-dataset Transfer, TPAMI 2022"
Advanced Retrieval-Augmented Generation (RAG) through practical notebooks, using the power of the Langchain, OpenAI GPTs ,META LLAMA3 ,Agents.
Full stack, modern web application template. Using FastAPI, React, SQLModel, PostgreSQL, Docker, GitHub Actions, automatic HTTPS and more.
Provides an ensemble model to deploy a YOLOv8 TensorRT model to Triton
A framework to easily use 32 (and growing) different image matching methods
This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…
BoxMOT: pluggable SOTA tracking modules for segmentation, object detection and pose estimation models
Object Detection toolkit based on PaddlePaddle. It supports object detection, instance segmentation, multiple object tracking and real-time multi-person keypoint detection.
Framework agnostic sliced/tiled inference + interactive ui + error analysis plots
Demo code and other handouts for students of our FastAPI Web Apps course.
OCR, layout analysis, reading order, table recognition in 90+ languages
Segment Anything in Medical Images
This is the official code for MobileSAM project that makes SAM lightweight for mobile applications and beyond!
A really more real-time adaptation of deep sort
🇻🇳 VNTechies Dev Blog - Kho tài nguyên mã nguồn mở với sứ mệnh đào tạo kiến thức, định hướng nghề nghiệp cho cộng đồng Cloud ☁️ DevOps 🚀