Highlights
- Pro
Stars
Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts) @ NAACL 2024
Toolkit for linearizing PDFs for LLM datasets/training
⚡ TabPFN: Foundation Model for Tabular Data ⚡
Python tool for converting files and office documents to Markdown.
[CVPR 2023] VideoMAE V2: Scaling Video Masked Autoencoders with Dual Masking
A Lightweight Face Recognition and Facial Attribute Analysis (Age, Gender, Emotion and Race) Library for Python
MiVOLO age & gender transformer neural network
Papers for Video Anomaly Detection, released codes collection, Performance Comparision.
YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open
GeoCalib: Learning Single-image Calibration with Geometric Optimization (ECCV 2024)
Scalable and user friendly neural 🧠 forecasting algorithms.
Open-source and strong foundation image recognition models.
Codebase for the Recognize Anything Model (RAM)
Fixes the Sound Quality of AirPods when connected to a Mac.
Display progress as a pretty table in the command line.
Multi Person Skeleton Based Action Recognition and Tracking
NOT MAINTAINED - A simple Rust like Result type for Python 3. Fully type annotated.
An extremely fast Python package and project manager, written in Rust.
🚀 Easier & Faster YOLO Deployment Toolkit for NVIDIA 🛠️
An open-source & self-hostable Heroku / Netlify / Vercel alternative.
A command-line tool and Python library and Pytest plugin for automated testing of RESTful APIs, with a simple, concise and flexible YAML-based syntax
Experiment with NVIDIA Triton and Whisper
OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark
Python Computer Vision & Video Analytics Framework With Batteries Included
D-FINE: Redefine Regression Task of DETRs as Fine-grained Distribution Refinement [ICLR 2025 Spotlight]
Papers, code and datasets about deep learning and multi-modal learning for video analysis