Stars
[CVPR 2024 Highlight] GLEE: General Object Foundation Model for Images and Videos at Scale
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything
A Python Package to Tackle the Curse of Imbalanced Datasets in Machine Learning
Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2
[ECCV 2024] API code for T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy
Images to inference with no labeling (use foundation models to train supervised models).
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
Real-time face swap for PC streaming or video calls
A general-purpose library for Evolutionary Algorithms in Python.
Google Colab Notebook for creating and testing a Tiny Yolo 3 real-time object detection model.
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code
The Ludii general game system, developed as part of the ERC-funded Digital Ludeme Project.
PCA / Regularization / Kernel / Time Series / Bayesian regularization
scikit-learn: machine learning in Python
Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications
Term project for Machine Learning UoT course
The PyTorch-based audio source separation toolkit for researchers
Automated Backward and Forward Selection in Python
Python library for interactive topic model visualization. Port of the R LDAvis package.