-
The University of Texas at Austin
- Austin, TX
Stars
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
Stable Diffusion web UI
A Gradio web UI for Large Language Models with support for multiple inference backends.
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
We write your reusable computer vision tools. 💜
LabelImg is now part of the Label Studio community. The popular image annotation tool created by Tzutalin is no longer actively being developed, but you can check out Label Studio, the open source …
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
Python Implementation of Reinforcement Learning: An Introduction
OpenMMLab's next-generation platform for general 3D object detection.
PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
Google Drive Public File Downloader when Curl/Wget Fails
PyTorch Lightning + Hydra. A very user-friendly template for ML experimentation. ⚡🔥⚡
Sequence modeling benchmarks and temporal convolutional networks
PointNet and PointNet++ implemented by pytorch (pure python) and on ModelNet, ShapeNet and S3DIS.
OpenMMLab Video Perception Toolbox. It supports Video Object Detection (VID), Multiple Object Tracking (MOT), Single Object Tracking (SOT), Video Instance Segmentation (VIS) with a unified framework.
[ECCV 2022] This is the official implementation of BEVFormer, a camera-only framework for autonomous driving perception, e.g., 3D object detection and semantic map segmentation.
Rich-cli is a command line toolbox for fancy output in the terminal
PyTorch implementation of TabNet paper : https://arxiv.org/pdf/1908.07442.pdf
ROS Wrapper for Intel(R) RealSense(TM) Cameras
The devkit of the nuScenes dataset.
Simple and easily configurable grid world environments for reinforcement learning
Images to inference with no labeling (use foundation models to train supervised models).
A collection of extensions and data-loaders for few-shot learning & meta-learning in PyTorch