Stars
A generative world for general-purpose robotics & embodied AI learning.
GaussianAD: Gaussian-Centric End-to-End Autonomous Driving
[ICLR'23 Spotlight & IJCV'24] MapTR: Structured Modeling and Learning for Online Vectorized HD Map Construction
[CVPR 2024] Official implementation of "Deformable 3D Gaussians for High-Fidelity Monocular Dynamic Scene Reconstruction"
A 3DGS framework for omni urban scene reconstruction and simulation.
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything
[IEEE T-PAMI 2024] All you need for End-to-end Autonomous Driving
A collaboration-friendly studio for NeRFs
Unofficial implementation of "Street Gaussians: Modeling Dynamic Urban Scenes with Gaussian Splatting", ECCV 2024.
[ECCV 2024] Street Gaussians: Modeling Dynamic Urban Scenes with Gaussian Splatting
[ECCV 2022] This is the official implementation of BEVFormer, a camera-only framework for autonomous driving perception, e.g., 3D object detection and semantic map segmentation.
[PAMI'23] TransFuser: Imitation with Transformer-Based Sensor Fusion for Autonomous Driving; [CVPR'21] Multi-Modal Fusion Transformer for End-to-End Autonomous Driving
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"
The world’s fastest framework for building websites.
Easy-to-use image segmentation library with an awesome pre-trained model zoo, supporting a wide range of practical tasks in Semantic Segmentation, Interactive Segmentation, Panoptic Segmentation, Image …
YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite
[ICRA'23] BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation
A cloud-native vector database, storage for next generation AI applications
Official Jetpack Compose samples.
🔊 Text-Prompted Generative Audio Model
A standalone version of the readability lib
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Robust Speech Recognition via Large-Scale Weak Supervision
Learn how to design, develop, deploy and iterate on production-grade ML applications.
ROS 2 Navigation Framework and System
An Efficient Probabilistic 3D Mapping Framework Based on Octrees. Contains the main OctoMap library, the viewer octovis, and dynamicEDT3D.
🤖🐑 It's a sheep, it's a dolly, it's a following robot. Dolly was born to be cloned.
A curated collection of iOS, ML, AR resources sprinkled with some UI additions