Stars
Integrate the DeepSeek API into popular software
OpenEMMA, a permissively licensed open source "reproduction" of Waymo’s EMMA model.
An implementation of EMMA (End-to-End Multimodal Model for Autonomous Driving) using the Claude API, based on the EMMA paper.
awesome-autonomous-driving
Large World Model -- Modeling Text and Video with Millions Context
openpilot is an operating system for robotics. Currently, it upgrades the driver assistance system on 275+ supported cars.
PLUTO: Push the Limit of Imitation Learning-based Planning for Autonomous Driving
[ECCV 2024 Oral🔥] Arc2Face: A Foundation Model for ID-Consistent Human Faces
End-to-End Object Detection with Transformers
Official implementation for 'SparseOcc: Rethinking Sparse Latent Representation for Vision-Based Semantic Occupancy Prediction' (CVPR 2024)
[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ult…
This project includes a client/library to connect to Electrolux and AEG cleaner robots.
EOS is a dual-core operating system designed specifically for embodied intelligence, suitable for robots, drones, satellites or other scenarios requiring real-time and general capabilities.
This project aims to reproduce Sora (OpenAI's T2V model); we hope the open source community will contribute to this project.
📚200+ Tensor/CUDA Cores Kernels, ⚡️flash-attn-mma, ⚡️hgemm with WMMA, MMA and CuTe (98%~100% TFLOPS of cuBLAS/FA2 🎉🎉).
On the Road with GPT-4V(ision): Explorations of Utilizing Visual-Language Model as Autonomous Driving Agent
[CVPR 2023] An academic alternative to Tesla's occupancy network for autonomous driving.
Vision-Centric BEV Perception: A Survey
A Simple PointPillars PyTorch Implementation for 3D LiDAR (KITTI) Detection.
Lift, Splat, Shoot: Encoding Images from Arbitrary Camera Rigs by Implicitly Unprojecting to 3D (ECCV 2020)
Implementation of paper - YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors
Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.
You Only Look Once for Panoptic Driving Perception. (MIR2022)
Best Practices, code samples, and documentation for Computer Vision.
Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite
12 Weeks, 24 Lessons, IoT for All!