Stars
The Triton TensorRT-LLM Backend
Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.
Get up and running with Llama 3.3, Phi 4, Gemma 2, and other large language models.
Open-source and strong foundation image recognition models.
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…
🧠 世界上覆盖最全的优秀Qwen提示语大全,欢迎贡献你的提示词。🧠 The most comprehensive collection of excellent Qwen prompts in the world. Feel free to contribute your own prompts!
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
TensorRT deployment for CenterPoint Lidar Detection Model.
Faster Whisper transcription with CTranslate2
General AI methods for Anything: AnyObject, AnyGeneration, AnyModel, AnyTask, AnyX
Segment Anything in High Quality [NeurIPS 2023]
Robust Speech Recognition via Large-Scale Weak Supervision
The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.
PointCloud Annotation Tools, support to label object bound box, ground, lane and kerb
Production First and Production Ready End-to-End Speech Recognition Toolkit
A lightweight tool for labeling 3D bounding boxes in point clouds.
OpenAnnotate3D: Open-Vocabulary Auto-Labeling System for Multi-modal Data
A project demonstrating Lidar related AI solutions, including three GPU accelerated Lidar/camera DL networks (PointPillars, CenterPoint, BEVFusion) and the related libs (cuPCL, 3D SparseConvolution…
An ultra fast tiny model for lane detection, using onnx_parser, TensorRTAPI, torch2trt to accelerate. our model support for int8, dynamic input and profiling. (Nvidia-Alibaba-TensoRT-hackathon2021)
[ECCV 2022] ByteTrack: Multi-Object Tracking by Associating Every Detection Box
🛠 A lite C++ toolkit of 100+ Awesome AI models, support ORT, MNN, NCNN, TNN and TensorRT. 🎉🎉
HybridNets: End-to-End Perception Network
Pytorch implementation of our paper "CLRNet: Cross Layer Refinement Network for Lane Detection" (CVPR2022 Acceptance).
YOLOX is a high-performance anchor-free YOLO, exceeding yolov3~v5 with MegEngine, ONNX, TensorRT, ncnn, and OpenVINO supported. Documentation: https://yolox.readthedocs.io/
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
LSD (LiDAR SLAM & Detection) is an open source perception architecture for autonomous vehicle/robotic
PyTorch code and models for the DINOv2 self-supervised learning method.