Stars
SkyPilot: Run AI and batch jobs on any infra (Kubernetes or 14+ clouds). Get unified execution, cost savings, and high GPU availability via a simple interface.
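CLiB Chinese LLM capability evaluation leaderboard (continuously updated): currently covers 195 large models, including commercial models such as chatgpt, gpt-4o, o3-mini, Google gemini, Claude3.5, Zhipu GLM-Zero, ERNIE Bot (文心一言), qwen-max, Baichuan, iFlytek Spark, SenseTime senseChat, and minimax, as well as DeepSeek-R1, deepseek-v3, qwen2.5, llama3.3, phi-4, glm4, 书生int…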
Speech-to-text, text-to-speech, speaker diarization, speech enhancement, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, HarmonyOS…
StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.
Fast and accurate automatic speech recognition (ASR) for edge devices
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
A feature-rich command-line audio/video downloader
Port of OpenAI's Whisper model in C/C++
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
The minimal opencv for Android, iOS, ARM Linux, Windows, Linux, MacOS, HarmonyOS, WebAssembly, watchOS, tvOS, visionOS
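Collection of publicly available person re-identification datasets
A curated collection of open-source Chinese large language models, focusing on smaller models that can be privately deployed and are cheaper to train, including base models, vertical-domain fine-tuning and applications, datasets, and tutorials.
This project completes all tasks from NLP-Beginner: introductory NLP exercises (text classification, information extraction, knowledge graphs, machine translation, question answering, text generation, Text-to-SQL, text correction, text mining, knowledge distillation, model acceleration, OCR, TTS, Prompt, embedding, etc.). All code has been tested and runs correctly.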
Paper list for single object tracking (State-of-the-art SOT trackers)
ncnn is a high-performance neural network inference framework optimized for the mobile platform
The official code of "Towards Unified Text-based Person Retrieval: A Large-scale Multi-Attribute and Language Search Benchmark"
The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.
LAVIS - A One-stop Library for Language-Vision Intelligence
Tensors and Dynamic neural networks in Python with strong GPU acceleration
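Deep learning interview handbook (covering mathematics, machine learning, deep learning, computer vision, natural language processing, SLAM, and more)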
Scene Text Recognition with Permuted Autoregressive Sequence Models (ECCV 2022)
Most popular metrics used to evaluate object detection algorithms.
Compare multiple optimization methods on triton to improve model service performance
Torchreid: Deep learning person re-identification in PyTorch.