Lists (1)
Sort Name ascending (A-Z)
Starred repositories
本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)
类似按键精灵的鼠标键盘录制和自动化操作 模拟点击和键入 | automate mouse clicks and keyboard input
[EMNLP 2023] The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning
Examples and tutorials on using SOTA computer vision models and techniques. Learn everything from old-school ResNet, through YOLO and object-detection transformers like DETR, to the latest models l…
JSON to JSON transformation library written in Java.
JavaScript API for Chrome and Firefox
A comic app built with Flutter, supporting multiple comic sources.
RxJava – Reactive Extensions for the JVM – a library for composing asynchronous and event-based programs using observable sequences for the Java VM.
MeterSphere 是新一代的开源持续测试工具,让软件测试工作更简单、更高效,不再成为持续交付的瓶颈。
Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI
中英文敏感词、语言检测、中外手机/电话归属地/运营商查询、名字推断性别、手机号抽取、身份证抽取、邮箱抽取、中日文人名库、中文缩写库、拆字词典、词汇情感值、停用词、反动词表、暴恐词表、繁简体转换、英文模拟中文发音、汪峰歌词生成器、职业名称词库、同义词库、反义词库、否定词库、汽车品牌词库、汽车零件词库、连续英文切割、各种中文词向量、公司名字大全、古诗词库、IT词库、财经词库、成语词库、地名词库、…
YOLOv10: Real-Time End-to-End Object Detection [NeurIPS 2024]
Fast and memory-efficient exact attention
V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images.
JavaScript API for face detection and face recognition in the browser and nodejs with tensorflow.js
OpenFace – a state-of-the art tool intended for facial landmark detection, head pose estimation, facial action unit recognition, and eye-gaze estimation.
An Open Source Tools for Speaker Recognition
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
Robust Speech Recognition via Large-Scale Weak Supervision
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
The world's simplest facial recognition api for Python and the command line
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…
业内为数不多致力于极致体验的超强全自研跨平台(windows/linux/android/iOS)流媒体内核,通过模块化自由组合,支持实时RTMP推流、RTSP推流、RTMP播放器、RTSP播放器、录像、多路流媒体转发、音视频导播、动态视频合成、音频混音、直播互动、内置轻量级RTSP服务等,比快更快,业界真正靠谱的超低延迟直播SDK(1秒内,低延迟模式下150~300ms)。
Cross-platform, customizable ML solutions for live and streaming media.
Universal Office Converter - Convert between any document format supported by LibreOffice/OpenOffice.