TEN Agent is a conversational AI powered by TEN, integrating Gemini 2.0 Live, OpenAI Realtime, RTC, and more. It delivers real-time capabilities to see, hear, and speak, while being fully compa…

Python 3,874 392 Updated Jan 3, 2025

langflow-ai / langflow

Langflow is a low-code app builder for RAG and multi-agent AI applications. It’s Python-based and agnostic to any model, API, or database.

Python 41,971 4,677 Updated Jan 5, 2025

Byaidu / PDFMathTranslate

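PDF scientific paper translation with preserved formats: AI-based full-text bilingual translation of PDF documents that fully preserves layout, supporting services such as Google, DeepL, Ollama, and OpenAI, and offering CLI, GUI, and Docker options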

Python 13,696 994 Updated Jan 4, 2025

modelscope / ClearerVoice-Studio

An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.

Python 1,923 135 Updated Dec 31, 2024

fishaudio / fish-speech

SOTA Open Source TTS

Python 18,092 1,358 Updated Jan 4, 2025

SYSTRAN / faster-whisper

Faster Whisper transcription with CTranslate2

Python 13,267 1,114 Updated Jan 1, 2025

OpenOCR: A general OCR system with accuracy and efficiency. It supports 24 scene text recognition methods trained from scratch on large-scale real datasets, and the latest methods will continue to be added.

Python 404 33 Updated Jan 3, 2025

VikParuchuri / marker

Convert PDF to markdown + JSON quickly with high accuracy

Python 19,041 1,117 Updated Jan 5, 2025

Huanshere / VideoLingo

Netflix-level subtitle cutting, translation, alignment, and even dubbing: a one-click, fully automated AI video subtitle team

Python 8,959 874 Updated Jan 5, 2025

QuivrHQ / MegaParse

File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.

Python 4,855 239 Updated Jan 5, 2025

jianchang512 / stt

Voice Recognition to Text Tool / A local audio and video to subtitle tool that runs offline and outputs JSON, SRT subtitles, and plain text

Python 2,777 302 Updated Dec 5, 2024

jianchang512 / pyvideotrans

Translate a video from one language to another and add dubbing. Also supports speech recognition transcription, speech synthesis, and subtitle translation.

Python 11,401 1,274 Updated Dec 28, 2024

WEIFENG2333 / AsrTools

Python 1,558 136 Updated Nov 13, 2024

WEIFENG2333 / VideoCaptioner

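🎬 Kaka Subtitle Assistant | VideoCaptioner - An LLM-based intelligent subtitle assistant that produces high-quality subtitled videos in one click, no GPU required. It covers the full workflow of subtitle generation, sentence segmentation, correction, and subtitle translation, making subtitle production simple and efficient.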

Python 2,734 246 Updated Dec 27, 2024

infiniflow / ragflow

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

Python 27,173 2,593 Updated Jan 3, 2025

DS4SD / docling

Get your documents ready for gen AI

Python 17,372 903 Updated Jan 3, 2025

camelot-dev / camelot

A Python library to extract tabular data from PDFs

Python 3,084 476 Updated Jan 3, 2025

huridocs / pdf-document-layout-analysis

A Docker-powered service for PDF document layout analysis. This service provides a powerful and flexible PDF analysis service. The service allows for the segmentation and classification of differen…

Python 219 24 Updated Dec 28, 2024

VikParuchuri / tabled

Detect and extract tables to markdown and csv

Python 703 44 Updated Dec 12, 2024

opendatalab / PDF-Extract-Kit

A Comprehensive Toolkit for High-Quality PDF Content Extraction

Python 6,265 411 Updated Jan 3, 2025

opendatalab / MinerU

A high-quality tool for converting PDF to Markdown and JSON. A one-stop, open-source, high-quality data extraction tool that converts PDF to Markdown and JSON formats.

Python 23,155 1,677 Updated Jan 3, 2025

deepdoctection / deepdoctection

A Repo For Document AI

Python 2,645 145 Updated Dec 20, 2024

RapidAI / TableStructureRec

Collects the best currently open-source table recognition models, improves pre-processing and post-processing, and converts the models to ONNX.

Python 426 41 Updated Dec 15, 2024

mina000 MinaRoss

Stars

whotto / Video_note_generator

bytedance / LatentSync

521xueweihan / HelloGitHub

lipku / LiveTalking

getomni-ai / zerox

phidatahq / phidata

microsoft / markitdown

TEN-framework / TEN-Agent