Skip to content
View MinaRoss's full-sized avatar

Block or report MinaRoss

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

一键将视频转换为优质小红书笔记,自动优化内容和配图

Python 864 100 Updated Dec 13, 2024

Taming Stable Diffusion for Lip Sync!

Python 619 48 Updated Jan 4, 2025

:octocat: 分享 GitHub 上有趣、入门级的开源项目。Share interesting, entry-level open source projects on GitHub.

Python 97,105 9,761 Updated Dec 27, 2024

Real time interactive streaming digital human

Python 4,245 617 Updated Jan 1, 2025

PDF to Markdown with vision models

Python 7,755 462 Updated Dec 18, 2024

Build multi-modal Agents with memory, knowledge, tools and reasoning. Chat with them using a beautiful Agent UI.

Python 17,412 2,356 Updated Jan 5, 2025

Python tool for converting files and office documents to Markdown.

Python 32,022 1,324 Updated Jan 4, 2025

TEN Agent is a conversational AI powered by the TEN, integrating Gemini 2.0 Live, OpenAI Realtime, RTC, and more. It delivers real-time capabilities to see, hear, and speak, while being fully compa…

Python 3,874 392 Updated Jan 3, 2025

Langflow is a low-code app builder for RAG and multi-agent AI applications. It’s Python-based and agnostic to any model, API, or database.

Python 41,971 4,677 Updated Jan 5, 2025

PDF scientific paper translation with preserved formats - 基于 AI 完整保留排版的 PDF 文档全文双语翻译,支持 Google/DeepL/Ollama/OpenAI 等服务,提供 CLI/GUI/Docker

Python 13,696 994 Updated Jan 4, 2025

An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Speaker Extraction, etc.

Python 1,923 135 Updated Dec 31, 2024

SOTA Open Source TTS

Python 18,092 1,358 Updated Jan 4, 2025

Faster Whisper transcription with CTranslate2

Python 13,267 1,114 Updated Jan 1, 2025

OpenOCR: A general OCR system with accuracy and efficiency. Supporting 24 Scene Text Recognition methods trained from scratch on large-scale real datasets, and will continue to add the latest methods.

Python 404 33 Updated Jan 3, 2025

Convert PDF to markdown + JSON quickly with high accuracy

Python 19,041 1,117 Updated Jan 5, 2025

Netflix-level subtitle cutting, translation, alignment, and even dubbing - one-click fully automated AI video subtitle team | Netflix级字幕切割、翻译、对齐、甚至加上配音,一键全自动视频搬运AI字幕组

Python 8,959 874 Updated Jan 5, 2025

File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.

Python 4,855 239 Updated Jan 5, 2025

Voice Recognition to Text Tool / 一个离线运行的本地音视频转字幕工具,输出json、srt字幕、纯文字格式

Python 2,777 302 Updated Dec 5, 2024

Translate the video from one language to another and add dubbing. 将视频从一种语言翻译为另一种语言,同时支持语音识别转录、语音合成、字幕翻译。

Python 11,401 1,274 Updated Dec 28, 2024

✨ AsrTools: 智能语音转文字工具 | 高效批处理 | 用户友好界面 | 无需 GPU |支持 SRT/TXT 输出 | 让您的音频瞬间变成精确文字!

Python 1,558 136 Updated Nov 13, 2024

🎬 卡卡字幕助手 | VideoCaptioner - 基于 LLM 的智能字幕助手,无需GPU一键高质量字幕视频合成!视频字幕生成、断句、校正、字幕翻译全流程。让字幕制作简单高效!

Python 2,734 246 Updated Dec 27, 2024

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

Python 27,173 2,593 Updated Jan 3, 2025

Get your documents ready for gen AI

Python 17,372 903 Updated Jan 3, 2025

A Python library to extract tabular data from PDFs

Python 3,084 476 Updated Jan 3, 2025

A Docker-powered service for PDF document layout analysis. This service provides a powerful and flexible PDF analysis service. The service allows for the segmentation and classification of differen…

Python 219 24 Updated Dec 28, 2024

Detect and extract tables to markdown and csv

Python 703 44 Updated Dec 12, 2024

A Comprehensive Toolkit for High-Quality PDF Content Extraction

Python 6,265 411 Updated Jan 3, 2025

A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。

Python 23,155 1,677 Updated Jan 3, 2025

A Repo For Document AI

Python 2,645 145 Updated Dec 20, 2024

整理目前开源的最优表格识别模型,完善前后处理,模型转换为ONNX Organize the currently open-source optimal table recognition models, improve pre-processing and post-processing, and convert the models to ONNX.

Python 426 41 Updated Dec 15, 2024
Next