Lists (1)
Sort Name ascending (A-Z)
Stars
A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。
Code for the SIGIR'23 paper "Unsupervised Dense Retrieval Training with Web Anchors"
Graph-structured Indices for Scalable, Fast, Fresh and Filtered Approximate Nearest Neighbor Search
Building a quick conversation-based search demo with Lepton AI.
Official repository for RAGViz: Diagnose and Visualize Retrieval-Augmented Generation [EMNLP 2024]
An open-source RAG-based tool for chatting with your documents.
Colab for making Wav2Lip high quality and easy to use
A generative speech model for daily dialogue.
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
Digital Avatar Conversational System - Linly-Talker. 😄✨ Linly-Talker is an intelligent AI system that combines large language models (LLMs) with visual models to create a novel human-AI interaction…
CVPR2023 talking face implementation for Identity-Preserving Talking Face Generation With Landmark and Appearance Priors
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs
Digital Human Resource Collection: 2D/3D/4D human modeling, avatar generation & animation, clothed people digitalization, virtual try-on, and others.
Baichuan-Omni: Towards Capable Open-source Omni-modal LLM 🌊
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising
MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting
Netflix-level subtitle cutting, translation, alignment, and even dubbing - one-click fully automated AI video subtitle team | Netflix级字幕切割、翻译、对齐、甚至加上配音,一键全自动视频搬运AI字幕组
Vision-Augmented Retrieval and Generation (VARAG) - Vision first RAG Engine
Industry leading face manipulation platform
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
🤯 Lobe Chat - an open-source, modern-design AI chat framework. Supports Multi AI Providers( OpenAI / Claude 3 / Gemini / Ollama / Qwen / DeepSeek), Knowledge Base (file upload / knowledge managemen…
A deep learning based model to judge the AQ, Appearance Quotient, of faces. (For Chinese Young Girls Only)
本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)
Backtesting for sleepless cryptocurrency markets
🔎 📈 🐍 💰 Backtest trading strategies in Python.
Headline Sentiment Analysis Backtester. Backtests trading strategy from ai-trading-prototype trading bot.
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.