Stars
Comprehensive Gradio WebUI for audio processing, powered by Whisper engines (Whisper, Faster-Whisper, Whisper-Timestamped). Features Voice Changer, zero-shot Voice Cloning (E2, F5-TTS), YouTube dow…
Faster Whisper transcription with CTranslate2
Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!
Improved AnimateDiff for ComfyUI and Advanced Sampling Support
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
Various AI scripts. Mostly Stable Diffusion stuff.
Open-Sora: Democratizing Efficient Video Production for All
Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.
Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
TensorRT Extension for Stable Diffusion Web UI
The suite of modeling video with Mamba
Use ChatGPT to summarize the arXiv papers. 全流程加速科研,利用chatgpt进行论文全文总结+专业翻译+润色+审稿+审稿回复
Concurrently chat with ChatGPT, Bing Chat, Bard, Alpaca, Vicuna, Claude, ChatGLM, MOSS, 讯飞星火, 文心一言 and more, discover the best answers
为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, m…