Lists (16)
Sort Name ascending (A-Z)
Starred repositories
Build AI assistants that interact with your systems
Implementing Ollama and Agents to create a blogging bot
A Chrome extension that uses the magic of OpenAI's chat and image models to ensure a seamless ChatGPT-Like experience - all without ever having to leave your favorite website.
AI Device Template Featuring Whisper, TTS, Groq, Llama3, OpenAI and more
Advanced AI email assistant using Groq for responsive replies, Llama for contextual information retrieval, and RAG with LangChain for enhanced accuracy.
Infinite Bookshelf: Generate entire books in seconds using Groq and Llama3
Chat with your PDF files for free, using Langchain, Groq, ChromaDB, and Jina AI embeddings.
Latests Langchain (2024) App to chat with any website given its URL
This is a Phi-3 book for getting started with Phi-3. Phi-3, a family of open sourced AI models developed by Microsoft. Phi-3 models are the most capable and cost-effective small language models (SL…
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs
AllTalk is based on the Coqui TTS engine, similar to the Coqui_tts extension for Text generation webUI, however supports a variety of advanced features, such as a settings page, low VRAM support, D…
*CREPE+HYBRID TRAINING* A very experimental fork of the Retrieval-based-Voice-Conversion-WebUI repo that incorporates a variety of other f0 methods, along with a hybrid f0 nanmedian method.
Robust Speech Recognition via Large-Scale Weak Supervision
the first library to let you embed a developer agent in your own app!
Real-time microphone noise suppression on Linux.
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
🔊 Text-Prompted Generative Audio Model
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…
[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
A generative speech model for daily dialogue.
🎥 Create youtube videos from a text prompt in seconds
🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with LlamaIndex, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23
This is AI Chat build using Gemini API , Response with images and text and go to video you get Response from AI what it see on camera
Web app built with React,Typescript for creating resume easily with different templates and export to pdf easily.
Create your personalized portfolio website using Notion and deploy it with Next.js on Vercel. Showcase your work, projects, and achievements with ease.