ML
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
[CVPR 2024] code release for "DiffusionLight: Light Probes for Free by Painting a Chrome Ball"
Neural Factorization of Shape and Reflectance Under an Unknown Illumination
OCR bill detection is a python program that can detect the type of your household bills. for example, if you have samples of electricity bills, movie bills, or grocery bills, this python program de…
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
Streamlined interface for generating images with AI in Krita. Inpaint and outpaint with optional text prompt, no tweaking required.
Yet another voice assistant, but alive.
Zero-Shot Speech Editing and Text-to-Speech in the Wild
21 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/
A neural-network-based generative model for video-game characters animations
A local chatbot fine-tuned by bilibili user comments.
A simple toy demo of a local voice assistant with whisper and large language model.
A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/autogen-officehour
Build multi-modal Agents with memory, knowledge, tools and reasoning. Chat with them using a beautiful Agent UI.
Awesome 3D Stylization - Advances in 3D Neural Stylization
[NeurIPS 2023] MotionGPT: Human Motion as a Foreign Language, a unified motion-language generation model using LLMs
proof of concept of machine learning rivet node
An end-to-end library for editing and rendering motion of 3D characters with deep learning [SIGGRAPH 2020]
This repository provides motion datasets collected by Bandai Namco Research Inc
Cross-platform, customizable ML solutions for live and streaming media.
Ingest, parse, and optimize any data format ➡️ from documents to multimedia ➡️ for enhanced compatibility with GenAI frameworks
Markerless kinematics with any cameras — From 2D Pose estimation to 3D OpenSim motion