Stars
WebUI extension for ControlNet
[ICCV 2023] Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation
Massive open Japanese speech corpus
[CVPR 2022] Code for "Learning Hierarchical Cross-Modal Association for Co-Speech Gesture Generation"
PantoMatrix: Co-Speech Talking Head and Gestures Generation
An unofficial PyTorch implementation of the audio LM VALL-E
Repository for researching and sharing machine learning articles
Official implementation for "InfoGCN: Representation Learning for Human Skeleton-Based Action Recognition"
PyTorch deep learning projects made easy.
Headless Chrome/Chromium automation library (unofficial port of Puppeteer)
A curated list of resources dedicated to Python libraries, LLMs, dictionaries, and corpora of NLP for Japanese
PyTorch Implementation of DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs
Convert .lab files to .TextGrid files, which can be used in Praat
Speech Segmentation Toolkit using Julius
The code for CVPR21 paper "Deep Animation Video Interpolation in the Wild"
Anime Face Detector using mmdet and mmpose
WACV2022: Transfer Learning for Pose Estimation of Illustrated Characters
Implementation of paper - YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors
CNN-based audio segmentation toolkit. Detects speech, music, noise, and speaker gender. Designed for large-scale gender equality studies based on speech time per gender.
IJCAI2023 - Collaborative Neural Rendering using Anime Character Sheets
Real-time Web Dashboard for Optuna.
Official Implementation for "Only a Matter of Style: Age Transformation Using a Style-Based Regression Model" (SIGGRAPH 2021) https://arxiv.org/abs/2102.02754