Stars
WebUI extension for ControlNet
[ICCV 2023] Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation
Massive open Japanese speech corpus
[CVPR 2022] Code for "Learning Hierarchical Cross-Modal Association for Co-Speech Gesture Generation"
PantoMatrix: Co-Speech Talking Head and Gestures Generation
An unofficial PyTorch implementation of the audio LM VALL-E
Repository for researching and sharing machine learning articles
Official implementation for "InfoGCN: Representation Learning for Human Skeleton-Based Action Recognition"
r9y9 / jvs_r9y9
Forked from Hiroshiba/jvs_hiho
Self-made labels for the JVS (Japanese versatile speech) corpus
PyTorch deep learning projects made easy.
ptrblck / apex
Forked from NVIDIA/apex
A PyTorch Extension: Tools for easy mixed precision and distributed training in PyTorch
Headless chrome/chromium automation library (unofficial port of puppeteer)
A curated list of resources dedicated to Python libraries, LLMs, dictionaries, and corpora of NLP for Japanese
Speech Segmentation Toolkit using Julius
PyTorch Implementation of DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs
Convert .lab files to .TextGrid files, which can be used in Praat
Speech Segmentation Toolkit using Julius
The code for CVPR21 paper "Deep Animation Video Interpolation in the Wild"
Anime Face Detector using mmdet and mmpose
WACV2022: Transfer Learning for Pose Estimation of Illustrated Characters
Implementation of paper - YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors
CNN-based audio segmentation toolkit. Detects speech, music, noise, and speaker gender. Designed for large-scale gender equality studies based on speech time per gender.