Stars
LLM-MultiModel
4 repositories
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Create Music in Seconds with SunoAPI. Contact me , if you need suno api. 👇
[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
Emu Series: Generative Multimodal Models from BAAI