Stars
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), ga…
PyTorch Tutorial for Deep Learning Researchers
Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model
Accurate 3D Face Reconstruction with Weakly-Supervised Learning: From Single Image to Image Set (CVPRW 2019)
Speed up Stable Diffusion with this one simple trick!
PyTorch implementations of a large number of classical backbone CNNs, data augmentation, torch loss functions, attention, visualization, and some common algorithms.
【ICLR 2024🔥】 Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment
A Lite BERT for Self-Supervised Learning of Language Representations
Official repository of Agent Attention (ECCV2024)
Efficient Contextualized Representation: Language Model Pruning for Sequence Labeling
Official PyTorch code for the AAAI'23 paper AudioEar: Single-View Ear Reconstruction for Personalized Spatial Audio
🎈 Problem 3 of the 2020-2021 ASC Student Supercomputer Challenge: an NLP cloze (fill-in-the-blank) task solved with the ALBERT model
Action2Sound: Ambient-Aware Generation of Action Sounds from Egocentric Videos
SyncFusion: Multimodal Onset-synchronized Video-to-Audio Foley Synthesis