Stars
Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
TextAttack 🐙 is a Python framework for adversarial attacks, data augmentation, and model training in NLP https://textattack.readthedocs.io/en/master/
[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
A unified evaluation framework for large language models
GPT4V-level open-source multi-modal model based on Llama3-8B
The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.
心理健康大模型、LLM、The Big Model of Mental Health、Finetune、InternLM2、InternLM2.5、Qwen、ChatGLM、Baichuan、DeepSeek、Mixtral、LLama3、GLM4、Qwen2、LLama3.1
SALMONN: Speech Audio Language Music Open Neural Network
MELD: A Multimodal Multi-Party Dataset for Emotion Recognition in Conversation
[ACL'19] [PyTorch] Multimodal Transformer
MMSA is a unified framework for Multimodal Sentiment Analysis.
Attention-based multimodal fusion for sentiment analysis
Data and codes for ACL 2021 paper: Towards Emotional Support Dialog Systems
MISA: Modality-Invariant and -Specific Representations for Multimodal Sentiment Analysis
Emotion-LLaMA: Multimodal Emotion Recognition and Reasoning with Instruction Tuning
Toolkits for Multimodal Emotion Recognition
A Transformer Framework Based Translation Task
Explainable Multimodal Emotion Reasoning (EMER) and AffectGPT
Multilingual Multitask Multipurpose Medical Speech Recognition
[NAACL 2024] Data and code for our paper "Sentiment Analysis in the Era of Large Language Models: A Reality Check"
This is the official implementation of 2024 CVPR paper "EmoGen: Emotional Image Content Generation with Text-to-Image Diffusion Models".
[ACL 2024 Main] Official PyTorch implementation of the paper "Multimodal Prompt Learning with Missing Modalities for Sentiment Analysis and Emotion Recognition"
Make Acoustic and Visual Cues Matter: CH-SIMS v2.0 Dataset and AV-Mixup Consistent Module