  • XI AN

56 stars written in Python

Open-source code for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Python 25,564 2,925 Updated Sep 2, 2024

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 21,406 2,355 Updated Aug 12, 2024

TextAttack 🐙 is a Python framework for adversarial attacks, data augmentation, and model training in NLP https://textattack.readthedocs.io/en/master/

Python 3,060 407 Updated Jul 25, 2024

[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding

Python 2,914 267 Updated Jun 4, 2024

A unified evaluation framework for large language models

Python 2,528 188 Updated Feb 11, 2025

GPT4V-level open-source multi-modal model based on Llama3-8B

Python 2,249 148 Updated Sep 3, 2024

The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.

Python 1,481 116 Updated Aug 13, 2024

Mental health large language model (LLM), The Big Model of Mental Health, Finetune, InternLM2, InternLM2.5, Qwen, ChatGLM, Baichuan, DeepSeek, Mixtral, LLama3, GLM4, Qwen2, LLama3.1

Python 1,164 153 Updated Jan 16, 2025

SALMONN: Speech Audio Language Music Open Neural Network

Python 1,143 89 Updated Dec 12, 2024

MELD: A Multimodal Multi-Party Dataset for Emotion Recognition in Conversation

Python 861 209 Updated Mar 10, 2024

[ACL'19] [PyTorch] Multimodal Transformer

Python 853 152 Updated Sep 12, 2022

MMSA is a unified framework for Multimodal Sentiment Analysis.

Python 731 114 Updated Jan 15, 2025

Code for RoboFlamingo

Python 341 28 Updated May 8, 2024

Attention-based multimodal fusion for sentiment analysis

Python 334 74 Updated Apr 8, 2024

Data and codes for ACL 2021 paper: Towards Emotional Support Dialog Systems

Python 248 34 Updated Jun 19, 2024

MISA: Modality-Invariant and -Specific Representations for Multimodal Sentiment Analysis

Python 216 33 Updated Mar 14, 2023

Emotion-LLaMA: Multimodal Emotion Recognition and Reasoning with Instruction Tuning

Python 198 17 Updated Feb 12, 2025

Toolkits for Multimodal Emotion Recognition

Python 182 16 Updated May 26, 2024

A Transformer Framework Based Translation Task

Python 143 38 Updated Jul 21, 2024

The official implementation of InstructERC

Python 129 8 Updated Dec 18, 2024

Explainable Multimodal Emotion Reasoning (EMER) and AffectGPT

Python 128 8 Updated Apr 26, 2024
Python 117 10 Updated Apr 5, 2023

Multilingual Multitask Multipurpose Medical Speech Recognition

Python 94 15 Updated Nov 9, 2024

[NAACL 2024] Data and code for our paper "Sentiment Analysis in the Era of Large Language Models: A Reality Check"

Python 88 15 Updated May 30, 2023
Python 86 4 Updated Jul 8, 2024
Python 72 4 Updated Feb 8, 2025

This is the official implementation of 2024 CVPR paper "EmoGen: Emotional Image Content Generation with Text-to-Image Diffusion Models".

Python 70 7 Updated Jan 15, 2025

[ACL 2024 Main] Official PyTorch implementation of the paper "Multimodal Prompt Learning with Missing Modalities for Sentiment Analysis and Emotion Recognition"

Python 67 5 Updated Dec 13, 2024

Make Acoustic and Visual Cues Matter: CH-SIMS v2.0 Dataset and AV-Mixup Consistent Module

Python 66 9 Updated Sep 8, 2022
Python 59 8 Updated Dec 4, 2024