Stars
[ACL 2024 Main] Official PyTorch implementation of the paper "Multimodal Prompt Learning with Missing Modalities for Sentiment Analysis and Emotion Recognition"
PaSa -- an advanced paper search agent powered by large language models. It can autonomously make a series of decisions, including invoking search tools, reading papers, and selecting relevant refe…
Benchmarking LLMs' Emotional Alignment with Humans
This is a repository for sharing papers in the field of empathetic conversational AI. The related source code for each paper is linked if available.
Diffprivlib: The IBM Differential Privacy Library
Speech emotion recognition implemented in Keras (LSTM, CNN, SVM, MLP)
This repo contains implementations of different architectures for emotion recognition in conversations.
Bark Voice Cloning and Voice Cloning for Chinese Speech
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
A comprehensive collection of KAN(Kolmogorov-Arnold Network)-related resources, including libraries, projects, tutorials, papers, and more, for researchers and developers in the Kolmogorov-Arnold N…
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
😎 Awesome list of tools and projects with the awesome LangChain framework
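A practical interactive interface for GPT/GLM and other large language models, specially optimized for reading, polishing, and writing papers. Modular design with support for custom shortcut buttons and function plugins, project analysis and self-explanation for Python, C++, and other languages, PDF/LaTeX paper translation and summarization, parallel queries to multiple LLMs, and local models such as chatglm3. Integrates Tongyi Qianwen, deepseekcoder, iFlytek Spark, ERNIE Bot, llama2, rwkv, claude2, m…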
Instant voice cloning by MIT and MyShell. Audio foundation model.
Reading list for research topics in multimodal machine learning
Open-source code for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Software in C and data files for the popular GloVe model for distributed word representations, a.k.a. word vectors or embeddings