Stars
This repository contains various models targetting multimodal representation learning, multimodal fusion for downstream tasks such as multimodal sentiment analysis.
图深度学习(葡萄书),在线阅读地址: https://datawhalechina.github.io/grape-book
Official inference library for Mistral models
A Pytorch implementation of Sparsely-Gated Mixture of Experts, for massively increasing the parameter count of language models
A latent text-to-image diffusion model
High-Resolution Image Synthesis with Latent Diffusion Models
Implementation of Imagen, Google's Text-to-Image Neural Network, in Pytorch
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch
🏄 Scalable embedding, reasoning, ranking for images and sentences with CLIP
Represent, send, store and search multimodal data
☁️ Build multimodal AI applications with cloud-native stack
A Github repository about micro-expression recognition, micro-expression detection, and micro-expression analysis
OpenFace – a state-of-the art tool intended for facial landmark detection, head pose estimation, facial action unit recognition, and eye-gaze estimation.
Source code for "A Deep Learning based Light-weight Face Mask Detector to Fight Against COVID-19"
Occlusion aware facial expression recognition using CNN with attention mechanism
ICPR 2020: Facial Expression Recognition using Residual Masking Network
Code for the paper "Fusing Body Posture with Facial Expressions for Joint Recognition of Affect in Child-Robot Interaction"