Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
code		code
pretrains		pretrains
.DS_Store		.DS_Store
.gitignore		.gitignore
README.md		README.md

Repository files navigation

Seeing Beyond the Brain: Masked Modeling Conditioned Diffusion Model for Human Vision Decoding

Overview

Requirements

A suitable conda environment named mind_vis can be created and activated with:

conda env create -f environment.yml conda activate mind_vis

Decoding visual stimuli from brain recordings aims to deepen our understanding of the human visual system and build a solid foundation for bridging human and computer vision through the Brain-Computer Interface. However, due to the scarcity of data annotations and the complexity of underlying brain information, it is challenging to decode images with faithful details and meaningful semantics. In this work, we present MinD-Vis: Sparse Masked Brain Modeling with Double-Conditioned Latent Diffusion Model for Human Vision Decoding. Specifically, by boosting the information capacity of feature representations learned from a large-scale resting-state fMRI dataset, we show that our MinD-Vis can reconstruct highly plausible images with semantically matching details from brain recordings with very few paired annotations. We benchmarked our model qualitatively and quantitatively; the experimental results indicate that our method outperformed state-of-the-art in both semantic mapping (100-way semantic classification) and generation quality (FID) by 66% and 41% respectively. Exhaustive ablation studies are conducted to analyze our framework.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Seeing Beyond the Brain: Masked Modeling Conditioned Diffusion Model for Human Vision Decoding

Overview

Requirements

MinD-Vis Framework

About

Releases

Packages

Languages

License

celsopitta/mind-vis

Folders and files

Latest commit

History

Repository files navigation

Seeing Beyond the Brain: Masked Modeling Conditioned Diffusion Model for Human Vision Decoding

Overview

Requirements

MinD-Vis Framework

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages