GitHub - MengyuanChen21/CVPR2023-CMPAE: [CVPR 2023] Collecting Cross-Modal Presence-Absence Evidence for Weakly-Supervised Audio-Visual Event Perception

Collecting Cross-Modal Presence-Absence Evidence for Weakly-Supervised Audio-Visual Event Perception

Code for the CVPR 2023 paper "Collecting Cross-Modal Presence-Absence Evidence for Weakly-Supervised Audio-Visual Event Perception".

Paper Overview

Weakly-supervised Audio-Visual Video Parsing

Overview of CMPAE

**Typo**: In the framework figure of the paper, the names of the "Absence/Presence Evidence Collecter" were labeled incorrectly. Here's the correct version. We apologize for the error.

Get Started

Dependencies

We used the following environment and dependencies:

GPU: GeForce RTX 3090
Python: 3.8.6
PyTorch: 1.12.1
Other: Pandas, Openpyxl, Wandb (optional)
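
A quick way to confirm that the packages listed above are importable before running anything is sketched below. This is only an illustrative helper, not part of the repository: the function name `check_environment` is hypothetical, and the import names (`torch`, `pandas`, `openpyxl`, `wandb`) are assumed to be the usual ones for these packages. It checks presence only, not versions.

```python
import importlib.util
import sys

# Import names assumed from the dependency list above; wandb is optional.
REQUIRED = ["torch", "pandas", "openpyxl"]
OPTIONAL = ["wandb"]


def check_environment(required=REQUIRED, optional=OPTIONAL):
    """Return a dict mapping each package name to True if it can be imported."""
    status = {}
    for name in list(required) + list(optional):
        # find_spec returns None when a top-level module cannot be located.
        status[name] = importlib.util.find_spec(name) is not None
    return status


if __name__ == "__main__":
    print("Python", ".".join(map(str, sys.version_info[:3])))
    for name, found in check_environment().items():
        tag = "optional" if name in OPTIONAL else "required"
        print(f"{name:10s} ({tag}): {'found' if found else 'MISSING'}")
```

A missing optional package such as `wandb` should not prevent training, since the list above marks it as optional.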

Prepare data

Please download the preprocessed audio and visual features from https://github.com/YapengTian/AVVP-ECCV20.
Put the downloaded features into data/feats/, and put the annotation files into data/annotations/.
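
The expected layout can be sanity-checked with a small script such as the one below. It is a sketch under the assumption that it is run from the repository root; the helper name `check_data_layout` is hypothetical, and it only verifies that the two directories named above exist and counts their entries, without assuming any particular file or sub-folder names.

```python
from pathlib import Path

# Directories named in the instructions above, relative to the repository root.
EXPECTED_DIRS = ("data/feats", "data/annotations")


def check_data_layout(root="."):
    """Map each expected directory to its entry count, or None if it is missing."""
    root = Path(root)
    report = {}
    for rel in EXPECTED_DIRS:
        directory = root / rel
        if directory.is_dir():
            report[rel] = sum(1 for _ in directory.iterdir())
        else:
            report[rel] = None
    return report


if __name__ == "__main__":
    for rel, count in check_data_layout().items():
        if count is None:
            print(f"{rel}: missing")
        elif count == 0:
            print(f"{rel}: present but empty")
        else:
            print(f"{rel}: {count} entries")
```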

Train your own models

Run ./train.sh.

Test the pre-trained model

Download the checkpoint file from Google Drive, and put it into save/pretrained/. Then run ./test.sh.
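
If you prefer to guard against a missing checkpoint before launching the test script, a wrapper along the following lines may help. This is a sketch only: `find_checkpoints` and `run_test` are hypothetical helpers, the checkpoint file name is not assumed (any file in save/pretrained/ counts), and `test.sh` is assumed to be a bash script run from the repository root.

```python
import subprocess
from pathlib import Path


def find_checkpoints(repo_root="."):
    """List regular files found under save/pretrained/ (any file name)."""
    ckpt_dir = Path(repo_root) / "save" / "pretrained"
    if not ckpt_dir.is_dir():
        return []
    return sorted(p for p in ckpt_dir.iterdir() if p.is_file())


def run_test(repo_root="."):
    """Run test.sh from the repository root, refusing to start without a checkpoint."""
    if not find_checkpoints(repo_root):
        raise FileNotFoundError("no checkpoint file in save/pretrained/")
    # check=True raises CalledProcessError if the script exits with a non-zero status.
    return subprocess.run(["bash", "test.sh"], cwd=repo_root, check=True)
```

Calling `run_test()` from the repository root then behaves like `./test.sh`, except that it fails early with a clear error when save/pretrained/ is empty.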

Citation

If you find the code useful in your research, please consider citing it:

@inproceedings{junyu2023CVPR_CMPAE,
  author = {Gao, Junyu and Chen, Mengyuan and Xu, Changsheng},
  title = {Collecting Cross-Modal Presence-Absence Evidence for Weakly-Supervised Audio-Visual Event Perception},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
  year = {2023}
}

License

See MIT License

Acknowledgement

This repo contains modified code from:

JoMoLD: for the implementation of the JoMoLD backbone (ECCV 2022).

We sincerely thank the owners of the great repos!

