PyTorch implementation of "Dual Transformer Decoder based Features Fusion Network for Automated Audio Captioning"
- Create a conda environment with the dependencies: conda env create -f environment.yml -n <env_name>
- Run data_prep.py to prepare the h5py files
- Run coco_caption/get_stanford_models.sh to download the Stanford models required for computing the evaluation metrics.
- Set the desired parameters in settings/settings.yaml
- Run training: python train.py -n exp_name
- For reinforcement-learning finetuning, set the parameters in the rl block of settings/settings.yaml
- Run: python finetune_rl.py -n exp_name
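Taken together, the steps above form the following pipeline. This is a sketch using only the commands documented here; the environment name dtd_aac and the experiment name my_exp are placeholders, not names fixed by the repository.

```shell
# One-time setup: environment, data, and evaluation models
conda env create -f environment.yml -n dtd_aac   # dtd_aac is a placeholder name
conda activate dtd_aac
python data_prep.py                              # prepares the h5py files
bash coco_caption/get_stanford_models.sh         # models for metric evaluation

# Cross-entropy training (reads settings/settings.yaml)
python train.py -n my_exp

# Reinforcement-learning finetuning (reads the rl block in settings/settings.yaml)
python finetune_rl.py -n my_exp
```

Both training scripts take the same -n experiment name, so the finetuning stage can pick up where cross-entropy training left off.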
@inproceedings{sun2023dual,
  title={Dual Transformer Decoder based Features Fusion Network for Automated Audio Captioning},
  author={Jianyuan Sun and Xubo Liu and Xinhao Mei and Volkan Kılıç and Mark D. Plumbley and Wenwu Wang},
  booktitle={INTERSPEECH 2023},
  year={2023}
}