Official Implementation of Zero-Shot Video Captioning with Evolving Pseudo-Tokens

Approach

Example of capabilities

Dependencies

conda install pytorch==2.0.0 torchvision==0.15.0 torchaudio==2.0.0 pytorch-cuda=11.8 -c pytorch -c nvidia

pip3 install clip-by-openai

pip install chardet

For ActivityNet data

bash get_captions.sh

python run.py --token_wise --randomized_prompt --run_type caption_videos --data_path ../data/ActivityNet_200/validation/Mixing_drinks/yjazHd6a5SQ.mp4 --start_sec 0.0 --end_sec 17.87

python run.py --token_wise --randomized_prompt --run_type caption_videos --data_path examples/example_video.mp4
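When captioning many ActivityNet segments, it can help to assemble the command above programmatically. The following is a minimal sketch and not part of the repository: `build_caption_command` is a hypothetical helper, and it uses only the flags that appear in the commands above (`--token_wise`, `--randomized_prompt`, `--run_type`, `--data_path`, `--start_sec`, `--end_sec`). It only builds the argument list; running it (for example with `subprocess.run` from the repository root) is left to the caller.

```python
from typing import List, Optional


def build_caption_command(
    video_path: str,
    start_sec: Optional[float] = None,
    end_sec: Optional[float] = None,
) -> List[str]:
    """Build the argument list for captioning one video (hypothetical helper)."""
    cmd = [
        "python", "run.py",
        "--token_wise", "--randomized_prompt",
        "--run_type", "caption_videos",
        "--data_path", video_path,
    ]
    # The example-video command omits the segment flags,
    # so they are only appended when a value is given.
    if start_sec is not None:
        cmd += ["--start_sec", str(start_sec)]
    if end_sec is not None:
        cmd += ["--end_sec", str(end_sec)]
    return cmd


if __name__ == "__main__":
    segment = build_caption_command(
        "../data/ActivityNet_200/validation/Mixing_drinks/yjazHd6a5SQ.mp4",
        start_sec=0.0,
        end_sec=17.87,
    )
    # Print the command as a single shell-style line for inspection.
    print(" ".join(segment))
```

Returning a list rather than a single string keeps each argument separate, which avoids shell-quoting problems if a path contains spaces.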

Usage

To run captioning on a video:

python run.py --token_wise --randomized_prompt --run_type caption_videos --data_path examples/example_video.mp4

To run captioning on a single image:

python run.py --token_wise --randomized_prompt --run_type caption_images --data_path examples/example_image.jpg
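The two commands above differ only in the `--run_type` value. As an illustration, the sketch below picks `caption_videos` or `caption_images` from the file extension and builds the matching argument list. The helper names (`run_type_for`, `build_command`) and the extension sets are assumptions made for this example; the repository does not state which file formats it supports, so only the extensions of the two bundled examples are listed.

```python
from pathlib import Path
from typing import List

# Assumed for illustration: only the extensions of the bundled examples.
VIDEO_EXTS = {".mp4"}
IMAGE_EXTS = {".jpg"}


def run_type_for(path: str) -> str:
    """Map a file path to the --run_type value used in the commands above."""
    suffix = Path(path).suffix.lower()
    if suffix in VIDEO_EXTS:
        return "caption_videos"
    if suffix in IMAGE_EXTS:
        return "caption_images"
    raise ValueError(f"no run type known for extension {suffix!r}")


def build_command(path: str) -> List[str]:
    """Build the argument list for captioning a single video or image."""
    return [
        "python", "run.py",
        "--token_wise", "--randomized_prompt",
        "--run_type", run_type_for(path),
        "--data_path", path,
    ]


if __name__ == "__main__":
    for example in ("examples/example_video.mp4", "examples/example_image.jpg"):
        print(" ".join(build_command(example)))
```

Raising on an unknown extension, rather than guessing a run type, keeps a mistyped path from silently starting the wrong captioning mode.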

Citation

Please cite our work if you use it in your research:

@article{tewel2022videocap,
  title={Zero-Shot Video Captioning with Evolving Pseudo-Tokens},
  author={Tewel, Yoad and Shalev, Yoav and Nadler, Roy and Schwartz, Idan and Wolf, Lior},
  journal={arXiv preprint arXiv:2207.11100},
  year={2022}
}

