Skip to content

tian1327/zero-shot-video-to-text

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

16 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Approach

Example of capabilities

Dependencies

conda install pytorch==2.0.0 torchvision==0.15.0 torchaudio==2.0.0 pytorch-cuda=11.8 -c pytorch -c nvidia
pip3 install clip-by-openai
pip install chardet

For ActivityNet data

bash get_captions.sh
python run.py --token_wise --randomized_prompt --run_type caption_videos --data_path ../data/ActivityNet_200/validation/Mixing_drinks/yjazHd6a5SQ.mp4 --start_sec 0.0 --end_sec 17.87
python run.py --token_wise --randomized_prompt --run_type caption_videos --data_path examples/example_video.mp4

Usage

To run captioning on a video:

python run.py --token_wise --randomized_prompt --run_type caption_videos --data_path examples/example_video.mp4

To run captioning on a single image:

python run.py --token_wise --randomized_prompt --run_type caption_images --data_path examples/example_image.jpg

Citation

Please cite our work if you use it in your research:

@article{tewel2022videocap,
  title={Zero-Shot Video Captioning with Evolving Pseudo-Tokens},
  author={Tewel, Yoad and Shalev, Yoav and Nadler, Roy and Schwartz, Idan and Wolf, Lior},
  journal={arXiv preprint arXiv:2207.11100},
  year={2022}
}

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 87.9%
  • Shell 12.1%