
Track 1: Fréchet Inception Distance (FID) Optimization


Please see test_NTIRE23_Track_1_FID.py to evaluate our model.

We provide the colorized images HERE, and the reference images used to obtain the results HERE.

News

  • [2024-04-12] Our new project colormnet will be released soon, featuring faster inference, lower GPU memory requirements, better performance, and the ability to colorize hundreds or even thousands of grayscale frames from a single reference image.
  • [2024-04-12] Added training dataloader support (see the DeepExemplar branch) for the excellent work DeepExemplar, as suggested by @doolachen.
  • [2023-12-05] Integrated into 🐼 OpenXLab. Try out the online demo on OpenXLab.
  • [2023-12-05] Added inference code that uses two reference images; see test_BiSTNet.py.
  • [2023-12-02] A Colab demo for BiSTNet is now available.

💼 Dependencies and Installation

  • PyTorch >= 1.8.0 (please do not use 2.0.1)

  • CUDA >= 10.2

  • mmcv == 1.x

  • mmediting == 0.x

  • Other required packages

    # git clone this repository
    git clone https://github.com/yyang181/NTIRE23-VIDEO-COLORIZATION.git
    cd NTIRE23-VIDEO-COLORIZATION
    

Environment configuration:

cd BiSTNet-NTIRE2023

# create a new anaconda env
conda create -n bistnet python=3.6
conda activate bistnet

# install pytorch
conda install pytorch==1.10.1 torchvision==0.11.2 torchaudio==0.10.1 cudatoolkit=11.3 -c pytorch -c conda-forge

# mmcv install (1.x, please do not use 2.x)
pip install -U openmim
mim install mmcv-full

# install mmediting (0.x, please do not use 1.x)
git clone https://github.com/open-mmlab/mmediting.git
cd mmediting
pip3 install -e .

# install other pip packages
cd .. && pip install -r pip_requirements.txt
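
Optionally, verify the environment from Python. A minimal sanity check, assuming the packages above installed cleanly:

# check_env.py: print installed versions (expected values per the steps above)
import torch
import mmcv

print(torch.__version__)           # expect 1.10.1
print(torch.cuda.is_available())   # expect True with cudatoolkit 11.3
print(mmcv.__version__)            # expect a 1.x release, not 2.x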

🎁 Checkpoints

Name    | URL   | Script                      | FID     | CDC
BiSTNet | model | test_NTIRE23_Track_1_FID.py | 21.5372 | 0.001717
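
For context, FID measures the distance between Inception-v3 feature statistics of the colorized frames and the reference distribution. Below is a minimal sketch of the metric itself, not the challenge's official evaluation code; it assumes the (N, 2048) pooled Inception features have already been extracted:

import numpy as np
from scipy.linalg import sqrtm

def fid(real_feats, fake_feats):
    # fit a Gaussian to each feature set: mean vector and covariance matrix
    mu_r, mu_f = real_feats.mean(axis=0), fake_feats.mean(axis=0)
    cov_r = np.cov(real_feats, rowvar=False)
    cov_f = np.cov(fake_feats, rowvar=False)
    covmean = sqrtm(cov_r @ cov_f)
    if np.iscomplexobj(covmean):
        covmean = covmean.real  # drop numerical noise from the matrix square root
    return float(((mu_r - mu_f) ** 2).sum() + np.trace(cov_r + cov_f - 2.0 * covmean))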

⚡ Quick Inference for NTIRE 2023 Video Colorization Challenge

This version is specifically designed for the NTIRE 2023 Video Colorization Challenge @ CVPR 2023. Each 50-frame segment is colorized with two exemplars, one at each end; see clip 001 for an example and the small sketch below.
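
The pairing can be sketched as follows; the frame indices are illustrative and this snippet is not part of the repo:

refs = [1, 50, 100]  # indices of the color exemplar frames, e.g. f001.png, f050.png, f100.png
for start, end in zip(refs, refs[1:]):
    # each segment is colorized using the exemplars at its two endpoints
    print(f"frames {start:03d}-{end:03d}: exemplars f{start:03d}.png and f{end:03d}.png")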

  • Download Pre-trained Models: download the pretrained colorization model from the links in the table above and extract it into ./BiSTNet-NTIRE2023/, so that ./BiSTNet-NTIRE2023/checkpoints, ./BiSTNet-NTIRE2023/data and ./BiSTNet-NTIRE2023/models/protoseg_core/checkpoints exist.
  • Prepare Testing Data: put the testing images in a folder, like clip 001 at ./demo_dataset1 (a layout-check sketch follows after this section).
    • demo_dataset1/input: the directory of input grayscale images.
    • demo_dataset1/ref: the directory of reference images (only f001.png, f050.png and f100.png are color images).
    • demo_dataset1/output: the directory where the colorization results are saved.
  • Test on Images:
conda activate bistnet && cd BiSTNet-NTIRE2023
CUDA_VISIBLE_DEVICES=0 python test_NTIRE23_Track_1_FID.py

For more details please refer to test_NTIRE23_Track_1_FID.py.
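
A hypothetical helper (not part of the repo) to sanity-check the demo_dataset1 layout before running the script:

from pathlib import Path

root = Path("demo_dataset1")
for sub in ("input", "ref", "output"):
    d = root / sub
    d.mkdir(parents=True, exist_ok=True)  # output/ may not exist yet
    print(f"{sub}: {len(list(d.glob('*.png')))} png file(s)")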

⚡ Quick Inference for BiSTNet version

This version accepts an arbitrary number of input video frames with two exemplars. See clip fanghua234 for an example.

  • Prepare Testing Data: put the testing images in a folder, like fanghua234 at ./demo_dataset2 (an exemplar check follows after this section).

    • demo_dataset2/input: the directory of input grayscale images.

    • demo_dataset2/ref: the directory of reference images (frame0000.png and frame0150.png).

    • demo_dataset2/output: the directory where the colorization results are saved.

  • Test on Images:

conda activate bistnet && cd BiSTNet-NTIRE2023
CUDA_VISIBLE_DEVICES=0 python test_BiSTNet.py

For more details please refer to test_BiSTNet.py.
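
Assuming, as in the fanghua234 example, that the two exemplars correspond to the first and last frames of the clip, a hypothetical check (not part of the repo) that the ref folder matches the input frames:

from pathlib import Path

frames = sorted(Path("demo_dataset2/input").glob("*.png"))
refs = {p.name for p in Path("demo_dataset2/ref").glob("*.png")}
expected = {frames[0].name, frames[-1].name}  # e.g. frame0000.png and frame0150.png
print("exemplars OK" if refs == expected else f"expected {expected}, found {refs}")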

License

BiSTNet is released under the MIT license, while some methods adopted in this project carry other licenses. If you use our code for commercial purposes, please check LICENSES.md carefully. Thanks to @milmor for raising this licensing concern.

Citation

If this work is helpful for your research, please consider citing the following entry.

@article{bistnet,
  title={BiSTNet: Semantic Image Prior Guided Bidirectional Temporal Feature Fusion for Deep Exemplar-based Video Colorization},
  author={Yang, Yixin and Peng, Zhongzheng and Du, Xiaoyu and Tao, Zhulin and Tang, Jinhui and Pan, Jinshan},
  journal={arXiv preprint arXiv:2212.02268},
  year={2022}
}

Acknowledgement

Parts of our code are taken from DeepExemplar, RAFT, HED and ProtoSeg. Thanks for their awesome work.