PowerQE: An Open Framework for Quality Enhancement of Compressed Visual Data

💪 An unified framework for training/testing blind/non-blind fidelity/perception-oriented quality enhancement approaches for compressed images/videos based on PyTorch.

🛠️ Support now:

🚀 Clone: this repository adopts PythonUtils as a sub-module. You may clone PowerQE by:

git clone [email protected]:RyanXingQL/PowerQE.git --depth=1 --recursive

📓 [Wiki]

📧 Feel free to contact: [email protected].

0. Archive

v2: support full-resolution DIV2K data-set.
v1: support down-sampled DIV2K data-set.

1. Dependency

Basis

conda create -n pqe python=3.7 -y && conda activate pqe

# case 1: given CUDA 10.x
python -m pip install torch==1.6.0+cu101 torchvision==0.7.0+cu101 -f https://download.pytorch.org/whl/torch_stable.html
python -m pip install tqdm lmdb pyyaml opencv-python scikit-image tensorboard lpips

# case 2: given CUDA 11.x
python -m pip install torch==1.9.0+cu111 torchvision==0.10.0+cu111 torchaudio==0.9.0 -f https://download.pytorch.org/whl/torch_stable.html
python -m pip install tqdm lmdb pyyaml opencv-python scikit-image tensorboard lpips

Deformable Convolutions V2 (required only for STDF)

cd ./net/ops/dcn
conda activate pqe && sh build.sh

# check (optional)
python simple_check.py

2. Image Data

We take the DIV2K dataset for an example.

Download & Unzip & Symlink

cd <any-path-to-your-database>
mkdir div2k && cd div2k/

# 800 2K images for training and validation (3.3 GB)
wget http://data.vision.ee.ethz.ch/cvl/DIV2K/DIV2K_train_HR.zip

# 100 2K images for test (428 MB)
wget http://data.vision.ee.ethz.ch/cvl/DIV2K/DIV2K_valid_HR.zip

mkdir raw/
unzip -j DIV2K_train_HR.zip -d raw/
unzip -j DIV2K_valid_HR.zip -d raw/

cd <path-to-PowerQE>
mkdir data && ln -s <path-to-div2k> data/

We take {0001-0700}.png for training, {0701-0800}.png for validation, and {0801-0900}.png for test.

Compress

We take JPEG (QF=10) and BPG (HEVC-MSP, QP=42) as examples.

[How to install BPG]

cd ./data_pre_process/ && conda activate pqe

# JPEG QF=10
python main_compress_jpeg.py div2k raw jpeg 10

# BPG (HEVC-MSP) QP=42
python main_compress_bpg.py div2k raw bpg 42 <dir-to-libbpg>

Combine images of different distortions (required only for blind QE)

For blind QE tasks, we combine images of different QP/QFs into new dirs by symlink.

cd ./data_pre_process/ && conda activate pqe

python main_compress_jpeg_blind.py div2k raw jpeg  # compress images with qf=10, 20, 30, 40 and 50 first
python main_combine_im.py div2k raw jpeg

python main_compress_bpg_blind.py div2k raw bpg <dir-to-libbpg>  # compress images with qp=42, 37, 32, 27 and 22 first
python main_combine_im.py div2k raw bpg

3. Video Data

To obtain uncompressed raw videos, we adopt the MFQEv2 dataset.

It may take days to compress videos due to the low speed of the HM codec.

4. Train

Edit YML in opts/, then run:

conda activate pqe && CUDA_VISIBLE_DEVICES=0 python train.py -opt opts/arcnn.yml -case div2k_qf10

You can also use multiple gpus:

conda activate pqe && CUDA_VISIBLE_DEVICES=0,1,2,3 python -m torch.distributed.launch --nproc_per_node=4 --master_port=1111 train.py -opt opts/arcnn.yml -case div2k_qf10

Tensorboard:

cd ./exp/arcnn_div2k_qf10
conda activate pqe && tensorboard --logdir=./ --port=10001 --bind_all

5. Test

Edit YML in opts/, then run:

conda activate pqe && CUDA_VISIBLE_DEVICES=0 python test.py -opt opts/arcnn.yml -case div2k_qf10

6. Result

7. License

We adopt Apache License v2.0.

If you find this repository helpful, you may cite:

@misc{PowerQE_xing_2021,
  author = {Qunliang Xing},
  title = {PowerQE: An Open Framework for Quality Enhancement of Compressed Visual Data},
  howpublished = "\url{https://github.com/RyanXingQL/PowerQE}",
  year = {2021},
  note = "[Online; accessed 11-April-2021]"
}

Name		Name	Last commit message	Last commit date
Latest commit History 71 Commits
algorithm		algorithm
data_pre_process		data_pre_process
dataset		dataset
net		net
opts		opts
utils @ 1e1b688		utils @ 1e1b688
.gitignore		.gitignore
.gitmodules		.gitmodules
LICENSE		LICENSE
README.md		README.md
test.py		test.py
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PowerQE: An Open Framework for Quality Enhancement of Compressed Visual Data

0. Archive

1. Dependency

Basis

Deformable Convolutions V2 (required only for STDF)

2. Image Data

Download & Unzip & Symlink

Compress

Combine images of different distortions (required only for blind QE)

3. Video Data

4. Train

5. Test

6. Result

7. License

About

Releases

Packages

Languages

License

distoramos/PowerQE

Folders and files

Latest commit

History

Repository files navigation

PowerQE: An Open Framework for Quality Enhancement of Compressed Visual Data

0. Archive

1. Dependency

Basis

Deformable Convolutions V2 (required only for STDF)

2. Image Data

Download & Unzip & Symlink

Compress

Combine images of different distortions (required only for blind QE)

3. Video Data

4. Train

5. Test

6. Result

7. License

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages