PyTorch Lightning Optical Flow

Introduction

This is a collection of state-of-the-art deep models for estimating optical flow. The main goal is to provide a unified framework where multiple models can be trained and tested more easily.

The work and code from many others are present here. I tried to make sure everything is properly referenced, but please let me know if I missed something.

This is still under development, so some things may not work as intended. I plan to add more models in the future, as well as keep improving the platform.

What's new
Available models
Results
Getting started
Licenses
Contributing
Citing
Acknowledgements

What's new

- v0.4.0

Major update to support Lightning 2 (finally!). However, it also introduces breaking changes from the previous v0.3 code. See the details below.

Transitioning from v0.3 to v0.4: check the v0.4 upgrade guide
Added features:
- Support for YAML config files. See the config file documentation
- Table comparing PTLFlow results with the original papers to check the stability of the included models.
Added new models:
- NeuFlow v2 https://arxiv.org/abs/2408.10161
Add support for more datasets:
- Middlebury-ST https://vision.middlebury.edu/stereo/data/scenes2014/
- VIPER https://playing-for-benchmarks.org/

- v0.3.2

Added new models:
- MemFlow https://arxiv.org/abs/2404.04808
- NeuFlow https://arxiv.org/abs/2403.10425
- SEA-RAFT https://arxiv.org/abs/2405.14793
- SplatFlow https://arxiv.org/abs/2306.08887
Add support for more datasets:
- TartanAir https://theairlab.org/tartanair-dataset/
- Kubric https://github.com/google-research/kubric
Add ONNX and TensorRT conversion to RAPIDFlow
Fix LR scheduler when accumulating gradients

- v0.3.1

Added new models:
- CCMR https://arxiv.org/abs/2311.02661
- LLA-Flow https://arxiv.org/abs/2304.08101
- RAPIDFlow https://hmorimitsu.com/publication/2024-icra-rapidflow/
Enable FP16 in most models.
- Except the following models, since they have operations that cannot run in FP16: lcv_raft, matchflow, and separableflow
Add FP16 mode in infer, model_benchmark, and validate scripts
Create plot_results.py script
Move resize operations to CUDA (thanks to coca-huang)

- v0.3.0

Added new models:
- DIP https://arxiv.org/abs/2204.00330
- Flow1D https://arxiv.org/abs/2104.13918
- FlowFormer++ https://arxiv.org/abs/2303.01237
- GMFlow+, UniMatch https://arxiv.org/abs/2211.05783
- MatchFlow https://arxiv.org/abs/2303.08384
- MS-RAFT+ https://arxiv.org/abs/2210.16900
- RPKNet https://hmorimitsu.com/publication/2024-aaai-rpknet
- SeparableFlow https://openaccess.thecvf.com/content/ICCV2021/papers/Zhang_Separable_Flow_Learning_Motion_Cost_Volumes_for_Optical_Flow_Estimation_ICCV_2021_paper.pdf
- SKFlow https://arxiv.org/abs/2205.14623
- VideoFlow https://arxiv.org/abs/2303.08340
speed_benchmark.py becomes model_benchmark.py and records more metrics
Fix compatibility with PyTorch 2.0
Fix compatibility with PyTorch Lightning 1.9
Fix resizing augmentation when the valid mask is sparse
Add support for more datasets:
- Middlebury https://vision.middlebury.edu/flow/
- Monkaa https://lmb.informatik.uni-freiburg.de/resources/datasets/SceneFlowDatasets.en.html
- Spring https://spring-benchmark.org/

Available models

CCMR https://arxiv.org/abs/2311.02661
CRAFT https://arxiv.org/abs/2203.16896
CSFlow https://arxiv.org/abs/2202.00909
DICL-Flow https://arxiv.org/abs/2010.14851
DIP https://arxiv.org/abs/2204.00330
FastFlowNet https://arxiv.org/abs/2103.04524
Flow1D https://arxiv.org/abs/2104.13918
FlowFormer https://arxiv.org/abs/2203.16194
FlowFormer++ https://arxiv.org/abs/2303.01237
FlowNet https://arxiv.org/abs/1504.06852
FlowNet2 https://arxiv.org/abs/1612.01925
GMA https://arxiv.org/abs/2104.02409
GMFlow https://arxiv.org/abs/2111.13680
GMFlow+, UniMatch https://arxiv.org/abs/2211.05783
GMFlowNet https://arxiv.org/abs/2203.11335
HD3 https://arxiv.org/abs/1812.06264
IRR https://arxiv.org/abs/1904.05290
LCV https://arxiv.org/abs/2007.11431
LiteFlowNet https://arxiv.org/abs/1805.07036
LiteFlowNet2 https://arxiv.org/abs/1903.07414
LiteFlowNet3 https://arxiv.org/abs/2007.09319
LLA-Flow https://arxiv.org/abs/2304.08101
MaskFlownet https://arxiv.org/abs/2003.10955
MatchFlow https://arxiv.org/abs/2303.08384
MemFlow https://arxiv.org/abs/2404.04808
MS-RAFT+ https://arxiv.org/abs/2210.16900
NeuFlow v1 https://arxiv.org/abs/2403.10425
NeuFlow v2 https://arxiv.org/abs/2408.10161
PWCNet https://arxiv.org/abs/1709.02371
RAFT https://arxiv.org/abs/2003.12039
RAPIDFlow https://hmorimitsu.com/publication/2024-icra-rapidflow/
RPKNet https://hmorimitsu.com/publication/2024-aaai-rpknet
ScopeFlow https://arxiv.org/abs/2002.10770
SCV https://arxiv.org/abs/2104.02166
SEA-RAFT https://arxiv.org/abs/2405.14793
SeparableFlow https://openaccess.thecvf.com/content/ICCV2021/papers/Zhang_Separable_Flow_Learning_Motion_Cost_Volumes_for_Optical_Flow_Estimation_ICCV_2021_paper.pdf
SKFlow https://arxiv.org/abs/2205.14623
SplatFlow https://arxiv.org/abs/2306.08887
STaRFlow https://arxiv.org/abs/2007.05481
VCN https://papers.nips.cc/paper/2019/file/bbf94b34eb32268ada57a3be5062fe7d-Paper.pdf
VideoFlow https://arxiv.org/abs/2303.08340

Read more details about the models on https://ptlflow.readthedocs.io/en/latest/models/models_list.html.

Results

You can see a table with the main evaluation results of the available models here. More results are also available in the folder docs/source/results.

Disclaimer: These results were obtained by evaluating the available models in this framework on my machine. Your results may differ due to differences in hardware and software. I also do not guarantee that the results of each model will be similar to the ones presented in the respective papers or other original sources. If you need to replicate the original results from a paper, you should use the original implementations.
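
When comparing your own numbers against a results table, it can help to recall how a typical optical flow metric is computed. The sketch below assumes the metric of interest is the average end-point error (EPE), that is, the mean Euclidean distance between predicted and ground-truth flow vectors. The function `average_epe` and the toy values are hypothetical and written only for illustration; they are not part of PTLFlow and are not results from any model.

```python
import math
from typing import Sequence, Tuple

# One (u, v) displacement per pixel, flattened into a simple sequence.
Flow = Sequence[Tuple[float, float]]


def average_epe(pred: Flow, target: Flow) -> float:
    """Mean Euclidean distance between predicted and ground-truth flow vectors.

    Hypothetical helper for illustration; not part of PTLFlow.
    """
    if len(pred) != len(target):
        raise ValueError("pred and target must have the same number of pixels")
    if not pred:
        raise ValueError("flow fields must not be empty")
    errors = [
        math.hypot(pu - tu, pv - tv)
        for (pu, pv), (tu, tv) in zip(pred, target)
    ]
    return sum(errors) / len(errors)


# Toy values, made up for illustration only.
pred = [(1.0, 0.0), (0.0, 2.0), (3.0, 4.0)]
target = [(1.0, 0.0), (0.0, 0.0), (0.0, 0.0)]
print(average_epe(pred, target))
```

Real evaluations usually operate on dense arrays and may mask out invalid pixels, so a difference between your numbers and a published table can come from such details as well as from hardware and software.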

Getting started

Please take a look at the documentation to learn how to install and use PTLFlow.

You can also check the example notebooks, which run on Google Colab, for some practical examples.

If you are using the previous v0.3.X code, check the v0.3.2 documentation and its example notebooks instead.

Licenses

The original code of this repository is licensed under the Apache 2.0 license.

Each model may be subject to a different license. The license of each model is included in its respective folder. It is your responsibility to make sure that your project complies with all the licenses and conditions involved.

The external pretrained weights all have different licenses, which are listed in their respective folders.

The pretrained weights that were trained within this project are available under the CC BY-NC-SA 4.0 license, which I believe covers the licenses of the datasets used in the training. That being said, I am not a legal expert, so if you plan to use them for any purpose other than research, you should check all the involved licenses yourself. Additionally, the datasets used for the training usually require the user to cite the original papers, so be sure to include their respective references in your work.

Contributing

Contributions are welcome! Please check CONTRIBUTING.md to see how to contribute.

Citing

BibTeX

@misc{morimitsu2021ptlflow,
  author = {Henrique Morimitsu},
  title = {PyTorch Lightning Optical Flow},
  year = {2021},
  publisher = {GitHub},
  journal = {GitHub repository},
  howpublished = {\url{https://github.com/hmorimitsu/ptlflow}}
}

Acknowledgements

This README file is heavily inspired by the one from the timm repository.
Some parts of the code were inspired by or taken from FlowNetPytorch.
flownet2-pytorch was also another important source.
The current main training routine is based on RAFT.
