FPD

Overview

FPD is the first manually annotated dataset that focuses on fine-grained segmentation. It consists of 66 documents with non-Manhattan layouts (shown in FPD.jpg). The annotations were created with point labeling and polygon labeling and exported in the "CVAT for images 1.1" format. The pages come from complex Chinese and English magazines, and the page size is not fixed.
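
As a rough illustration of how an export in the "CVAT for images 1.1" format might be read, the sketch below parses polygon and point shapes with Python's standard library. It assumes the usual layout of that format (an `annotations` root, one `image` element per page carrying `name`, `width`, and `height`, and `polygon` / `points` children whose `points` attribute is a `;`-separated list of `x,y` pairs). The file name, label names, and coordinates in `SAMPLE` are made up for the example and are not taken from FPD.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample in the "CVAT for images 1.1" layout; the file name,
# labels, and coordinates are illustrative only, not real FPD content.
SAMPLE = """<annotations>
  <image id="0" name="page_0001.jpg" width="800" height="1200">
    <polygon label="text" points="10.0,20.0;110.0,20.0;110.0,80.0;10.0,80.0" occluded="0"/>
    <points label="anchor" points="50.5,60.5" occluded="0"/>
  </image>
</annotations>"""


def parse_points(points_attr):
    """Convert 'x1,y1;x2,y2' into [(x1, y1), (x2, y2)] as floats."""
    return [
        tuple(float(v) for v in pair.split(","))
        for pair in points_attr.split(";")
        if pair
    ]


def load_cvat_annotations(xml_text):
    """Return one dict per page with its size and its polygon/point shapes."""
    root = ET.fromstring(xml_text)
    pages = []
    for image in root.iter("image"):
        page = {
            "name": image.get("name"),
            # Page size is read per image because it is not fixed.
            "width": int(image.get("width")),
            "height": int(image.get("height")),
            "shapes": [],
        }
        for child in image:
            # Only the two shape types mentioned in the overview are kept.
            if child.tag in ("polygon", "points"):
                page["shapes"].append(
                    {
                        "type": child.tag,
                        "label": child.get("label"),
                        "points": parse_points(child.get("points", "")),
                    }
                )
        pages.append(page)
    return pages
```

Calling `load_cvat_annotations(SAMPLE)` returns a list with one page entry; for a real export, pass the contents of the annotation XML file instead. Because page sizes vary, coordinates should be normalized against each page's own `width` and `height` rather than a global constant.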

For more information, please consult our paper (see Citation).

Access

To download the FPD dataset, please send an email to [email protected] and [email protected]. We will reply with the dataset download link.

Citation

If you use the FPD data, please cite:

@inproceedings{ma2023image,
  title={Image Layer Modeling for Complex Document Layout Generation},
  author={Ma, Tianlong and Wu, Xingjiao and Du, Xiangcheng and Wang, Yanlong and Jin, Cheng},
  booktitle={2023 IEEE International Conference on Multimedia and Expo (ICME)},
  pages={2261--2266},
  year={2023},
  organization={IEEE}
}
