Ever downloaded a lecture slide like this?
The duplicates are useful during a presentation, but they become annoying when you want to read the slides as a PDF. This simple tool automatically drops the duplicate pages from PDF files for better readability.
Use the following command to install Nyquisis together with its dependencies:
$ pip3 install nyquisis
Run it from the command line:
$ nyquisis ./ve216,chap1,teach.pdf
In this case, it will generate ./output/ve216,chap1,teach.pdf.
You can also specify the output location:
$ nyquisis ./ve216,chap1,teach.pdf -o ./chap1_output.pdf
The path can also be a directory:
$ nyquisis lecture_slides
This traverses all PDF files under the directory and writes each duplicate-dropped version to an ./output folder in the same directory as the original PDF file. In this mode, the -o or --out argument is not valid.
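The default output locations described above can be sketched in a few lines of Python. This is only an illustration of the path convention, not Nyquisis's actual code: the function names `default_output_path` and `plan_outputs` are hypothetical, and the sketch assumes that directory traversal is recursive and matches files by the `.pdf` extension.

```python
from pathlib import Path


def default_output_path(pdf_path: Path) -> Path:
    """Return <parent>/output/<name>, the default location described above."""
    return pdf_path.parent / "output" / pdf_path.name


def plan_outputs(path: Path) -> dict[Path, Path]:
    """Map each input PDF to the place its duplicate-dropped copy would go.

    Assumption: a directory is searched recursively for *.pdf files.
    """
    if path.is_dir():
        pdfs = sorted(path.rglob("*.pdf"))
    else:
        pdfs = [path]
    return {pdf: default_output_path(pdf) for pdf in pdfs}


if __name__ == "__main__":
    # Prints the planned input -> output mapping for the directory example.
    for source, target in plan_outputs(Path("lecture_slides")).items():
        print(f"{source} -> {target}")
```

Note that a real tool would likely also need to skip files already sitting in an `output` folder so that a second run does not process its own results; the sketch leaves that out for brevity.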
- Generate PDF files without duplicates.
- Traverse a given directory and drop duplicates from every PDF file in it.
- Preserve the TOC structure of the original PDF file.
- 2022/1/20: Remastered.
- 2020/5/19: pdfDropDuplicates now preserves the original TOC structure.
- 2020/5/18: Now uses PyMuPDF instead of PyPDF2 and pdf2image. Also rewrote some parts to remove the OpenCV dependency.