Releases: NoEdgeAI/pdfdeal

V0.1.1

22 Jun 14:17

✨ New Features

  • All functions now support a new return format, selected through the optional parameter version. With v2, a function returns three values: list (successfully processed files), list (files that failed processing), and bool. The default, v1, returns only the list of successfully processed files.
  • pdf2file and file2pdf now support the optional parameter output_names to specify the output file names.
  • Added a request retry mechanism: failed network requests are now retried automatically.
  • Added an error handling mechanism: errors raised while processing a file are now handled automatically, so a single failing file no longer interrupts the entire program.
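The return format, retry, and error handling items above can be pictured with a small standalone sketch. It does not call pdfdeal: process_files, with_retry, and fake_convert are hypothetical names, and the meaning given to the v2 bool (whether any file failed) is an assumption, since the note does not define it.

```python
import time
from typing import Callable, List


def with_retry(func: Callable[[str], str], arg: str,
               retries: int = 3, delay: float = 0.0) -> str:
    """Call func(arg), retrying after an exception, for at most `retries` attempts."""
    last_error: Exception = RuntimeError("no attempt was made")
    for _ in range(max(1, retries)):
        try:
            return func(arg)
        except Exception as exc:  # a real client would catch network errors only
            last_error = exc
            time.sleep(delay)
    raise last_error


def process_files(paths: List[str], convert: Callable[[str], str],
                  version: str = "v1"):
    """Convert every path, collecting failures instead of stopping at the first one."""
    success: List[str] = []
    failed: List[str] = []
    for path in paths:
        try:
            success.append(with_retry(convert, path))
        except Exception:
            failed.append(path)  # one bad file does not interrupt the loop
    if version == "v2":
        # Assumption: the bool signals whether any file failed.
        return success, failed, bool(failed)
    return success  # v1: only the successfully processed files


def fake_convert(path: str) -> str:
    """Hypothetical stand-in for a conversion call; fails for names containing 'bad'."""
    if "bad" in path:
        raise ConnectionError(f"could not convert {path}")
    return path.replace(".pdf", ".md")
```

With these definitions, process_files(paths, fake_convert) returns only the list of successes, while passing version="v2" also returns the failed files and the flag.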

🐛 Bug Fixes

  • Fixed the font exception problem in the pdfdeal function.
  • Fixed some cases of abnormal key usage.
  • Fixed a problem where the RPM limit might not take effect.

V0.1.0

20 Jun 11:56
7c8de88

Important

The Doc2x methods from version 0.0.X are deprecated and will be removed in a future release. Please migrate to the new implementation as soon as possible. You will receive a warning if you still use the old methods.

Most of the interface is unchanged, so in many cases it is enough to replace "from pdfdeal.doc2x import Doc2x" with "from pdfdeal.doc2x import Doc2X".

✨ New Features

  • Refactored Doc2X support to use concurrency, which speeds up processing.
  • New: RPM limiter
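
As a rough illustration of what an RPM limiter does, assuming RPM means requests per minute, here is a minimal sliding-window sketch. RpmLimiter and its acquire method are hypothetical names for illustration and are not pdfdeal's implementation; the sketch is single-threaded and would need a lock before being shared across concurrent workers.

```python
import time
from collections import deque


class RpmLimiter:
    """Allow at most `rpm` calls within any sliding 60-second window."""

    WINDOW = 60.0  # seconds

    def __init__(self, rpm, clock=time.monotonic, sleep=time.sleep):
        if rpm < 1:
            raise ValueError("rpm must be at least 1")
        self.rpm = rpm
        self._clock = clock    # injectable so the limiter can be tested without waiting
        self._sleep = sleep
        self._stamps = deque() # timestamps of the calls still inside the window

    def acquire(self):
        """Block until one more call fits in the window, then record it."""
        while True:
            now = self._clock()
            # Forget calls that have left the 60-second window.
            while self._stamps and now - self._stamps[0] >= self.WINDOW:
                self._stamps.popleft()
            if len(self._stamps) < self.rpm:
                break
            # Window is full: wait until the oldest recorded call expires.
            self._sleep(self.WINDOW - (now - self._stamps[0]))
        self._stamps.append(now)
```

A caller would invoke acquire() immediately before each request, so that bursts beyond the configured rate are delayed rather than rejected.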

🐛 Bug Fixes

  • Performance improvement

Add support for Features of Doc2X Translation

13 Jun 03:04
  • Added support for the Doc2X translation feature.

Fix some bugs in Doc2X API

05 Jun 12:26
  • Added support for converting images to plain formulas in Doc2X.
  • Added a switch to convert [ to $.

Add more about Doc2X features

01 Jun 07:33
  • Doc2X asynchronous functionality is now supported.
  • Added a processing flow designed specifically for Doc2X: each page of the processed PDF contains the source text of the corresponding page of the original document.

Add support for Doc2X API

31 May 15:18
  • Added Doc2X as a new OCR engine.
  • Doc2X's conversion function can also be used on its own: from pdfdeal.doc2x import Doc2x

Added some new OCR options

29 May 12:14
  • Fixed a build bug.
  • easyocr or pytesseract can now be used as the OCR engine, or OCR can be skipped entirely.
  • Improved package installation.
  • Fixed a bug where the output PDF had no line breaks.

Added some new OCR options

29 May 11:53
  • easyocr or pytesseract can now be used as the OCR engine, or OCR can be skipped entirely.
  • Improved package installation.
  • Fixed a bug where the output PDF had no line breaks.