Releases: NoEdgeAI/pdfdeal
Releases · NoEdgeAI/pdfdeal
V0.1.1
✨ New Features
- All functions now support the new return format, through the optional parameter
version
to choose, when it isv2
, it will return:list: successful processing files
list: processing failed files
bool
, and the defaultv1
return parameter will only returnlist: successful processing files
. pdf2file
andfile2pdf
now support the optional parameteroutput_names
to specify the output file name.- Added request retry mechanism, now it will automatically retry when the network request fails.
- Added error handling mechanism, now it will automatically handle errors when processing files, and will not cause the entire program to interrupt due to a file error.
🐛 Bug Fixes
- Fixed the font exception problem in the
pdfdeal
function. - Fixed some abnormal use of keys.
- Fixed the problem that the rpm limit may not take effect.
V0.1.0
Important
The Doc2x methods in version 0.0.X
have been deprecated and will be removed in the future, please migrate to the new implementation as soon as possible. You will receive a warning when you still use the old methods.
Most of its interface has not changed, you can try to change from pdfdeal.doc2x import Doc2x
to from pdfdeal.doc2x import Doc2X
directly.
✨ New Features
- Refactored Doc2X support using concurrency to speed up processing.
- New: RPM limiter
🐛 Bug Fixes
- Performance improvement
Add support for Features of Doc2X Translation
-
Add support for Features of Doc2X Translation
-
添加了Doc2X翻译功能的支持
Fix some bug in Doc2X API
-
增加了对Doc2X中图片转换纯公式的支持
-
添加了将[转换为$的开关
-
Added support for converting images to plain formulas in Doc2X
-
Added switch to convert [to $.
Add more about Doc2X features
- Asynchronous functionality of Doc2X is supported.
- Processes designed specifically for Doc2X, the content of each page of the processed pdf is the source document corresponding to the number of pages.
Add support for Doc2X API
- Add Doc2X as a new OCR engine
- Also can use Doc2X's conversion function alone:
from pdfdeal.doc2x import Doc2x
Added some new OCR options
- Build bug fix
- Now can use
easyocr
orpytesseract
as the OCR engine or just skip OCR. - Improved package installation.
- Fixed the bug of outputting PDF without line breaks.
Added some new OCR options
- Now can use
easyocr
orpytesseract
as the OCR engine or just skip OCR. - Improved package installation.
- Fixed the bug of outputting PDF without line breaks.