Skip to content

A one-stop, open-source, high-quality data extraction tool, supports PDF/webpage/e-book extraction.一站式开源高质量数据提取工具,支持PDF/网页/多格式电子书提取。

License

Notifications You must be signed in to change notification settings

quasimodo226/MinerU

Repository files navigation

Magic-PDF

便捷、准确的将PDF转换成Markdown文档

上手指南

开发前的配置要求

python 3.9+

安装步骤

1.Clone the repo

git clone https://github.com/myhloli/Magic-PDF.git

2.Install the requirements

pip install -r requirements.txt

3.Run the main script

use demo/demo_test.py

版权说明

LICENSE.md

鸣谢

About

A one-stop, open-source, high-quality data extraction tool, supports PDF/webpage/e-book extraction.一站式开源高质量数据提取工具,支持PDF/网页/多格式电子书提取。

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%