A one-stop, open-source, high-quality data extraction tool, supports PDF/webpage/e-book extraction.一站式开源高质量数据提取工具,支持PDF/网页/多格式电子书提取。

Name
Last commit
Last update
.github Loading commit data...
demo Loading commit data...
docs Loading commit data...
magic_pdf Loading commit data...
next_docs Loading commit data...
projects Loading commit data...
scripts Loading commit data...
signatures/version1 Loading commit data...
tests Loading commit data...
.gitattributes Loading commit data...
.gitignore Loading commit data...
.pre-commit-config.yaml Loading commit data...
.readthedocs.yaml Loading commit data...
Dockerfile Loading commit data...
LICENSE.md Loading commit data...
MinerU_CLA.md Loading commit data...
README.md Loading commit data...
README_ja-JP.md Loading commit data...
README_zh-CN.md Loading commit data...
magic-pdf.template.json Loading commit data...
requirements-docker.txt Loading commit data...
requirements-qa.txt Loading commit data...
requirements.txt Loading commit data...
setup.py Loading commit data...
update_version.py Loading commit data...