Commit 79d850b8 authored by 赵小蒙's avatar 赵小蒙

update readme

parent 7c7099ad
......@@ -21,8 +21,8 @@
MinerU is a one-stop, open-source data extraction tool, primarily includes the following features:
- PDF Document Extraction [Magic-PDF](#Magic-PDF)
- Webpage & E-book Extraction [Magic-Doc](#Magic-Doc)
- [Magic-PDF](#Magic-PDF) PDF Document Extraction
- [Magic-Doc](#Magic-Doc) Webpage & E-book Extraction
# Magic-PDF
......@@ -58,9 +58,9 @@ https://github.com/magicpdf/Magic-PDF/assets/11393164/618937cb-dc6a-4646-b433-e3
### Submodule Repositories
- [PDF-Extract-Kit](https://github.com/opendatalab/PDF-Extract-Kit)
A Comprehensive Toolkit for High-Quality PDF Content Extraction
- A Comprehensive Toolkit for High-Quality PDF Content Extraction
- [Miner-PDF-Benchmark](https://github.com/opendatalab/Miner-PDF-Benchmark)
An end-to-end PDF document comprehension evaluation suite designed for large-scale model data scenarios
- An end-to-end PDF document comprehension evaluation suite designed for large-scale model data scenarios
## Getting Started
......
......@@ -21,8 +21,8 @@
MinerU 是一款一站式开源数据提取工具,主要包含以下功能:
- PDF文档提取 [Magic-PDF](#Magic-PDF)
- 网页与电子书提取 [Magic-Doc](#Magic-Doc)
- [Magic-PDF](#Magic-PDF) PDF文档提取
- [Magic-Doc](#Magic-Doc) 网页与电子书提取
# Magic-PDF
......@@ -58,9 +58,9 @@ https://github.com/magicpdf/Magic-PDF/assets/11393164/618937cb-dc6a-4646-b433-e3
### 子模块仓库
- [PDF-Extract-Kit](https://github.com/opendatalab/PDF-Extract-Kit)
高质量的PDF内容提取工具包
- 高质量的PDF内容提取工具包
- [Miner-PDF-Benchmark](https://github.com/opendatalab/Miner-PDF-Benchmark)
端到端的PDF文档理解评估套件,专为大规模模型数据场景而设计
- 端到端的PDF文档理解评估套件,专为大规模模型数据场景而设计
## 上手指南
......
Markdown is supported
0% or
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment