Skip to content
Projects
Groups
Snippets
Help
Loading...
Help
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
P
pdf-miner
Project
Project
Details
Activity
Releases
Cycle Analytics
Repository
Repository
Files
Commits
Branches
Tags
Contributors
Graph
Compare
Charts
Issues
0
Issues
0
List
Board
Labels
Milestones
Merge Requests
0
Merge Requests
0
CI / CD
CI / CD
Pipelines
Jobs
Schedules
Charts
Wiki
Wiki
Snippets
Snippets
Members
Members
Collapse sidebar
Close sidebar
Activity
Graph
Charts
Create a new issue
Jobs
Commits
Issue Boards
Open sidebar
Qin Kaijie
pdf-miner
Commits
79d850b8
Commit
79d850b8
authored
Jun 28, 2024
by
赵小蒙
Browse files
Options
Browse Files
Download
Email Patches
Plain Diff
update readme
parent
7c7099ad
Changes
2
Hide whitespace changes
Inline
Side-by-side
Showing
2 changed files
with
8 additions
and
8 deletions
+8
-8
README.md
README.md
+4
-4
README_zh-CN.md
README_zh-CN.md
+4
-4
No files found.
README.md
View file @
79d850b8
...
@@ -21,8 +21,8 @@
...
@@ -21,8 +21,8 @@
MinerU is a one-stop, open-source data extraction tool, primarily includes the following features:
MinerU is a one-stop, open-source data extraction tool, primarily includes the following features:
-
PDF Document Extraction
[
Magic-PDF
](
#Magic-PDF
)
-
[
Magic-PDF
](
#Magic-PDF
)
PDF Document Extraction
-
Webpage & E-book Extraction
[
Magic-Doc
](
#Magic-Doc
)
-
[
Magic-Doc
](
#Magic-Doc
)
Webpage & E-book Extraction
# Magic-PDF
# Magic-PDF
...
@@ -58,9 +58,9 @@ https://github.com/magicpdf/Magic-PDF/assets/11393164/618937cb-dc6a-4646-b433-e3
...
@@ -58,9 +58,9 @@ https://github.com/magicpdf/Magic-PDF/assets/11393164/618937cb-dc6a-4646-b433-e3
### Submodule Repositories
### Submodule Repositories
-
[
PDF-Extract-Kit
](
https://github.com/opendatalab/PDF-Extract-Kit
)
-
[
PDF-Extract-Kit
](
https://github.com/opendatalab/PDF-Extract-Kit
)
A Comprehensive Toolkit for High-Quality PDF Content Extraction
-
A Comprehensive Toolkit for High-Quality PDF Content Extraction
-
[
Miner-PDF-Benchmark
](
https://github.com/opendatalab/Miner-PDF-Benchmark
)
-
[
Miner-PDF-Benchmark
](
https://github.com/opendatalab/Miner-PDF-Benchmark
)
An end-to-end PDF document comprehension evaluation suite designed for large-scale model data scenarios
-
An end-to-end PDF document comprehension evaluation suite designed for large-scale model data scenarios
## Getting Started
## Getting Started
...
...
README_zh-CN.md
View file @
79d850b8
...
@@ -21,8 +21,8 @@
...
@@ -21,8 +21,8 @@
MinerU 是一款一站式开源数据提取工具,主要包含以下功能:
MinerU 是一款一站式开源数据提取工具,主要包含以下功能:
-
PDF文档提取
[
Magic-PDF
](
#Magic-PDF
)
-
[
Magic-PDF
](
#Magic-PDF
)
PDF文档提取
-
网页与电子书提取
[
Magic-Doc
](
#Magic-Doc
)
-
[
Magic-Doc
](
#Magic-Doc
)
网页与电子书提取
# Magic-PDF
# Magic-PDF
...
@@ -58,9 +58,9 @@ https://github.com/magicpdf/Magic-PDF/assets/11393164/618937cb-dc6a-4646-b433-e3
...
@@ -58,9 +58,9 @@ https://github.com/magicpdf/Magic-PDF/assets/11393164/618937cb-dc6a-4646-b433-e3
### 子模块仓库
### 子模块仓库
-
[
PDF-Extract-Kit
](
https://github.com/opendatalab/PDF-Extract-Kit
)
-
[
PDF-Extract-Kit
](
https://github.com/opendatalab/PDF-Extract-Kit
)
高质量的PDF内容提取工具包
-
高质量的PDF内容提取工具包
-
[
Miner-PDF-Benchmark
](
https://github.com/opendatalab/Miner-PDF-Benchmark
)
-
[
Miner-PDF-Benchmark
](
https://github.com/opendatalab/Miner-PDF-Benchmark
)
端到端的PDF文档理解评估套件,专为大规模模型数据场景而设计
-
端到端的PDF文档理解评估套件,专为大规模模型数据场景而设计
## 上手指南
## 上手指南
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment