Skip to content
Projects
Groups
Snippets
Help
Loading...
Help
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
P
pdf-miner
Project
Project
Details
Activity
Releases
Cycle Analytics
Repository
Repository
Files
Commits
Branches
Tags
Contributors
Graph
Compare
Charts
Issues
0
Issues
0
List
Board
Labels
Milestones
Merge Requests
0
Merge Requests
0
CI / CD
CI / CD
Pipelines
Jobs
Schedules
Charts
Wiki
Wiki
Snippets
Snippets
Members
Members
Collapse sidebar
Close sidebar
Activity
Graph
Charts
Create a new issue
Jobs
Commits
Issue Boards
Open sidebar
Qin Kaijie
pdf-miner
Commits
fc6a7c30
Unverified
Commit
fc6a7c30
authored
Aug 09, 2024
by
sfk
Committed by
GitHub
Aug 09, 2024
Browse files
Options
Browse Files
Download
Email Patches
Plain Diff
Update README_zh-CN_v2.md
parent
4ec8466e
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
9 additions
and
2 deletions
+9
-2
README_zh-CN_v2.md
README_zh-CN_v2.md
+9
-2
No files found.
README_zh-CN_v2.md
View file @
fc6a7c30
...
@@ -175,7 +175,7 @@ https://github.com/user-attachments/assets/4bea02c9-6d54-4cd6-97ed-dff14340982c
...
@@ -175,7 +175,7 @@ https://github.com/user-attachments/assets/4bea02c9-6d54-4cd6-97ed-dff14340982c
### 使用CPU快速体验
### 使用CPU快速体验
#### 1. 安装magic-pdf
#### 1. 安装magic-pdf
最新版本国内镜像源同步可能会有延迟,请耐心等待
```
bash
```
bash
conda create
-n
MinerU
python
=
3.10
conda create
-n
MinerU
python
=
3.10
conda activate MinerU
conda activate MinerU
...
@@ -321,7 +321,14 @@ TODO
...
@@ -321,7 +321,14 @@ TODO
-
漫画书、艺术图册、小学教材、习题尚不能很好解析
-
漫画书、艺术图册、小学教材、习题尚不能很好解析
-
在一些公式密集的PDF上强制启用OCR效果会更好
-
在一些公式密集的PDF上强制启用OCR效果会更好
-
如果您要处理包含大量公式的pdf,强烈建议开启OCR功能。使用pymuPDF提取文字的时候会出现文本行互相重叠的情况导致公式插入位置不准确。
-
如果您要处理包含大量公式的pdf,强烈建议开启OCR功能。使用pymuPDF提取文字的时候会出现文本行互相重叠的情况导致公式插入位置不准确。
-
表格识别目前处于测试阶段识别速度较慢,识别准确度有待提升
-
**表格识别**
目前处于测试阶段,识别速度较慢,识别准确度有待提升。以下是我们在Ubuntu 22.04 LTS + NVIDIA GeForce RTX 4090环境下的一些性能测试结果,可供参考。
| 表格大小 | 解析耗时 |
|---------------|----------------------------|
| 6
\*
5 55kb | 37s |
| 16
\*
12 284kb | 3m18s |
| 44
\*
7 559kb | 4m12s |
# FAQ
# FAQ
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment