Qin Kaijie / pdf-miner
Commit 69eb2c3b (Unverified)
Authored Oct 08, 2024 by Xiaomeng Zhao
Committed by GitHub, Oct 08, 2024

Merge pull request #693 from myhloli/dev
feat: add arXiv paper link to header and adjust PDF parsing logic

Parents: de60127c, a71db703

Showing 2 changed files with 11 additions and 1 deletion
magic_pdf/pdf_parse_union_core_v2.py  +1 -1
projects/gradio_app/header.html  +10 -0

magic_pdf/pdf_parse_union_core_v2.py
@@ -191,7 +191,7 @@ def insert_lines_into_block(block_bbox, line_height, page_w, page_h):
     # If the block height is less than n lines of body text, return the block's bbox directly
     if line_height * 3 < block_height:
         if block_height > page_h * 0.25 and page_w * 0.5 > block_weight > page_w * 0.25:  # Possibly a two-column layout, so it can be split more finely
-            lines = int(block_height / line_height)
+            lines = int(block_height / line_height) + 1
         else:
             # If the block's width exceeds 0.4 of the page width, split the block into 3 lines
             if block_weight > page_w * 0.4:
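
To make the effect of the one-line change concrete, here is a minimal sketch of the branch visible in the hunk. The helper name count_lines, the add_one flag, and the sample dimensions are hypothetical and exist only for illustration. The parameter block_width stands in for the source's block_weight. Anything outside the hunk (the else branch, how lines is used afterwards) is not shown in this diff, so the sketch returns None there.

```python
def count_lines(block_height, block_width, line_height, page_w, page_h, add_one=True):
    """Hypothetical helper mirroring only the branch visible in the hunk.

    add_one=False reproduces the old expression, add_one=True the new one.
    Returns None for every case the hunk does not show.
    """
    if line_height * 3 < block_height:
        # Same condition as in the diff: a tall block whose width lies
        # between 25% and 50% of the page width (possibly a two-column layout).
        if block_height > page_h * 0.25 and page_w * 0.5 > block_width > page_w * 0.25:
            lines = int(block_height / line_height)  # int() truncates toward zero
            return lines + 1 if add_one else lines
    return None


# Assumed sample values: a 600 x 800 page, a 200 x 250 block, 12-unit line height.
old = count_lines(250, 200, 12, page_w=600, page_h=800, add_one=False)
new = count_lines(250, 200, 12, page_w=600, page_h=800)
print(old, new)
```

With these assumed values, 250 / 12 is about 20.83, so the old expression truncates to 20 and the new one gives 21. The commit message does not state the motivation. One plausible reading is that the extra line compensates for the truncation in int(), so the block is cut slightly finer than before.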

projects/gradio_app/header.html
@@ -90,6 +90,16 @@
             </a>
           </span>
 
+          <!-- arXiv Link. -->
+          <span class="link-block">
+            <a href="https://arxiv.org/abs/2409.18839" class="external-link button is-normal is-rounded is-dark" style="text-decoration: none; cursor: pointer">
+              <span class="icon" style="margin-right: 8px">
+                <i class="fas fa-file" style="color: white"></i>
+              </span>
+              <span style="color: white">Paper</span>
+            </a>
+          </span>
+
           <!-- Homepage Link. -->
           <span class="link-block">
             <a href="https://opendatalab.com/" class="external-link button is-normal is-rounded is-dark" style="text-decoration: none; cursor: pointer">