Skip to content
Projects
Groups
Snippets
Help
Loading...
Help
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
P
pdf-miner
Project
Project
Details
Activity
Releases
Cycle Analytics
Repository
Repository
Files
Commits
Branches
Tags
Contributors
Graph
Compare
Charts
Issues
0
Issues
0
List
Board
Labels
Milestones
Merge Requests
0
Merge Requests
0
CI / CD
CI / CD
Pipelines
Jobs
Schedules
Charts
Wiki
Wiki
Snippets
Snippets
Members
Members
Collapse sidebar
Close sidebar
Activity
Graph
Charts
Create a new issue
Jobs
Commits
Issue Boards
Open sidebar
Qin Kaijie
pdf-miner
Commits
af2a36ab
Commit
af2a36ab
authored
May 22, 2024
by
quyuan
Browse files
Options
Browse Files
Download
Plain Diff
Merge branch 'master' of
https://github.com/magicpdf/Magic-PDF
parents
484b3304
a288b572
Changes
1
Show whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
1 addition
and
19 deletions
+1
-19
pdf_parse_union_core.py
magic_pdf/pdf_parse_union_core.py
+1
-19
No files found.
magic_pdf/pdf_parse_union_core.py
View file @
af2a36ab
...
...
@@ -59,25 +59,7 @@ def txt_spans_extract(pdf_page, inline_equations, interline_equations):
bbox
=
span
[
"bbox"
]
if
float_equal
(
bbox
[
0
],
bbox
[
2
])
or
float_equal
(
bbox
[
1
],
bbox
[
3
]):
continue
if
span
.
get
(
'type'
)
==
ContentType
.
InlineEquation
:
spans
.
append
(
{
"bbox"
:
list
(
span
[
"bbox"
]),
"content"
:
span
[
"latex"
],
"type"
:
ContentType
.
InlineEquation
,
"score"
:
1.0
,
}
)
elif
span
.
get
(
'type'
)
==
ContentType
.
InterlineEquation
:
spans
.
append
(
{
"bbox"
:
list
(
span
[
"bbox"
]),
"content"
:
span
[
"latex"
],
"type"
:
ContentType
.
InterlineEquation
,
"score"
:
1.0
,
}
)
else
:
if
span
.
get
(
'type'
)
not
in
(
ContentType
.
InlineEquation
,
ContentType
.
InterlineEquation
):
spans
.
append
(
{
"bbox"
:
list
(
span
[
"bbox"
]),
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment