Skip to content
Projects
Groups
Snippets
Help
Loading...
Help
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
P
pdf-miner
Project
Project
Details
Activity
Releases
Cycle Analytics
Repository
Repository
Files
Commits
Branches
Tags
Contributors
Graph
Compare
Charts
Issues
0
Issues
0
List
Board
Labels
Milestones
Merge Requests
0
Merge Requests
0
CI / CD
CI / CD
Pipelines
Jobs
Schedules
Charts
Wiki
Wiki
Snippets
Snippets
Members
Members
Collapse sidebar
Close sidebar
Activity
Graph
Charts
Create a new issue
Jobs
Commits
Issue Boards
Open sidebar
Qin Kaijie
pdf-miner
Commits
magic_pdf-0.5.7-released
Switch branch/tag
pdf-miner
20 Jun, 2024
1 commit
update check invalid_chars algorithm to improve accuracy
· 8998380d
赵小蒙
authored
Jun 20, 2024
8998380d
19 Jun, 2024
5 commits
update annotate
· 35a700da
赵小蒙
authored
Jun 19, 2024
35a700da
Update version.py with new version
· 38de8d59
myhloli
authored
Jun 19, 2024
38de8d59
update: Enhance the capability to detect garbled document issues
· df14c61f
赵小蒙
authored
Jun 19, 2024
df14c61f
Merge remote-tracking branch 'origin/master'
· 89d7964c
赵小蒙
authored
Jun 19, 2024
89d7964c
fix:use line_lang instead of content_lang to concatenate para
· 5de013e6
赵小蒙
authored
Jun 19, 2024
5de013e6
18 Jun, 2024
8 commits
Update version.py with new version
· 6d79b1c7
myhloli
authored
Jun 18, 2024
6d79b1c7
fix local write pdf file name bug
· 5f313bd0
赵小蒙
authored
Jun 18, 2024
5f313bd0
update cli output files
· 3b7342b8
赵小蒙
authored
Jun 18, 2024
3b7342b8
update requirements
· 9dc5033c
赵小蒙
authored
Jun 18, 2024
9dc5033c
update custom model framework
· 389826c5
赵小蒙
authored
Jun 18, 2024
389826c5
Merge pull request #119 from icecraft/feat/parallel_paddle
· c96aa88d
myhloli
authored
Jun 18, 2024
feat: parallelize paddle
c96aa88d
feat: parallelize paddle
· 738f9274
blue
authored
Jun 18, 2024
738f9274
update AVG_TEXT_LEN_THRESHOLD 200->100
· 084dc22a
赵小蒙
authored
Jun 18, 2024
084dc22a
17 Jun, 2024
7 commits
remove useless import
· 6c52856d
赵小蒙
authored
Jun 17, 2024
6c52856d
update pypi upload logic
· c69f414b
赵小蒙
authored
Jun 17, 2024
c69f414b
update pypi upload logic
· 0306d66d
赵小蒙
authored
Jun 17, 2024
0306d66d
update pypi upload logic
· 35d39735
赵小蒙
authored
Jun 17, 2024
35d39735
Update version.py with new version
· e57a9d87
myhloli
authored
Jun 17, 2024
e57a9d87
use fast_langdetect replace cld2
· ce0d9905
赵小蒙
authored
Jun 17, 2024
ce0d9905
make paddle analyze mode adaptation cli input mode to improve analyze speed
· 06063014
赵小蒙
authored
Jun 17, 2024
06063014
14 Jun, 2024
4 commits
update github workflow to Publish to PyPI
· 39b46ea9
赵小蒙
authored
Jun 14, 2024
39b46ea9
update github workflow
· aeef64b4
赵小蒙
authored
Jun 14, 2024
aeef64b4
Merge remote-tracking branch 'origin/master'
· d2e82713
赵小蒙
authored
Jun 14, 2024
d2e82713
update paddleocr url
· d62dd249
赵小蒙
authored
Jun 14, 2024
d62dd249
13 Jun, 2024
5 commits
Update version.py with new version
· 0c33f2f0
myhloli
authored
Jun 13, 2024
0c33f2f0
Merge remote-tracking branch 'origin/master'
· 64c62843
赵小蒙
authored
Jun 13, 2024
64c62843
update paddleocr to 2.8+ and add layout score output
· a5ff8ace
赵小蒙
authored
Jun 13, 2024
a5ff8ace
Merge pull request #118 from papayalove/master
· 0b97f265
myhloli
authored
Jun 13, 2024
修复分段边界问题
0b97f265
修复分段边界问题
· 2284e0d7
liukaiwen
authored
Jun 13, 2024
2284e0d7
12 Jun, 2024
3 commits
Update version.py with new version
· f80560ff
myhloli
authored
Jun 12, 2024
f80560ff
add paddlepaddle in requirements.txt
· 5aa2e012
赵小蒙
authored
Jun 12, 2024
5aa2e012
update: use paddleocr analyze layout in no model_json input
· 384c979d
赵小蒙
authored
Jun 12, 2024
384c979d
11 Jun, 2024
4 commits
Update version.py with new version
· 2ad0134c
myhloli
authored
Jun 11, 2024
2ad0134c
Merge pull request #117 from papayalove/master
· 0678a860
myhloli
authored
Jun 11, 2024
修复分段边界问题
0678a860
修复分段边界问题
· 9c6cb7b7
liukaiwen
authored
Jun 11, 2024
9c6cb7b7
update: model parse support paddle output
· bf18172d
赵小蒙
authored
Jun 11, 2024
bf18172d
07 Jun, 2024
1 commit
add todo about interline_equation
· e92de758
赵小蒙
authored
Jun 07, 2024
e92de758
06 Jun, 2024
2 commits
Update version.py with new version
· b7a418b5
myhloli
authored
Jun 06, 2024
b7a418b5
fix: some text char removed by interline_equations overlap
· 3c145ba0
赵小蒙
authored
Jun 06, 2024
3c145ba0