• myhloli's avatar
    refactor(magic_pdf): improve line sorting and block indexing · 564c4ce1
    myhloli authored
    - Insert lines into blocks based on median line height- Calculate block index using line indices median
    - Remove virtual line information for table and image blocks
    - Enhance line sorting algorithm for different block types
    - Add line height calculation function
    564c4ce1
Name
Last commit
Last update
.github Loading commit data...
demo Loading commit data...
docs Loading commit data...
magic_pdf Loading commit data...
projects Loading commit data...
signatures/version1 Loading commit data...
tests Loading commit data...
web_api Loading commit data...
.gitignore Loading commit data...
.pre-commit-config.yaml Loading commit data...
Dockerfile Loading commit data...
LICENSE.md Loading commit data...
MinerU_CLA.md Loading commit data...
README.md Loading commit data...
README.md.bak Loading commit data...
README_ja-JP.md Loading commit data...
README_zh-CN.md Loading commit data...
README_zh-CN.md.bak Loading commit data...
magic-pdf.template.json Loading commit data...
requirements-docker.txt Loading commit data...
requirements-qa.txt Loading commit data...
requirements.txt Loading commit data...
setup.py Loading commit data...
update_version.py Loading commit data...