    refactor(magic_pdf): improve line sorting and block indexing · 564c4ce1
    myhloli authored
    - Insert lines into blocks based on median line height
    - Calculate block index using line indices median
    - Remove virtual line information for table and image blocks
    - Enhance line sorting algorithm for different block types
    - Add line height calculation function
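A minimal sketch of two ideas named in the commit message: a line height calculation and a block index taken as the median of a block's line indices. This is not the magic_pdf implementation. The function names, the `(x0, y0, x1, y1)` bbox layout, and the `line_indices` key are all assumptions made for illustration.

```python
from statistics import median


def line_height(bbox):
    """Height of one line. Assumes bbox = (x0, y0, x1, y1), y growing downward."""
    x0, y0, x1, y1 = bbox
    return y1 - y0


def median_line_height(line_bboxes):
    """Median height over a collection of line bboxes (hypothetical helper)."""
    return median(line_height(b) for b in line_bboxes)


def block_index(line_indices):
    """Index of a block, taken as the median of the indices of its lines."""
    return median(line_indices)


def sort_blocks(blocks):
    """Order blocks by block index. Each block is assumed to be a dict
    with a 'line_indices' list (hypothetical structure)."""
    return sorted(blocks, key=lambda b: block_index(b["line_indices"]))
```

The median is less sensitive than the mean to a single unusually tall line or a single out-of-order line index, which may be why the commit favours it.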
Name
dict2md
filter
integrations
layout
libs
model
para
pipe
post_proc
pre_proc
resources
rw
spark
tools
__init__.py
pdf_parse_by_ocr.py
pdf_parse_by_txt.py
pdf_parse_union_core.py
pdf_parse_union_core_v2.py
user_api.py