-
myhloli authored
Add a new function `draw_line_sort_bbox` to visualize the sorting of lines on each page. This includes indexing lines and handling both text and non-text elements such as tables and images for better content organization. Also, comment out GPU-related code for flexibility and remove overlaps in bounding box detection, which improves the accuracy of layout splitting.
34f89650
Name |
Last commit
|
Last update |
---|---|---|
.. | ||
dict2md | ||
filter | ||
integrations | ||
layout | ||
libs | ||
model | ||
para | ||
pipe | ||
post_proc | ||
pre_proc | ||
resources | ||
rw | ||
spark | ||
tools | ||
v3 | ||
__init__.py | ||
pdf_parse_by_ocr.py | ||
pdf_parse_by_txt.py | ||
pdf_parse_union_core.py | ||
pdf_parse_union_core_v2.py | ||
user_api.py |