• myhloli's avatar
    feat(pdf_parse): improve span filtering and add new block types · 149132d6
    myhloli authored
    - Refactor remove_outside_spans function to filter spans more accurately
    - Add image_footnote, index, and list block types to output file documentation
    - Update draw_span_bbox to use preproc_blocks instead of para_blocks
    - Bump version to 0.9.0
    149132d6
Name
Last commit
Last update
..
Constants.py Loading commit data...
MakeContentConfig.py Loading commit data...
ModelBlockTypeEnum.py Loading commit data...
__init__.py Loading commit data...
boxbase.py Loading commit data...
calc_span_stats.py Loading commit data...
clean_memory.py Loading commit data...
commons.py Loading commit data...
config_reader.py Loading commit data...
convert_utils.py Loading commit data...
coordinate_transform.py Loading commit data...
detect_language_from_model.py Loading commit data...
draw_bbox.py Loading commit data...
drop_reason.py Loading commit data...
drop_tag.py Loading commit data...
hash_utils.py Loading commit data...
json_compressor.py Loading commit data...
language.py Loading commit data...
local_math.py Loading commit data...
markdown_utils.py Loading commit data...
nlp_utils.py Loading commit data...
ocr_content_type.py Loading commit data...
path_utils.py Loading commit data...
pdf_check.py Loading commit data...
pdf_image_tools.py Loading commit data...
safe_filename.py Loading commit data...
textbase.py Loading commit data...
version.py Loading commit data...
vis_utils.py Loading commit data...