• myhloli's avatar
    fix(pdf_parse): handle blocks without lines and enable bf16 on compatible devices · 2145a8b6
    myhloli authored
    Blocks without lines are now correctly indexed even when they contain textual content rendered
    as images. The sorting logic has been updated to accommodate this scenario. Additionally, the
    LayoutLMv3 model initialization has been enhanced to utilize bfloat16 precision on devices that
    support it, offering potential performance benefits on supported hardware.
    2145a8b6
pdf_parse_union_core_v2.py 14.6 KB