fix(pdf_parse): optimize span processing by removing outside spans
- Add new function `remove_outside_spans` to filter spans based on image and table blocks
- Reorder span processing steps to improve efficiency
- Update imports to include `calculate_overlap_area_in_bbox1_area_ratio`
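
The filtering step described above might look roughly like the sketch below. This is not the code from this commit: the span layout (`type` and `bbox` keys), the separate `image_blocks` / `table_blocks` arguments, the 0.5 threshold, and the body of `calculate_overlap_area_in_bbox1_area_ratio` are all assumptions made for illustration. The helper is assumed, from its name only, to return the intersection area divided by the area of the first box.

```python
def calculate_overlap_area_in_bbox1_area_ratio(bbox1, bbox2):
    """Assumed semantics: intersection area divided by the area of bbox1.

    Boxes are (x0, y0, x1, y1) tuples.
    """
    x0 = max(bbox1[0], bbox2[0])
    y0 = max(bbox1[1], bbox2[1])
    x1 = min(bbox1[2], bbox2[2])
    y1 = min(bbox1[3], bbox2[3])
    if x1 <= x0 or y1 <= y0:
        return 0.0  # no overlap (also covers a zero-area bbox1)
    bbox1_area = (bbox1[2] - bbox1[0]) * (bbox1[3] - bbox1[1])
    return (x1 - x0) * (y1 - y0) / bbox1_area


def remove_outside_spans(spans, image_blocks, table_blocks, threshold=0.5):
    """Hypothetical sketch: drop image/table spans lying mostly outside
    every block of the same type. Other span types pass through unchanged.
    """
    blocks_by_type = {"image": image_blocks, "table": table_blocks}
    kept = []
    for span in spans:
        blocks = blocks_by_type.get(span["type"])
        if blocks is None:
            kept.append(span)  # e.g. text spans are not filtered here
        elif any(
            calculate_overlap_area_in_bbox1_area_ratio(span["bbox"], block) > threshold
            for block in blocks
        ):
            kept.append(span)
    return kept


spans = [
    {"type": "image", "bbox": (10, 10, 50, 50)},      # inside the image block
    {"type": "image", "bbox": (200, 200, 240, 240)},  # outside every image block
    {"type": "text", "bbox": (0, 300, 100, 320)},     # not an image/table span
]
filtered = remove_outside_spans(spans, image_blocks=[(0, 0, 100, 100)], table_blocks=[])
```

Running the filter early means later steps iterate over fewer spans, which is one plausible reason for the reordering mentioned above.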