1. 08 Oct, 2024 1 commit
      feat: add arXiv paper link to header and adjust PDF parsing logic · a71db703
      myhloli authored
      feat: add arXiv paper link to header and adjust PDF parsing logic
      - Add arXiv paper link to the header template for easy access to the latest research paper.
      - Modify the PDF parsing logic to handle edge cases more accurately, particularly when determining the number of lines in a block based on its height.
  2. 06 Oct, 2024 2 commits
  3. 30 Sep, 2024 6 commits
  4. 29 Sep, 2024 2 commits
  5. 28 Sep, 2024 3 commits
      refactor(magic_pdf): import model helpers directly for clarity · 42a7d792
      myhloli authored
      Update import statements in `pdf_parse_union_core_v2.py` to import
      `prepare_inputs`, `boxes2inputs`, and `parse_logits` directly from `magic_pdf.model.v3.helpers`
      instead of from `magic_pdf.model.v3`. This streamlines the imports, makes the code
      more readable, and keeps the module boundaries clean.
      refactor(pdf_parse_union_core_v2): update import paths to use new package structure · 5522d0a3
      myhloli authored
      Adapt import statements in `pdf_parse_union_core_v2.py` to reflect the updated package
      structure, changing from the `magic_pdf.v3.helpers` module to the `magic_pdf.model.v3`
      module. This ensures compatibility with the revised directory layout.
      fix(pdf_parse): handle blocks without lines and enable bf16 on compatible devices · 2145a8b6
      myhloli authored
      Blocks without lines are now correctly indexed even when they contain textual content rendered
      as images. The sorting logic has been updated to accommodate this scenario. Additionally,
      LayoutLMv3 model initialization now uses bfloat16 precision on devices that support it,
      which may improve performance.
  6. 27 Sep, 2024 26 commits
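
Two of the commits above (a71db703 and 2145a8b6) describe estimating the number of lines in a block from its height and indexing blocks that have no lines. The sketch below illustrates one way such logic could look. It is not the project's implementation: the function names, the `bbox`/`lines` dictionary layout, the `line_height` parameter, and the simple top-to-bottom sort are all assumptions made for illustration.

```python
from typing import Dict, List, Tuple

BBox = Tuple[float, float, float, float]  # (x0, y0, x1, y1), hypothetical layout


def estimate_line_count(block_height: float, line_height: float) -> int:
    """Estimate how many text lines fit in a block of the given height.

    Hypothetical helper: always returns at least 1 so that very short
    blocks still get a line to sort by.
    """
    if line_height <= 0:
        raise ValueError("line_height must be positive")
    return max(1, round(block_height / line_height))


def lines_for_sorting(block: Dict, line_height: float) -> List[BBox]:
    """Return the block's line boxes, synthesizing them if none exist.

    A block whose text was rendered as an image may carry no lines.
    In that case the block box is split into equal horizontal strips.
    """
    lines = block.get("lines") or []
    if lines:
        return [tuple(line["bbox"]) for line in lines]
    x0, y0, x1, y1 = block["bbox"]
    count = estimate_line_count(y1 - y0, line_height)
    step = (y1 - y0) / count
    return [(x0, y0 + i * step, x1, y0 + (i + 1) * step) for i in range(count)]


def sort_blocks(blocks: List[Dict], line_height: float) -> List[Dict]:
    """Order blocks by the top-left corner of their first (real or synthetic) line.

    Assumption: a plain top-to-bottom, left-to-right order stands in for
    whatever ordering the real parser applies.
    """
    def key(block: Dict) -> Tuple[float, float]:
        first = lines_for_sorting(block, line_height)[0]
        return (first[1], first[0])

    return sorted(blocks, key=key)
```

With these assumptions, a block with an empty `lines` list and a `bbox` of `(0, 0, 100, 30)` yields three synthetic strips at a `line_height` of 10, so it can take part in sorting alongside blocks that have real lines.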