- 24 Oct, 2024 3 commits
- 23 Oct, 2024 10 commits
-
-
Xiaomeng Zhao authored
feat: add support for non-PDF file conversion to PDF
-
myhloli authored
- Implement to_pdf function to convert non-PDF files to PDF format - Integrate file upload functionality for PDF and image files- Update UI to include file upload component and PDF preview - Add conversion button and update its functionality to handle new file types
-
Xiaomeng Zhao authored
build(docker): add doclayout-yolo dependency
-
myhloli authored
- Add doclayout-yolo==0.0.2 to requirements-docker.txt
-
Xiaomeng Zhao authored
build(setup): add doclayout_yolo dependency
-
myhloli authored
- Add doclayout_yolo==0.0.2 to the list of dependencies in setup.py
-
Xiaomeng Zhao authored
feat(model): add support for DocLayout-YOLO model
-
myhloli authored
- Add new layout model option: DocLayout-YOLO - Implement model initialization and prediction for DocLayout-YOLO - Update configuration options to include new model- Modify existing code to support both LayoutLMv3 and DocLayout-YOLO models - Update Gradio app to support more Custom Switch
-
Xiaomeng Zhao authored
update:update config json
-
myhloli authored
-
- 21 Oct, 2024 5 commits
-
-
Xiaomeng Zhao authored
docs:Update the driver requirements on the Ubuntu system.
-
myhloli authored
-
myhloli authored
- Modified the condition to include List and Index block types- This change enhances the function's capability to process different paragraph types
-
Xiaomeng Zhao authored
refactor(para): improve paragraph splitting algorithm
-
myhloli authored
- Adjust the threshold for identifying index blocks from 3 lines to 2 lines - Add a new function __is_list_group to detect if a group of blocks is a list - Modify the paragraph merging logic to handle list groups differently
-
- 18 Oct, 2024 1 commit
-
-
myhloli authored
- Remove unused parameters parse_type and lang from various functions - Simplify function calls by removing unnecessary arguments - Update related files to reflect these changes
-
- 17 Oct, 2024 2 commits
-
-
Xiaomeng Zhao authored
refactor(ocr):Increase the dilation factor in OCR to address the issue of word concatenation.
-
myhloli authored
- Remove unused functions such as split_long_words, ocr_mk_mm_markdown_with_para, etc. - Simplify ocr_mk_markdown_with_para_core_v2 by removing unnecessary language detection and word splitting logic- Remove wordninja dependency from requirements - Update ocr_model_init to include additional parameters for OCR model configuration
-
- 16 Oct, 2024 4 commits
-
-
Xiaomeng Zhao authored
update example files
-
myhloli authored
-
myhloli authored
-
myhloli authored
- Improve reading order with model-based sorting- Add list recognition within text - Implement table of contents recognition - Support table recognition - Enhance code block and geometric shape recognition - Address known issues in both English and Chinese READMEs
-
- 15 Oct, 2024 6 commits
-
-
Xiaomeng Zhao authored
fix(para_split_v3): refine list block detection in paragraph splitting
-
myhloli authored
- Update list block detection logic to require at least 2 numeric start lines - Ensure the number of numeric start lines matches the number of end lines - Remove detection of non-border starting lines for simplicity
-
Xiaomeng Zhao authored
refactor(para_split_v3): merge list and index block detection
-
myhloli authored
-
myhloli authored
Increased the threshold for filling spans in blocks from 0.3 to 0.5 to improve the accuracy of block formation. This change helps refine the grouping of spans into blocks, potentially enhancing the overall structure and readability of the PDF content.
-
myhloli authored
- Combine __is_list_block() and __is_index_block() into a single function __is_list_or_index_block() - Simplify block type determination logic - Remove redundant code and improve readability - Optimize block merging process
-
- 14 Oct, 2024 4 commits
-
-
myhloli authored
Add List and Index to the list of block types being processed in the draw_bbox.py file. This inclusion ensures that these block types are handled similarly to other text-containing blocks, improving the overall document processing accuracy and consistency.
-
Xiaomeng Zhao authored
feat(list&index block): detect and merge list and index blocks
-
myhloli authored
- Add detection for list and index blocks in OCR processing- Implement merging of list and index blocks across pages - Update block types to include list and index categories - Adjust text merging logic to handle new block types - Modify layout drawing to distinguish list and index blocks
-
icecraft authored
* feat: manager docs with sphinx * fix: readthedocs configure * feat: support multiple language * fix: add .readthedocs.yaml * fix: requirments.txt path --------- Co-authored-by:
icecraft <xurui1@pjlab.org.cn>
-
- 10 Oct, 2024 5 commits
-
-
Xiaomeng Zhao authored
fix: Solving the Grouping Anomaly Issue with Multiple Consecutive Non-Text Blocks
-
myhloli authored
-
Xiaomeng Zhao authored
Update how_to_download_models_zh_cn.md
-
Xiaomeng Zhao authored
-
Xiaomeng Zhao authored
feat(pdf_parse_union_core_v2): reintegrate para_split_v3 and add page range support
-