- 28 Oct, 2024 1 commit
-
-
myhloli authored
- Add changelog for v0.9.0 release with major refactoring and improvements - Update key features list to include new functionalities - Modify system requirements and hardware support information - Add section for deploying derived projects - Update known issues and TODO list
-
- 27 Oct, 2024 2 commits
-
-
myhloli authored
- Modify the logic for splitting wide blocks exceeding 0.4 page width - Remove the specific case for blocks exceeding 0.25 page width - Add comments to explain the reasoning behind different splitting strategies
-
myhloli authored
- Update model download instructions for versions 0.9.x and later - Simplify demo scripts by removing unnecessary model configuration - Add visualization function to draw bounding boxes - Update CLI help message with new URL
-
- 26 Oct, 2024 2 commits
-
-
Xiaomeng Zhao authored
feat(draw_bbox): update bounding box drawing for tables and images
-
myhloli authored
- Add support for drawing bounding boxes of table and image sub-blocks - Implement sorting of table blocks based on type order - Update bounding box drawing for text and title blocks - Refactor code to handle different block types and their sub-blocks
-
- 25 Oct, 2024 11 commits
-
-
Xiaomeng Zhao authored
fix: add init to magic_pdf.utils
-
myhloli authored
-
Xiaomeng Zhao authored
fix: add init to magic_pdf.config
-
myhloli authored
-
Xiaomeng Zhao authored
refactor(ocr): adjust OCR processing parameters
-
myhloli authored
- Lower the Y-axis overlap threshold for merging spans into lines from0.6 to 0.5 - Reduce the unclip ratio for OCR detection from 2.4 to 1.8
-
Xiaomeng Zhao authored
refactor(parse_core): improve image and table block handling
-
myhloli authored
- Split image and table blocks into separate categories - Add group_id to image and table blocks- Update block processing logic to handle new categories - Modify layout splitting and span filling to accommodate new block types - Adjust block indexing and sorting to consider new structures
-
Xiaomeng Zhao authored
Feat/new table caption match
-
icecraft authored
-
icecraft authored
-
- 24 Oct, 2024 5 commits
-
-
myhloli authored
- Changed the confidence threshold from0.15 to 0.25 in the DocLayout_YOLO model prediction - This adjustment aims to improve the accuracy of layout detection by filtering out low-confidence predictions
-
Xiaomeng Zhao authored
Feat/data api
-
icecraft authored
-
icecraft authored
-
icecraft authored
feat: add Data api
-
- 23 Oct, 2024 10 commits
-
-
Xiaomeng Zhao authored
feat: add support for non-PDF file conversion to PDF
-
myhloli authored
- Implement to_pdf function to convert non-PDF files to PDF format - Integrate file upload functionality for PDF and image files- Update UI to include file upload component and PDF preview - Add conversion button and update its functionality to handle new file types
-
Xiaomeng Zhao authored
build(docker): add doclayout-yolo dependency
-
myhloli authored
- Add doclayout-yolo==0.0.2 to requirements-docker.txt
-
Xiaomeng Zhao authored
build(setup): add doclayout_yolo dependency
-
myhloli authored
- Add doclayout_yolo==0.0.2 to the list of dependencies in setup.py
-
Xiaomeng Zhao authored
feat(model): add support for DocLayout-YOLO model
-
myhloli authored
- Add new layout model option: DocLayout-YOLO - Implement model initialization and prediction for DocLayout-YOLO - Update configuration options to include new model- Modify existing code to support both LayoutLMv3 and DocLayout-YOLO models - Update Gradio app to support more Custom Switch
-
Xiaomeng Zhao authored
update:update config json
-
myhloli authored
-
- 21 Oct, 2024 5 commits
-
-
Xiaomeng Zhao authored
docs:Update the driver requirements on the Ubuntu system.
-
myhloli authored
-
myhloli authored
- Modified the condition to include List and Index block types- This change enhances the function's capability to process different paragraph types
-
Xiaomeng Zhao authored
refactor(para): improve paragraph splitting algorithm
-
myhloli authored
- Adjust the threshold for identifying index blocks from 3 lines to 2 lines - Add a new function __is_list_group to detect if a group of blocks is a list - Modify the paragraph merging logic to handle list groups differently
-
- 18 Oct, 2024 1 commit
-
-
myhloli authored
- Remove unused parameters parse_type and lang from various functions - Simplify function calls by removing unnecessary arguments - Update related files to reflect these changes
-
- 17 Oct, 2024 2 commits
-
-
Xiaomeng Zhao authored
refactor(ocr):Increase the dilation factor in OCR to address the issue of word concatenation.
-
myhloli authored
- Remove unused functions such as split_long_words, ocr_mk_mm_markdown_with_para, etc. - Simplify ocr_mk_markdown_with_para_core_v2 by removing unnecessary language detection and word splitting logic- Remove wordninja dependency from requirements - Update ocr_model_init to include additional parameters for OCR model configuration
-
- 16 Oct, 2024 1 commit
-
-
Xiaomeng Zhao authored
update example files
-