- 25 Oct, 2024 3 commits
-
-
myhloli authored
-
myhloli authored
- Lower the Y-axis overlap threshold for merging spans into lines from 0.6 to 0.5
- Reduce the unclip ratio for OCR detection from 2.4 to 1.8
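To illustrate what a Y-axis overlap threshold controls, here is a minimal sketch, not the project's actual code. The helper names `y_overlap_ratio` and `merge_spans_into_lines`, the `(x0, y0, x1, y1)` box layout, normalizing by the shorter box, and the strict `>` comparison are all assumptions made for this example.

```python
from typing import List, Tuple

# Assumed box layout for this sketch: (x0, y0, x1, y1), y growing downward.
BBox = Tuple[float, float, float, float]


def y_overlap_ratio(a: BBox, b: BBox) -> float:
    """Vertical overlap of two boxes divided by the height of the shorter box."""
    overlap = min(a[3], b[3]) - max(a[1], b[1])
    min_height = min(a[3] - a[1], b[3] - b[1])
    if overlap <= 0 or min_height <= 0:
        return 0.0
    return overlap / min_height


def merge_spans_into_lines(spans: List[BBox], threshold: float = 0.5) -> List[List[BBox]]:
    """Group spans into lines: a span joins the current line when its
    vertical overlap with the line's last span exceeds the threshold."""
    lines: List[List[BBox]] = []
    for span in sorted(spans, key=lambda s: (s[1], s[0])):
        if lines and y_overlap_ratio(lines[-1][-1], span) > threshold:
            lines[-1].append(span)
        else:
            lines.append([span])
    return lines
```

With a lower threshold, slightly misaligned spans (for example superscripts or skewed scans) are more likely to land on the same line; with a higher one they split into separate lines.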
-
myhloli authored
- Split image and table blocks into separate categories
- Add group_id to image and table blocks
- Update block processing logic to handle new categories
- Modify layout splitting and span filling to accommodate new block types
- Adjust block indexing and sorting to consider new structures
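One way a `group_id` on image and table blocks could be used is to collect the parts that belong to the same figure or table. The sketch below is only an assumption about that usage: the dictionary keys `category` and `type` and the type names such as `image_body` and `image_caption` are hypothetical and are not taken from the commit.

```python
from collections import defaultdict
from typing import Dict, List, Tuple


def group_blocks(blocks: List[dict]) -> Dict[Tuple[str, int], List[dict]]:
    """Collect blocks that share the same (category, group_id) pair."""
    groups: Dict[Tuple[str, int], List[dict]] = defaultdict(list)
    for block in blocks:
        groups[(block["category"], block["group_id"])].append(block)
    return dict(groups)


# Hypothetical input: two image parts in one group, one table in another.
example_blocks = [
    {"category": "image", "type": "image_body", "group_id": 0},
    {"category": "image", "type": "image_caption", "group_id": 0},
    {"category": "table", "type": "table_body", "group_id": 0},
]
example_groups = group_blocks(example_blocks)
```

Keying on both the category and the id keeps an image group and a table group apart even when their ids coincide.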
-
- 24 Oct, 2024 5 commits
-
-
myhloli authored
- Changed the confidence threshold from 0.15 to 0.25 in the DocLayout_YOLO model prediction
- This adjustment aims to improve the accuracy of layout detection by filtering out low-confidence predictions
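The effect of raising a confidence threshold can be shown with a small sketch that does not depend on the model itself. The `Detection` class and `filter_by_confidence` function are hypothetical names for this illustration, and the `>=` comparison is an assumption, not the model's real interface.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Detection:
    label: str    # predicted layout class, e.g. "text" or "table"
    score: float  # model confidence in [0, 1]


def filter_by_confidence(detections: List[Detection], conf: float = 0.25) -> List[Detection]:
    """Keep only detections whose score reaches the confidence threshold."""
    return [d for d in detections if d.score >= conf]


# Hypothetical predictions for one page.
example_detections = [
    Detection("text", 0.90),
    Detection("figure", 0.20),
    Detection("table", 0.10),
]
kept_old = filter_by_confidence(example_detections, conf=0.15)
kept_new = filter_by_confidence(example_detections, conf=0.25)
```

In this example the detection scored 0.20 survives the old threshold but is dropped by the new one, which trades some recall for fewer spurious layout regions.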
-
Xiaomeng Zhao authored
Feat/data api
-
icecraft authored
-
icecraft authored
-
icecraft authored
feat: add Data api
-
- 23 Oct, 2024 10 commits
-
-
Xiaomeng Zhao authored
feat: add support for non-PDF file conversion to PDF
-
myhloli authored
- Implement to_pdf function to convert non-PDF files to PDF format
- Integrate file upload functionality for PDF and image files
- Update UI to include file upload component and PDF preview
- Add conversion button and update its functionality to handle new file types
-
Xiaomeng Zhao authored
build(docker): add doclayout-yolo dependency
-
myhloli authored
- Add doclayout-yolo==0.0.2 to requirements-docker.txt
-
Xiaomeng Zhao authored
build(setup): add doclayout_yolo dependency
-
myhloli authored
- Add doclayout_yolo==0.0.2 to the list of dependencies in setup.py
-
Xiaomeng Zhao authored
feat(model): add support for DocLayout-YOLO model
-
myhloli authored
- Add new layout model option: DocLayout-YOLO
- Implement model initialization and prediction for DocLayout-YOLO
- Update configuration options to include new model
- Modify existing code to support both LayoutLMv3 and DocLayout-YOLO models
- Update Gradio app to support more custom switches
-
Xiaomeng Zhao authored
update: update config json
-
myhloli authored
-
- 21 Oct, 2024 5 commits
-
-
Xiaomeng Zhao authored
docs: Update the driver requirements on the Ubuntu system.
-
myhloli authored
-
myhloli authored
- Modified the condition to include List and Index block types
- This change enhances the function's capability to process different paragraph types
-
Xiaomeng Zhao authored
refactor(para): improve paragraph splitting algorithm
-
myhloli authored
- Adjust the threshold for identifying index blocks from 3 lines to 2 lines
- Add a new function __is_list_group to detect if a group of blocks is a list
- Modify the paragraph merging logic to handle list groups differently
-
- 18 Oct, 2024 1 commit
-
-
myhloli authored
- Remove unused parameters parse_type and lang from various functions
- Simplify function calls by removing unnecessary arguments
- Update related files to reflect these changes
-
- 17 Oct, 2024 2 commits
-
-
Xiaomeng Zhao authored
refactor(ocr): Increase the dilation factor in OCR to address the issue of word concatenation.
-
myhloli authored
- Remove unused functions such as split_long_words, ocr_mk_mm_markdown_with_para, etc.
- Simplify ocr_mk_markdown_with_para_core_v2 by removing unnecessary language detection and word splitting logic
- Remove wordninja dependency from requirements
- Update ocr_model_init to include additional parameters for OCR model configuration
-
- 16 Oct, 2024 4 commits
-
-
Xiaomeng Zhao authored
update example files
-
myhloli authored
-
myhloli authored
-
myhloli authored
- Improve reading order with model-based sorting
- Add list recognition within text
- Implement table of contents recognition
- Support table recognition
- Enhance code block and geometric shape recognition
- Address known issues in both English and Chinese READMEs
-
- 15 Oct, 2024 6 commits
-
-
Xiaomeng Zhao authored
fix(para_split_v3): refine list block detection in paragraph splitting
-
myhloli authored
- Update list block detection logic to require at least 2 numeric start lines
- Ensure the number of numeric start lines matches the number of end lines
- Remove detection of non-border starting lines for simplicity
-
Xiaomeng Zhao authored
refactor(para_split_v3): merge list and index block detection
-
myhloli authored
-
myhloli authored
Increased the threshold for filling spans in blocks from 0.3 to 0.5 to improve the accuracy of block formation. This change helps refine the grouping of spans into blocks, potentially enhancing the overall structure and readability of the PDF content.
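As a rough illustration of what a span-filling threshold does, the sketch below assigns a span to a block only when a large enough share of the span's area lies inside that block. The function names, the `(x0, y0, x1, y1)` box layout, the "first matching block wins" rule, and the strict `>` comparison are assumptions for this example rather than the project's implementation.

```python
from typing import List, Tuple

# Assumed box layout for this sketch: (x0, y0, x1, y1).
BBox = Tuple[float, float, float, float]


def overlap_area_ratio(span: BBox, block: BBox) -> float:
    """Fraction of the span's area that lies inside the block."""
    span_area = (span[2] - span[0]) * (span[3] - span[1])
    if span_area <= 0:
        return 0.0
    width = min(span[2], block[2]) - max(span[0], block[0])
    height = min(span[3], block[3]) - max(span[1], block[1])
    if width <= 0 or height <= 0:
        return 0.0
    return (width * height) / span_area


def fill_spans_in_blocks(blocks: List[BBox], spans: List[BBox],
                         threshold: float = 0.5) -> List[List[BBox]]:
    """Return, for each block, the spans assigned to it.
    A span goes to the first block that contains more than `threshold` of its area."""
    filled: List[List[BBox]] = [[] for _ in blocks]
    for span in spans:
        for i, block in enumerate(blocks):
            if overlap_area_ratio(span, block) > threshold:
                filled[i].append(span)
                break
    return filled
```

Under these assumptions, a span that only straddles a block's edge (say 40% inside) is attached at a threshold of 0.3 but left out at 0.5, which keeps stray neighboring text from being pulled into the block.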
-
myhloli authored
- Combine __is_list_block() and __is_index_block() into a single function __is_list_or_index_block()
- Simplify block type determination logic
- Remove redundant code and improve readability
- Optimize block merging process
-
- 14 Oct, 2024 4 commits
-
-
myhloli authored
Add List and Index to the list of block types being processed in the draw_bbox.py file. This inclusion ensures that these block types are handled similarly to other text-containing blocks, improving the overall document processing accuracy and consistency.
-
Xiaomeng Zhao authored
feat(list&index block): detect and merge list and index blocks
-
myhloli authored
- Add detection for list and index blocks in OCR processing
- Implement merging of list and index blocks across pages
- Update block types to include list and index categories
- Adjust text merging logic to handle new block types
- Modify layout drawing to distinguish list and index blocks
-
icecraft authored
* feat: manage docs with sphinx
* fix: readthedocs configuration
* feat: support multiple languages
* fix: add .readthedocs.yaml
* fix: requirements.txt path
---------
Co-authored-by: icecraft <xurui1@pjlab.org.cn>
-