- 18 Oct, 2024 1 commit
-
-
myhloli authored
- Remove unused parameters parse_type and lang from various functions - Simplify function calls by removing unnecessary arguments - Update related files to reflect these changes
-
- 17 Oct, 2024 2 commits
-
-
Xiaomeng Zhao authored
refactor(ocr):Increase the dilation factor in OCR to address the issue of word concatenation.
-
myhloli authored
- Remove unused functions such as split_long_words, ocr_mk_mm_markdown_with_para, etc. - Simplify ocr_mk_markdown_with_para_core_v2 by removing unnecessary language detection and word splitting logic- Remove wordninja dependency from requirements - Update ocr_model_init to include additional parameters for OCR model configuration
-
- 16 Oct, 2024 4 commits
-
-
Xiaomeng Zhao authored
update example files
-
myhloli authored
-
myhloli authored
-
myhloli authored
- Improve reading order with model-based sorting- Add list recognition within text - Implement table of contents recognition - Support table recognition - Enhance code block and geometric shape recognition - Address known issues in both English and Chinese READMEs
-
- 15 Oct, 2024 6 commits
-
-
Xiaomeng Zhao authored
fix(para_split_v3): refine list block detection in paragraph splitting
-
myhloli authored
- Update list block detection logic to require at least 2 numeric start lines - Ensure the number of numeric start lines matches the number of end lines - Remove detection of non-border starting lines for simplicity
-
Xiaomeng Zhao authored
refactor(para_split_v3): merge list and index block detection
-
myhloli authored
-
myhloli authored
Increased the threshold for filling spans in blocks from 0.3 to 0.5 to improve the accuracy of block formation. This change helps refine the grouping of spans into blocks, potentially enhancing the overall structure and readability of the PDF content.
-
myhloli authored
- Combine __is_list_block() and __is_index_block() into a single function __is_list_or_index_block() - Simplify block type determination logic - Remove redundant code and improve readability - Optimize block merging process
-
- 14 Oct, 2024 4 commits
-
-
myhloli authored
Add List and Index to the list of block types being processed in the draw_bbox.py file. This inclusion ensures that these block types are handled similarly to other text-containing blocks, improving the overall document processing accuracy and consistency.
-
Xiaomeng Zhao authored
feat(list&index block): detect and merge list and index blocks
-
myhloli authored
- Add detection for list and index blocks in OCR processing- Implement merging of list and index blocks across pages - Update block types to include list and index categories - Adjust text merging logic to handle new block types - Modify layout drawing to distinguish list and index blocks
-
icecraft authored
* feat: manager docs with sphinx * fix: readthedocs configure * feat: support multiple language * fix: add .readthedocs.yaml * fix: requirments.txt path --------- Co-authored-by:
icecraft <xurui1@pjlab.org.cn>
-
- 10 Oct, 2024 6 commits
-
-
Xiaomeng Zhao authored
fix: Solving the Grouping Anomaly Issue with Multiple Consecutive Non-Text Blocks
-
myhloli authored
-
Xiaomeng Zhao authored
Update how_to_download_models_zh_cn.md
-
Xiaomeng Zhao authored
-
Xiaomeng Zhao authored
feat(pdf_parse_union_core_v2): reintegrate para_split_v3 and add page range support
-
myhloli authored
- Reintegrate para_split_v3 into the pdf_parse_union_core_v2 process - Add support for specifying page range in doc_analyze_by_custom_model - Implement garbage collection and memory cleaning after processing - Refine image loading from PDF, including handling out-of-range pages
-
- 09 Oct, 2024 3 commits
-
-
Xiaomeng Zhao authored
-
Xiaomeng Zhao authored
Update README_Windows_CUDA_Acceleration_en_US.md
-
Xiaomeng Zhao authored
-
- 08 Oct, 2024 14 commits
-
-
Xiaomeng Zhao authored
docs: update CUDA acceleration guides and README content
-
myhloli authored
- Update GPU hardware support information in README.md and README_zh-CN.md - Enhance CUDA acceleration guides for Ubuntu and Windows - Modify README_zh-CN.md to reflect changes in GPU requirements and configurations - Update TODO list to mark semantic reading order as completed
-
myhloli authored
- Update GPU hardware support information in README.md and README_zh-CN.md - Enhance CUDA acceleration guides for Ubuntu and Windows - Modify README_zh-CN.md to reflect changes in GPU requirements and configurations - Update TODO list to mark semantic reading order as completed
-
Xiaomeng Zhao authored
docs: add filename to wget command in model download scripts
-
myhloli authored
- Update wget commands in both English and Chinese documentation to specify the filename - Improve clarity and prevent potential filename conflicts when downloading the scripts
-
Xiaomeng Zhao authored
feat(docs): automate model download and configuration
-
myhloli authored
- Add scripts to download models and update configuration file - Remove manual steps for modifying model paths - Update documentation for both ModelScope and HuggingFace model downloads - Improve user experience by automating the entire process
-
myhloli authored
- Add scripts to download models and update configuration file - Remove manual steps for modifying model paths - Update documentation for both ModelScope and HuggingFace model downloads - Improve user experience by automating the entire process
-
Xiaomeng Zhao authored
feat(layoutreader): support local model directory and improve model loading
-
myhloli authored
Added a link to the layoutreader repository in the Related Projects sections of both the README.md and README_zh-CN.md files. This addition helps to provide users with more resources and tools related to document layout analysis and processing.
-
myhloli authored
docs: update model download instructions for version 0.9.x and later- Add note about separate download for layoutreader model in version 0.9.x and later - Include example code for downloading layoutreader model using ModelScope - Clarify that previous download methods do not support updating to version 0.9.x and later
-
myhloli authored
- Add function to get local LayoutReader model directory- Check and use local model directory if available - Fall back to online model if local directory not found - Update model initialization to support local path - Refactor model loading in singleton class
-
Xiaomeng Zhao authored
fix: caption|footnote match algorithm
-
icecraft authored
-