- 09 Aug, 2024 26 commits
-
-
xuchao authored
-
sfk authored
-
sfk authored
-
sfk authored
edit FAQ
-
sfk authored
update doc url
-
sfk authored
update discord url
-
sfk authored
update content
-
sfk authored
update content
-
sfk authored
-
Xiaomeng Zhao authored
-
sfk authored
-
sfk authored
-
sfk authored
-
sfk authored
-
sfk authored
-
sfk authored
-
sfk authored
-
sfk authored
-
Xiaomeng Zhao authored
fix(doc-analyze): adjust image scaling limit to 9000 pixels
-
myhloli authored
Previously, images were not enlarged if their width or height exceeded 3000 pixels. This threshold has been increased to 9000 pixels to better handle high-resolutionscans and improve the analysis of documents with larger dimensions.
-
xuchao authored
-
xuchao authored
-
Xiaomeng Zhao authored
fix&refactor(pdf-extract-kit): table recognition and ocr
-
myhloli authored
fix(pdf-extract-kit): ensure table extraction success with additional ending conditionAdd an additional condition to determine the success of table extraction by checking if the latex_code ends with 'end{table}'. This extends the validation to cover table environments that may not strictly end with 'end{tabular}', thus improving the robustnessof table recognition processing.
-
myhloli authored
refactor(pdf_extract_kit): optimize image processing and table recognition logicRefactor the image processing logic for OCR and table recognition to ensure consistency and improve performance. Remove redundant initialization of PIL images, unify image cropping logic, and streamline the handling of formula detection results. Also, adjust the table recognition process to improve integration with the updated image processing logic and enhance overall efficiency.
-
icecraft authored
Co-authored-by:
shenguanlin <shenguanlin@pjlab.org.cn>
-
- 08 Aug, 2024 2 commits
- 07 Aug, 2024 11 commits
-
-
github-actions[bot] authored
-
Xiaomeng Zhao authored
feat: add table recognition success detect
-
liukaiwen authored
-
liukaiwen authored
-
myhloli authored
Add an exclamation mark to the section title to stress the importance of completing the additional steps after downloading a model. This change is made in the Chinese documentation to ensure users are aware of the necessary post-download actions.
-
myhloli authored
Adjust the print statement in the how_to_download_models_zh_cn.md guide to reflect the correct model download location. The path has been updated to specify the 'models' directory where the model is actually downloaded.
-
Xiaomeng Zhao authored
feat: add table recognition success detect
-
liukaiwen authored
-
liukaiwen authored
-
赵小蒙 authored
Add a print statement to the example code in 'how_to_download_models_zh_cn.md' to output the downloaded model directory path. This enhancement aids users in locating the model files as it provides a clear indication of where they are saved on the user's file system.
-
myhloli authored
docs(readme): update acknowledgment section and project description- Streamline the Acknowledgments section in the README by removing redundant entries.- Clarify the project's current use of PyMuPDF and future plans for exploring a more permissively licensed PDF processing library in the project description. - Ensure all modifications adhere to the project's documentation standards and improve reader understanding.
-
- 06 Aug, 2024 1 commit
-
-
myhloli authored
docs(readme): update acknowledgment section and project description- Streamline the Acknowledgments section in the README by removing redundant entries.- Clarify the project's current use of PyMuPDF and future plans for exploring a more permissively licensed PDF processing library in the project description. - Ensure all modifications adhere to the project's documentation standards and improve reader understanding.
-