- 02 Aug, 2024 4 commits
-
-
myhloli authored
Update the documentation to reflect the latest model download procedures, emphasize model file integrity checks, and expand the instructions for setting up CUDA acceleration on Ubuntu and Windows environments. The README files for the various operating systems have been enhanced with additional details to help users configure and verify their environments for optimal performance.
-
myhloli authored
Update the Chinese documentation to include detailed steps for installing magic-pdf using CPU and GPU. These updates clarify the process for end users, addressing common issues such as configuration file placement and model weight file downloads. The documentation now provides users with direct links and version validation steps to ensure a smoother installation experience.
-
Kaiwen Liu authored
* # add table recognition using struct-eqtable
  ## Changelog 31/07/2024
  - Support table recognition. Table images will be converted into HTML.
  ### How to use the new feature
  Set the attribute 'table-mode' to 'true' in magic-pdf.json.
  ### Caution
  It takes 200s to 500s to convert a single table image using CPU.
* # add table recognition using struct-eqtable
  ## Changelog 31/07/2024
  - Support table recognition. Table images will be converted into LaTeX.
  ### How to use the new feature
  Set the attribute 'table-mode' to 'true' in magic-pdf.json.
  ### Caution
  It takes 200s to 500s to convert a single table image using CPU.
* # feat(model inference): add table recognition and conversion to LaTeX
  # What's Changed
  ### New Features
  - Add table content recognition. We use the weights of [StructEqTable](https://github.com/UniModal4Reasoning/StructEqTable-Deploy) to convert table images to LaTeX.
  ### Instruction
  - pip install pypandoc struct-eqtable==0.1.0
  - Download the [StructEqTable weights](https://huggingface.co/wanderkid/PDF-Extract-Kit/tree/main/models/TabRec) and put them under the models/ directory.
  - Set 'table-mode' to 'true' to turn on table recognition, which is turned off by default.
  - If you have not downloaded any models before, refer to [how to download models](docs/how_to_download_models_zh_cn.md).
* add table recognition and conversion to LaTeX

---------

Co-authored-by: liukaiwen <liukaiwen@pjlab.org.cn>
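As a rough illustration of the 'table-mode' switch described in this commit, the sketch below flips the flag in a magic-pdf.json-style file. The helper name `enable_table_mode` is hypothetical, and writing the value as a JSON boolean is an assumption; the commit message only says to set it to 'true'.

```python
import json
from pathlib import Path


def enable_table_mode(config_path: Path) -> dict:
    """Set 'table-mode' to true in a magic-pdf.json-style config and save it."""
    config = json.loads(config_path.read_text(encoding="utf-8"))
    # Assumption: the flag is stored as a JSON boolean rather than the string "true".
    config["table-mode"] = True
    config_path.write_text(
        json.dumps(config, indent=4, ensure_ascii=False), encoding="utf-8"
    )
    return config
```

Calling `enable_table_mode(Path.home() / "magic-pdf.json")` would update a config stored in the home directory, if that is where the file is kept. All other keys are left untouched.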
-
myhloli authored
- Fix the description of the 'poly' coordinate format in the output file documentation to correctly reflect the order of coordinates: left-top, right-top, right-bottom, left-bottom.
- Update various table-related descriptions for clarity and consistency, including field names and their corresponding explanations.
- Add a version name field description to the 'middle.json' structure to document the version of magic-pdf used in the parsing process.
- Refactor the block and line description tables to improve readability and alignment with the rest of the documentation.
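To make the corrected 'poly' ordering concrete, here is a minimal sketch that expands an axis-aligned box into the four corners in the documented order. It assumes that 'poly' is a flat list of eight numbers, that the box is given as `[x0, y0, x1, y1]` with `(x0, y0)` at the top-left, and that y grows downward; the function name `bbox_to_poly` is hypothetical.

```python
from typing import List, Sequence


def bbox_to_poly(bbox: Sequence[float]) -> List[float]:
    """Expand [x0, y0, x1, y1] into left-top, right-top, right-bottom, left-bottom."""
    x0, y0, x1, y1 = bbox
    return [
        x0, y0,  # left-top
        x1, y0,  # right-top
        x1, y1,  # right-bottom
        x0, y1,  # left-bottom
    ]


print(bbox_to_poly([10, 20, 110, 70]))
# prints [10, 20, 110, 20, 110, 70, 10, 70]
```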
-
- 01 Aug, 2024 21 commits
-
-
icecraft authored
* feat: remove dummy code, magic_pdf/cli, magic_pdf/train_utils
* feat: expose version in command line

---------

Co-authored-by: shenguanlin <shenguanlin@pjlab.org.cn>
-
myhloli authored
- Adjust the formatting of the PaddlePaddle GPU installation command.
- Clarify instruction numbering for testing OCR acceleration.
-
myhloli authored
Update the Ubuntu CUDA Acceleration setup guide to reflect the correct user directory path and improve the clarity of instructions. Remove references to Windows and macOS as they are out of scope for this document. Ensure the configuration file copying command is correctly represented for Linux users.
-
myhloli authored
Update the link to the magic-pdf.template.json configuration template file in the README_Ubuntu_CUDA_Acceleration_zh_CN.md document. The file path was previously incorrect and has been amended to point to the correct location.
-
myhloli authored
Update the instruction in README_Ubuntu_CUDA_Acceleration_zh_CN.md to reference the correct section number for downloading the model weights. This change ensures that users are directed to the correct location in the document for setting up the model directory in the magic-pdf.json configuration.
-
myhloli authored
-
myhloli authored
docs: update Ubuntu CUDA acceleration guide for version 0.6.2

- Add steps for Ubuntu 22.04 LTS installation.
- Detail the process of checking, installing, and configuring NVIDIA drivers.
- Include instructions for installing Anaconda and creating a specific environment.
- Provide guidance on installing magic-pdf and its dependencies.
- Add a note to verify the magic-pdf version and report issues if necessary.
- Describe the process of downloading models and configuring the application.
- Include a sample command to run the application with CUDA acceleration.
- Add a note on enabling OCR CUDA acceleration with specific GPU requirements.

This update ensures users have the latest information for setting up CUDA acceleration with magic-pdf on Ubuntu 22.04 LTS, specifically for version 0.6.2, and provides clearer instructions on the installation and configuration process.
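After a driver and environment setup like the one this guide describes, a quick sanity check is to ask PyTorch whether it can see a CUDA device. The sketch below is not taken from the guide itself; the helper name `cuda_available` is hypothetical, and it returns `False` instead of failing when PyTorch is not installed.

```python
from importlib.util import find_spec


def cuda_available() -> bool:
    """Return True only if PyTorch is importable and reports a usable CUDA device."""
    if find_spec("torch") is None:
        # PyTorch is not installed in this environment.
        return False
    import torch

    return bool(torch.cuda.is_available())


if __name__ == "__main__":
    print("CUDA available:", cuda_available())
```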
-
xuchao authored
-
xuchao authored
-
myhloli authored
-
myhloli authored
-
myhloli authored
Restructured the how-to download models document for better clarity and added sections on downloading models from ModelScope, including SDK and Git download methods. Provided detailed steps for installing Git LFS and checking model integrity after download. Also included recommendations for moving the models to an SSD for better performance.
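The commit message does not say how the integrity check is performed. As one possible approach, the sketch below compares a file's SHA-256 digest with an expected value; the use of SHA-256, the function names, and the source of the expected digest (for example, a value published alongside the model files) are all assumptions.

```python
import hashlib
from pathlib import Path


def sha256_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Compute the SHA-256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as fh:
        # Read in fixed-size chunks so large weight files do not fill memory.
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_checksum(path: Path, expected_hex: str) -> bool:
    """Return True if the file's digest equals the expected hex string."""
    return sha256_of(path) == expected_hex.strip().lower()
```

A mismatch usually indicates a truncated or corrupted download, in which case fetching the file again is the simplest remedy.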
-
icecraft authored
* feat: refactor cli command
* feat: add docs to describe the output files of cli
* feat: resolve review comments
* feat: update docs about middle.json

---------

Co-authored-by: shenguanlin <shenguanlin@pjlab.org.cn>
-
xuchao authored
-
drunkpig authored
Update how_to_download_models_en.md
-
myhloli authored
-
myhloli authored
-
myhloli authored
Change the URLs in the installation instructions to new mirrors that are expected to provide faster downloads for users. This update affects the installation guides for both detectron2 and magic-pdf in the Chinese documentation.
-
Richard Li authored
-
drunkpig authored
Update how_to_download_models_zh_cn.md
-
myhloli authored
Reorganize the installation instructions for Magic-PDF to clarify the dependency on detectron2 and provide a more straightforward installation process. The update includes separating the dependency installation from the package installation and adding a note about precompiled wheels for Python 3.10.

BREAKING CHANGE: The installation guide now assumes basic familiarity with detectron2 installation requirements. Users who need to compile detectron2 from source should refer to the official detectron2 documentation.
-
- 31 Jul, 2024 14 commits
-
-
Xiaomeng Zhao authored
-
myhloli authored
Update the PyTorch installation command in the README files for both English and Chinese versions to reflect the required version compatibility with CUDA 11.8. Include explicit instructions to specify the PyTorch version to avoid automatic installation of higher, unsupported versions. Additionally, clarify the importance of modifying the "device-mode" parameter in the magic-pdf.json configuration file for proper CUDA device selection.
-
myhloli authored
Update the PyTorch installation guide in both English and Chinese READMEs to explicitly recommend using torch==2.3.1 and torchvision==0.18.1 for CUDA 11.8. Emphasize the importance of specifying these versions to avoid compatibility issues with higher, unsupported versions.
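One way to confirm that an environment actually ended up with the pinned versions is to query the installed package metadata. This is a minimal sketch rather than anything from the READMEs: `check_pins` is a hypothetical helper, and stripping a local suffix such as `+cu118` before comparing is an assumption about how the CUDA wheels are labeled.

```python
from importlib.metadata import PackageNotFoundError, version
from typing import Dict, Optional

# Versions named in the commit message for CUDA 11.8.
PINNED = {"torch": "2.3.1", "torchvision": "0.18.1"}


def check_pins(pins: Dict[str, str]) -> Dict[str, Optional[str]]:
    """Return packages whose installed version differs from the pin.

    The value is the version that was found, or None if the package is missing.
    """
    mismatches: Dict[str, Optional[str]] = {}
    for name, wanted in pins.items():
        try:
            found = version(name)
        except PackageNotFoundError:
            mismatches[name] = None
            continue
        # Compare only the public version, ignoring a local part like '+cu118'.
        if found.split("+", 1)[0] != wanted:
            mismatches[name] = found
    return mismatches
```

An empty result from `check_pins(PINNED)` means both packages match the recommended versions.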
-
myhloli authored
A pre-release version, 0.6.2b1, of magic-pdf is now available. This version includes many fixes for issues recorded in our logs but has not undergone full QA testing. Users are advised to report any issues encountered or revert to version 0.6.1. The installation guides in both the Japanese and Chinese READMEs have been updated to reflect the availability of this pre-release version and the previous stable version.

BREAKING CHANGE: Installation commands now point to version 0.6.2b1 by default. Users who wish to install the stable version 0.6.1 should follow the provided command instead.
-
myhloli authored
- Change the pip install command in README_zh-CN.md to reflect the new version 0.6.2b1.
- Include a notice about the pre-release of version 0.6.2 beta, cautioning users that it has not undergone full QA testing and providing guidance on reverting to version 0.6.1.
- Add a step to verify the installed version with `magic-pdf --version` after installation, addressing feedback about incorrect versions being installed because of mirror sources and dependency conflicts.
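The `magic-pdf --version` check mentioned in this commit can also be scripted. The sketch below is an illustration only: the wrapper name `cli_version` is hypothetical, and the exact text the command prints is not assumed, so the raw output is returned as-is.

```python
import shutil
import subprocess
from typing import Optional


def cli_version(command: str = "magic-pdf") -> Optional[str]:
    """Run '<command> --version' and return its output, or None if not on PATH."""
    if shutil.which(command) is None:
        return None
    result = subprocess.run(
        [command, "--version"], capture_output=True, text=True, check=False
    )
    # Some CLIs print the version to stderr, so fall back to it if stdout is empty.
    return (result.stdout or result.stderr).strip()


if __name__ == "__main__":
    print(cli_version() or "magic-pdf is not installed in this environment")
```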
-
xuchao authored
-
myhloli authored
We have pre-released the 0.6.2 beta version, which addresses numerous issues recorded in our logs. This commit updates the installation guide in the README to include information on how to install this beta version. Users are advised that this build has not undergone full QA testing and may contain issues. Instructions for reverting to version 0.6.1 are also provided for users who encounter problems.

BREAKING CHANGE: Installation instructions now include beta version information. Users should be aware of potential issues with the 0.6.2 beta version and consider reverting to 0.6.1 if necessary.
-
myhloli authored
-
myhloli authored
-
myhloli authored
-
myhloli authored
-
myhloli authored
Add detailed instructions for installing Git LFS on Linux, macOS, and Windows to help users download models from the ModelScope repository. The guide is included in the `how_to_download_models_zh_cn.md` document.
-
github-actions[bot] authored
-
Richard Li authored
use git lfs clone to download model from ModelScope
-
- 30 Jul, 2024 1 commit
-
-
xuchao authored
-