- 07 Aug, 2024 5 commits
-
-
liukaiwen authored
-
liukaiwen authored
-
liukaiwen authored
-
赵小蒙 authored
Add a print statement to the example code in 'how_to_download_models_zh_cn.md' to output the downloaded model directory path. This enhancement aids users in locating the model files as it provides a clear indication of where they are saved on the user's file system.
-
myhloli authored
docs(readme): update acknowledgment section and project description- Streamline the Acknowledgments section in the README by removing redundant entries.- Clarify the project's current use of PyMuPDF and future plans for exploring a more permissively licensed PDF processing library in the project description. - Ensure all modifications adhere to the project's documentation standards and improve reader understanding.
-
- 06 Aug, 2024 10 commits
-
-
myhloli authored
docs(readme): update acknowledgment section and project description- Streamline the Acknowledgments section in the README by removing redundant entries.- Clarify the project's current use of PyMuPDF and future plans for exploring a more permissively licensed PDF processing library in the project description. - Ensure all modifications adhere to the project's documentation standards and improve reader understanding.
-
myhloli authored
docs(models-download): update steps and remove deprecated sectionsUpdate the model download instructions to reflect the current process, removing unnecessary sections and simplifying the steps. The updated guide now includesclearer instructions on installing Git LFS, downloading models from Hugging Face,and additional checks for model file completeness. This change ensures that the documentation is up-to-date and provides a streamlined experience for users downloading models.
-
myhloli authored
docs: correct path format description in Windows CUDA docsUpdate the instructions in the Windows CUDA Acceleration documentation toreflect the correct path format. Specifically, clarify that Windows paths should include the drive letter and replace backslashes with forward slashes.
-
myhloli authored
-
myhloli authored
Update the URLs for downloading the `magic-pdf.template.json` and `small_ocr.pdf` files in the Windows CUDA acceleration guides. The links now point to the giteerepository instead of GitHub, ensuring users have access to the necessary files from the correct source.
-
myhloli authored
- Streamline the installation process by removing the redundant apt update step. - Adjust the numbering of installation steps throughout the document. - Update download URLs to gitee for the configuration template and demo file. - Ensure consistency in the model directory configuration advice.
-
myhloli authored
Add a new README_Ubuntu_CUDA_Acceleration_en_US.md document to provide users with a setup guide for enabling and testing CUDA acceleration on Ubuntu 22.04 LTS. The guideincludes steps to check and install NVIDIA drivers, install Anaconda, create a conda environment, install required applications, download and verify models, configure theenvironment, and test CUDA acceleration. This addition addresses the need for clear, concise instructions on achieving better performance with CUDA-enabled graphics cards and
-
myhloli authored
- Note the fix in version 0.6.2b1 for the network error during the first run of offline deployment and clarify the model download requirement. - Update the dependency installation guide for users on macOS with Intel CPUs. - Indicate the resolution in version 0.6.2b1 for compatibility issues with paddlepaddle version 2.6.1 on certain Linux systems. This change aims to make the FAQ more informative and easier to navigate for users experiencing similar issues, providing direct solutions and links where applicable.
-
myhloli authored
docs: add conda install steps for environment setupAdd detailed steps on how to create a conda environment and activate it beforeproceeding with the pip installation of magic-pdf and required dependencies. This provides users with a clearer guide on setting up their environment.
-
myhloli authored
-
- 05 Aug, 2024 5 commits
-
-
myhloli authored
Update the download links for Anaconda in both Ubuntu and Windows CUDA Acceleration documents to use the Tuna mirror. This change helps ensure that users in China have faster access to the Anaconda distribution.
-
Xiaomeng Zhao authored
[fix bug] table recognition bug fixed#321
-
liukaiwen authored
-
liukaiwen authored
-
liukaiwen authored
# Conflicts: # docs/how_to_download_models_zh_cn.md
-
- 04 Aug, 2024 5 commits
-
-
myhloli authored
-
myhloli authored
If 'table-config' is not present in the configuration file, the table recognition feature will default to being disabled to ensure consistent behavior. This change adds a warning log and sets a default configuration for table recognition when the expected config is missing.
-
myhloli authored
Ensure proper formatting of inline equations by adding spaces outside the equation delimitersto prevent markdown from interpreting the equation content as part of a link. This addresses the issue where inline OCR equations appear without the correct markdown formatting.
-
myhloli authored
The restriction on the matplotlib version has been updated to only apply on Windows platforms, where precompiled packages are not available starting from version 3.9.1. This change enables users on Linux and macOS to install newer versions of matplotlib, addressing compatibility issues with recent bug fixes.
-
myhloli authored
fix(dependencies): remove unnecessary pypandoc and struct-eqtable packages;fix matplotlib>=3.9.1 not support Windows system without compilation environment.
-
- 02 Aug, 2024 15 commits
-
-
myhloli authored
Update the README documents to clarify that the "models-dir" in the configuration should be an absolute path. Also, provide additional guidance for Windows users on how to correctly format the path to avoid common issues with path escaping in JSON files.
-
myhloli authored
Add instructions to download the magic-pdf.template.json file using wget on Ubuntu and a PowerShell script on Windows in the respective README files. This is to facilitate the setup process by providing direct download options, replacing manual file transfers.
-
sfk authored
add demo url
-
myhloli authored
Add a note in the README_zh-CN_v2.md to clarify the availability of GPU acceleration for devices supporting CUDA, directing users to specific tutorials based on their operating system.
-
myhloli authored
Update the install commands in both Ubuntu and Windows CUDA Acceleration guides to specify Magic-PDF version 0.6.2b1, ensuring consistency andavoiding potential version mismatches.
-
myhloli authored
Update the FAQ to clarify the dependency installation issue when using magic-pdf. Ensure users are directed to install the specific version of magic-pdf that resolves the dependency error, rather than listing all individual dependencies. This simplifies the troubleshooting process and provides a direct solution for users encountering the "Required dependency not installed" error.
-
myhloli authored
Update the download methods for models in the Chinese documentation to reflect the latest options available from ModelScope. Simplify the section titles and revise download instructions for clarity and consistency.
-
myhloli authored
-
myhloli authored
Update the documentation to reflect the latest model download procedures, emphasis on model file integrity checks, and expanded instructions for setting up CUDA accelerationon Ubuntu and Windows environments. The README files for various OS have been enhanced with additional details to assist users in configuring and verifying their environments for optimal performance.
-
xuchao authored
-
xuchao authored
-
xuchao authored
-
myhloli authored
Update the Chinese documentation to include detailed steps for installingmagic-pdf using CPU and GPU. These updates clarify the process for end users, addressing common issues such as configuration file placement and model weight file downloads. The documentation now provides users with direct links and version validation steps to ensure a smoother installation experience.
-
Kaiwen Liu authored
* # add table recognition using struct-eqtable ## Changelog 31/07/20204 - Support table recognition. Table images will be converted into html. ### how to use the new feature: set the attribute 'table-mode' to 'true' in magic-pdf.json ### caution: it takes 200s to 500s to convert a single table image using cpu * # add table recognition using struct-eqtable ## Changelog 31/07/20204 - Support table recognition. Table images will be converted into LaTex. ### how to use the new feature: set the attribute 'table-mode' to 'true' in magic-pdf.json ### caution: it takes 200s to 500s to convert a single table image using cpu * # feat(model inference): add table recognition and convertion to LaTeX # What's Changed ### New Features - Add table content recognition, we use weights of [StructEqTable](https://github.com/UniModal4Reasoning/StructEqTable-Deploy) to convert table image to LaTex. ### Instruction - pip install pypandoc struct-eqtable==0.1.0 - Download [StructEqTable weights](https://huggingface.co/wanderkid/PDF-Extract-Kit/tree/main/models/TabRec) and put it under models/ directory. - Edit 'table-mode' value to turn on table recognition function which is turned off by default. - If you did not download any models before, refer to [how to download models](docs/how_to_download_models_zh_cn.md)。 * add table recognition and convertion to LaTeX * add table recognition and conversion to LaTeX * add table recognition and conversion to LaTeX * add table recognition and conversion to LaTeX --------- Co-authored-by:
liukaiwen <liukaiwen@pjlab.org.cn>
-
myhloli authored
- Fix the description of the 'poly' coordinate format in the output file documentation to correctly reflect the order of coordinates: left-top, right-top, right-bottom, left-bottom. - Update various table-related descriptions for clarity and consistency, including field names and their corresponding explanations. - Add version name field description in 'middle.json' structure to document the version of the magic-pdf used in the parsing process. - Refactor the block and line description tables to improve readability and alignment with the rest of the documentation.
-