Unverified Commit ac869888 authored by Xiaomeng Zhao's avatar Xiaomeng Zhao Committed by GitHub

Update README.md

parent 70e3083b
...@@ -267,13 +267,26 @@ Usage: magic-pdf [OPTIONS] ...@@ -267,13 +267,26 @@ Usage: magic-pdf [OPTIONS]
Options: Options:
-v, --version display the version and exit -v, --version display the version and exit
-p, --path PATH local pdf filepath or directory [required] -p, --path PATH local pdf filepath or directory [required]
-o, --output-dir TEXT output local directory -o, --output-dir PATH output local directory [required]
-m, --method [ocr|txt|auto] the method for parsing pdf. -m, --method [ocr|txt|auto] the method for parsing pdf. ocr: using ocr
ocr: using ocr technique to extract information from pdf, technique to extract information from pdf. txt:
txt: suitable for the text-based pdf only and outperform ocr, suitable for the text-based pdf only and
auto: automatically choose the best method for parsing pdf outperform ocr. auto: automatically choose the
from ocr and txt. best method for parsing pdf from ocr and txt.
without method specified, auto will be used by default. without method specified, auto will be used by
default.
-l, --lang TEXT Input the languages in the pdf (if known) to
improve OCR accuracy. Optional. You should
input "Abbreviation" with language form url: ht
tps://paddlepaddle.github.io/PaddleOCR/en/ppocr
/blog/multi_languages.html#5-support-languages-
and-abbreviations
-d, --debug BOOLEAN Enables detailed debugging information during
the execution of the CLI commands.
-s, --start INTEGER The starting page for PDF parsing, beginning
from 0.
-e, --end INTEGER The ending page for PDF parsing, beginning from
0.
--help Show this message and exit. --help Show this message and exit.
......
Markdown is supported
0% or
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment