Update README.md

Unverified Commit ac869888 authored Sep 27, 2024 by Xiaomeng Zhao Committed by GitHub Sep 27, 2024

Showing with 20 additions and 7 deletions

--- a/README.md
+++ b/README.md
@@ -267,13 +267,26 @@ Usage: magic-pdf [OPTIONS]
 Options:
  -v, --version                display the version and exit
  -p, --path PATH              local pdf filepath or directory  [required]
-  -o, --output-dir TEXT        output local directory
+  -o, --output-dir PATH        output local directory  [required]
-  -m, --method [ocr|txt|auto]  the method for parsing pdf.
+  -m, --method [ocr|txt|auto]  the method for parsing pdf. ocr: using ocr
-                               ocr: using ocr technique to extract information from pdf,
+                               technique to extract information from pdf. txt:
-                               txt: suitable for the text-based pdf only and outperform ocr,
+                               suitable for the text-based pdf only and
-                               auto: automatically choose the best method for parsing pdf
+                               outperform ocr. auto: automatically choose the
-                                  from ocr and txt.
+                               best method for parsing pdf from ocr and txt.
-                               without method specified, auto will be used by default.
+                               without method specified, auto will be used by
+                               default.
+  -l, --lang TEXT              Input the languages in the pdf (if known) to
+                               improve OCR accuracy.  Optional. You should
+                               input "Abbreviation" with language form url: ht
+                               tps://paddlepaddle.github.io/PaddleOCR/en/ppocr
+                               /blog/multi_languages.html#5-support-languages-
+                               and-abbreviations
+  -d, --debug BOOLEAN          Enables detailed debugging information during
+                               the execution of the CLI commands.
+  -s, --start INTEGER          The starting page for PDF parsing, beginning
+                               from 0.
+  -e, --end INTEGER            The ending page for PDF parsing, beginning from
+                               0.
  --help                       Show this message and exit.
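
To illustrate how the options documented in this hunk fit together, here is a minimal sketch that assembles a `magic-pdf` argument list in Python. The helper name `build_magic_pdf_args`, the file names, and the language abbreviation `en` are assumptions made for illustration; only the flags themselves (`-p`, `-o`, `-m`, `-l`, `-s`, `-e`) come from the help text above. The help text does not say whether the `--end` page is inclusive, so the sketch passes the value through unchanged.

```python
from typing import List, Optional

# Method names as listed in the help text above.
VALID_METHODS = ("ocr", "txt", "auto")


def build_magic_pdf_args(
    pdf_path: str,
    output_dir: str,
    method: str = "auto",
    lang: Optional[str] = None,
    start: Optional[int] = None,
    end: Optional[int] = None,
) -> List[str]:
    """Hypothetical helper: build an argv list for the magic-pdf CLI."""
    if method not in VALID_METHODS:
        raise ValueError(f"method must be one of {VALID_METHODS}, got {method!r}")

    # -p and -o are marked [required] in the updated help text.
    args = ["magic-pdf", "-p", pdf_path, "-o", output_dir, "-m", method]

    # -l is optional; the value should be a PaddleOCR language abbreviation.
    if lang is not None:
        args += ["-l", lang]

    # Page numbers are documented as beginning from 0.
    for flag, page in (("-s", start), ("-e", end)):
        if page is not None:
            if page < 0:
                raise ValueError("page numbers begin from 0")
            args += [flag, str(page)]

    return args


if __name__ == "__main__":
    # Print the command instead of running it, so the sketch works
    # even where magic-pdf is not installed.
    print(" ".join(build_magic_pdf_args("paper.pdf", "out", lang="en", start=0, end=4)))
```

The resulting list can be handed to `subprocess.run(args, check=True)` on a machine where `magic-pdf` is installed. Building a list rather than a shell string avoids quoting problems with paths that contain spaces.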