• Xiaomeng Zhao's avatar
    fix(pdf-extract): adjust box threshold for OCR detection (#447) · 041b9465
    Xiaomeng Zhao authored
    Tuned the detection box threshold parameter in the OCR model initialization to improve the
    accuracy of text extraction from images. The threshold was modified from 0.6 to
    0.3 to filter out smaller detection boxes, which is expected to enhance the quality of the extracted
    text by reducing noise and false positives in the OCR process.
    041b9465
Name
Last commit
Last update
.github Loading commit data...
demo Loading commit data...
docs Loading commit data...
magic_pdf Loading commit data...
signatures/version1 Loading commit data...
tests Loading commit data...
.gitignore Loading commit data...
.pre-commit-config.yaml Loading commit data...
Dockerfile Loading commit data...
LICENSE.md Loading commit data...
MinerU_CLA.md Loading commit data...
README.md Loading commit data...
README.md.bak Loading commit data...
README_ja-JP.md Loading commit data...
README_zh-CN.md Loading commit data...
README_zh-CN.md.bak Loading commit data...
magic-pdf.template.json Loading commit data...
requirements-qa.txt Loading commit data...
requirements.txt Loading commit data...
setup.py Loading commit data...
update_version.py Loading commit data...