Commit 35a700da authored by 赵小蒙's avatar 赵小蒙

update annotate

parent 38de8d59
...@@ -41,7 +41,7 @@ def detect_invalid_chars(src_pdf_bytes: bytes) -> bool: ...@@ -41,7 +41,7 @@ def detect_invalid_chars(src_pdf_bytes: bytes) -> bool:
"""" """"
检测PDF中是否包含非法字符 检测PDF中是否包含非法字符
""" """
'''需要使用''' '''pdfminer比较慢,需要先随机抽取10页左右的sample'''
sample_docs = extract_pages(src_pdf_bytes) sample_docs = extract_pages(src_pdf_bytes)
sample_pdf_bytes = sample_docs.tobytes() sample_pdf_bytes = sample_docs.tobytes()
sample_pdf_file_like_object = BytesIO(sample_pdf_bytes) sample_pdf_file_like_object = BytesIO(sample_pdf_bytes)
......
Markdown is supported
0% or
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment