liukaiwen authored
# What's Changed

### New Features

- Add table content recognition, using the weights of [StructEqTable](https://github.com/UniModal4Reasoning/StructEqTable-Deploy) to convert table images to LaTeX.

### Instructions

- Run `pip install pypandoc struct-eqtable==0.1.0`.
- Download the [StructEqTable weights](https://huggingface.co/wanderkid/PDF-Extract-Kit/tree/main/models/TabRec) and put them under the `models/` directory.
- Edit the 'table-mode' value to turn on table recognition, which is turned off by default.
- If you have not downloaded any models before, refer to [how to download models](docs/how_to_download_models_zh_cn.md).
d04f3f22
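The 'table-mode' step in the commit message above could be scripted roughly as follows. This is only a minimal sketch: the commit message names the key but not the file that holds it, so the JSON format, the config path, and the value `True` are all assumptions, and `set_table_mode` is a hypothetical helper, not part of this repository.

```python
import json
import tempfile
from pathlib import Path


def set_table_mode(config_path, value):
    """Set the 'table-mode' key in a JSON config file and return the updated dict.

    Assumption: the config is a flat JSON object. Existing keys are preserved.
    """
    path = Path(config_path)
    # Start from the existing config if the file is present, otherwise from {}.
    config = json.loads(path.read_text(encoding="utf-8")) if path.exists() else {}
    config["table-mode"] = value
    path.write_text(json.dumps(config, indent=4, ensure_ascii=False), encoding="utf-8")
    return config


if __name__ == "__main__":
    # Demo against a throwaway file; "example-key" is a placeholder, not a real setting.
    with tempfile.TemporaryDirectory() as tmp:
        demo_path = Path(tmp) / "demo_config.json"
        demo_path.write_text(json.dumps({"example-key": "unchanged"}), encoding="utf-8")
        print(set_table_mode(demo_path, True))
```

Check the project's own documentation for the actual config file location and the values 'table-mode' accepts before adapting this.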
Name | Last commit | Last update |
---|---|---|
cli | ||
dict2md | ||
filter | ||
layout | ||
libs | ||
model | ||
para | ||
pipe | ||
post_proc | ||
pre_proc | ||
resources | ||
rw | ||
spark | ||
train_utils | ||
__init__.py | ||
pdf_parse_by_ocr.py | ||
pdf_parse_by_txt.py | ||
pdf_parse_for_train.py | ||
pdf_parse_union_core.py | ||
user_api.py |