• Kaiwen Liu's avatar
    feat(model inference): add table recognition and conversion to LaTeX (#284) · 37925f36
    Kaiwen Liu authored
    * # add table recognition using struct-eqtable
    ## Changelog
    31/07/20204
    - Support table recognition. Table images will be converted into html.
    
    ### how to use the new feature:
    set the attribute 'table-mode' to 'true' in magic-pdf.json
    
    ### caution:
    it takes 200s to 500s to convert a single table image using cpu
    
    * # add table recognition using struct-eqtable
    ## Changelog
    31/07/20204
    - Support table recognition. Table images will be converted into LaTex.
    
    ### how to use the new feature:
    set the attribute 'table-mode' to 'true' in magic-pdf.json
    
    ### caution:
    it takes 200s to 500s to convert a single table image using cpu
    
    * # feat(model inference): add table recognition and convertion to LaTeX
    
    # What's Changed
    
    ### New Features
    
    - Add table content recognition, we use weights of [StructEqTable](https://github.com/UniModal4Reasoning/StructEqTable-Deploy) to convert table image to LaTex.
    
    ### Instruction
    
    - pip install pypandoc struct-eqtable==0.1.0
    - Download [StructEqTable weights](https://huggingface.co/wanderkid/PDF-Extract-Kit/tree/main/models/TabRec) and put it under models/ directory.
    - Edit 'table-mode' value to turn on table recognition function which is turned off by default.
    - If you did not download any models before, refer to [how to download models](docs/how_to_download_models_zh_cn.md)。
    
    * add table recognition and convertion to LaTeX
    
    * add table recognition and conversion to LaTeX
    
    * add table recognition and conversion to LaTeX
    
    * add table recognition and conversion to LaTeX
    
    ---------
    Co-authored-by: 's avatarliukaiwen <liukaiwen@pjlab.org.cn>
    37925f36
Name
Last commit
Last update
..
dict2md Loading commit data...
filter Loading commit data...
layout Loading commit data...
libs Loading commit data...
model Loading commit data...
para Loading commit data...
pipe Loading commit data...
post_proc Loading commit data...
pre_proc Loading commit data...
resources Loading commit data...
rw Loading commit data...
spark Loading commit data...
tools Loading commit data...
__init__.py Loading commit data...
pdf_parse_by_ocr.py Loading commit data...
pdf_parse_by_txt.py Loading commit data...
pdf_parse_union_core.py Loading commit data...
user_api.py Loading commit data...