Closed ggservice007 closed 7 months ago
Use the tabula-py the extract table from table.
https://github.com/chezou/tabula-py
def get_table_tabula_py(): import tabula print("使用tabula来提取表格") filename = "财务报销管理细则-V1.00-202201.pdf" dfs = tabula.read_pdf(filename, pages='all') for i in range(len(dfs)): df = dfs[i] print(f"第{i + 1}表格") print(df) print("\n") print('处理结束') if __name__ == '__main__': get_table_tabula_py()
The effect is not very satisfactory.
@ggservice007 Please track in same issue with another comments
what
Use the tabula-py the extract table from table.
github
https://github.com/chezou/tabula-py
code
result
conclusion
The effect is not very satisfactory.
related issues
42