Open Silence-o0 opened 1 month ago
Develop a formatter to parse PDF and DOCX files, extract text and tables while handling complex layouts.
Note: It presumably can be implemented using two different approaches.
@GeorgyPetriv please write your high level thoughts here (what should be done and how)
Develop a formatter to parse PDF and DOCX files, extract text and tables while handling complex layouts.
Note: It presumably can be implemented using two different approaches.