Open pjrinaldi opened 5 years ago
I like the idea of using catdoc and docx2txt to dump the readable text and then display that content. I could also use xls2csv, catppt, and similar utilities to display the readable text. Images would be lost, but I could carve those out separately.
There is also Apache Tika, which is a Java toolkit. I would have to call it from the command line to parse the files into something I could display.
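A rough sketch of what shelling out to these utilities could look like. It is written in Python only for illustration and is not the project's code. The `CONVERTERS` table and the `build_command` and `dump_text` helpers are hypothetical names, and the sketch assumes each tool writes its text to stdout when given a file path (with docx2txt needing a trailing `-` to do so).

```python
import shutil
import subprocess
from pathlib import Path

# Hypothetical mapping from file extension to the utility that dumps its text.
CONVERTERS = {
    ".doc": ["catdoc"],
    ".xls": ["xls2csv"],
    ".ppt": ["catppt"],
    ".docx": ["docx2txt"],
}


def build_command(path):
    """Return the argument list for the matching utility, or None."""
    ext = Path(path).suffix.lower()
    tool = CONVERTERS.get(ext)
    if tool is None:
        return None
    cmd = tool + [str(path)]
    if ext == ".docx":
        # Assumption: "-" as the output name makes docx2txt print to stdout.
        cmd.append("-")
    return cmd


def dump_text(path):
    """Run the utility and return its stdout, or None if it cannot be run."""
    cmd = build_command(path)
    if cmd is None or shutil.which(cmd[0]) is None:
        return None  # unsupported extension or tool not installed
    result = subprocess.run(
        cmd, capture_output=True, text=True, errors="replace", check=False
    )
    return result.stdout if result.returncode == 0 else None
```

Returning `None` when a tool is missing lets the caller fall back to another viewer (a hex view, for example) instead of failing.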
I implemented code to pull out the readable text from a word document and display it in the plain text viewer. I will do the same for pptx, xlsx, html, and every other artifact.
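For reference, a minimal sketch of this kind of extraction for docx, written in Python for illustration rather than taken from the implementation mentioned above. It relies on a docx file being a ZIP archive whose main body lives in `word/document.xml`, with text stored in `w:t` elements inside `w:p` paragraphs. The function name `docx_text` is hypothetical, and the sketch ignores headers, footers, footnotes, and tables' layout.

```python
import io
import xml.etree.ElementTree as ET
import zipfile

# WordprocessingML namespace used by the w:p and w:t elements.
W_NS = "http://schemas.openxmlformats.org/wordprocessingml/2006/main"


def docx_text(data):
    """Return the readable text of a docx given its raw bytes."""
    with zipfile.ZipFile(io.BytesIO(data)) as archive:
        root = ET.fromstring(archive.read("word/document.xml"))
    paragraphs = []
    for para in root.iter(f"{{{W_NS}}}p"):
        # A paragraph's text is split across runs; join the w:t pieces.
        runs = [node.text or "" for node in para.iter(f"{{{W_NS}}}t")]
        paragraphs.append("".join(runs))
    return "\n".join(paragraphs)
```

Taking raw bytes instead of a path suits the case where the file content is read out of a forensic image and never exists on disk.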
doc, xls, and ppt are CFB (Compound File Binary) files, so I should be able to reuse my existing CFB code in the viewers to pull out the text.
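One way a viewer could decide which path to take is to check the container signature before parsing. The sketch below is illustrative Python, and `container_kind` is a hypothetical helper. It uses the 8-byte CFB header signature for the legacy formats and the ZIP local file header signature for docx, xlsx, and pptx.

```python
# First 8 bytes of every Compound File Binary file (doc, xls, ppt).
CFB_SIGNATURE = bytes.fromhex("D0CF11E0A1B11AE1")
# ZIP local file header signature (docx, xlsx, pptx are ZIP archives).
ZIP_SIGNATURE = b"PK\x03\x04"


def container_kind(data):
    """Classify raw file bytes as 'cfb', 'zip', or 'unknown'."""
    if data[:8] == CFB_SIGNATURE:
        return "cfb"
    if data[:4] == ZIP_SIGNATURE:
        return "zip"
    return "unknown"
```

Checking the signature rather than the extension matters for forensic artifacts, since a renamed file would otherwise be routed to the wrong parser.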
Implement viewers for doc, docx, xls, xlsx, ppt, pptx, etc.