Ghirensics / ghiro

Automated image forensics tool
http://getghiro.org
483 stars 96 forks source link

Thoughts about using Apache Tika for file detection and also for EXIF parsing? #10

Open chrismattmann opened 9 years ago

chrismattmann commented 9 years ago

Hey everyone - Apache Tika has file detection support for 1400+ formats (and growing), and also support for parsing image EXIF metadata and so forth. There is a Tika-Python also. Any interest in using Apache Tika?

jekil commented 9 years ago

In ghiro 0.3 we refactored the module architecure to allow many modules contribute with the same info (i.e. exif). In ghiro 0.4 is planned to develop support to fetch data with many other libraries (tika is one of them). Thank you really much for this feature request, really appreciated, it will be planned for ghiro 0.4

chrismattmann commented 9 years ago

thanks @jekil I will look at if I can help on this. Please let me know what code I should look at. Thanks!

jekil commented 9 years ago

@chrismattmann it will be awesome! Any kind of help is really appreciated! Just look at processing plugins, you just need to write one.