asepaprianto / crawler4j

Automatically exported from code.google.com/p/crawler4j
0 stars 0 forks source link

Parse Binary Content #272

Closed GoogleCodeExporter closed 9 years ago

GoogleCodeExporter commented 9 years ago
Currently, there is no support for binary parsing - only a stub.

Add it using Tika:
https://code.google.com/r/terojankkila-improvements/source/detail?r=603a486f87c2
89d0699792f503a9d96826014049

Which gives us with the full power of Tika, more than a thousand different 
binary types parsing

This issue was fixed in rev hash: 1e3995cbc6d3  

Original issue reported on code.google.com by avrah...@gmail.com on 11 Aug 2014 at 8:37