ipfs-search / ipfs-tika

Java web application taking IPFS hashes, extracting (textual) content and metadata through Apache's Tika.
GNU Affero General Public License v3.0
32 stars 5 forks source link

cryptographic hash of their contents #3

Closed arni077 closed 5 years ago

arni077 commented 6 years ago

how this algo know to infer from the hash the contents of the website? if someone search "sport" then how the code know that the website contain the word "sport" ?

dokterbob commented 5 years ago

Check out Apache's Tika. It downloads the file and processes it with Tika, which yields the contents which are then indexed by Elasticsearch.