commonsearch / cosr-back

Backend of Common Search. Analyses webpages and sends them to the index.
https://about.commonsearch.org
Apache License 2.0
123 stars 24 forks source link

Avoid indexing data URIs for images #21

Closed sylvinus closed 8 years ago

sylvinus commented 8 years ago

Currently for images we index the alt attribute as well as the filename.

However we don't exclude Data URIs, which we should do because it makes no sense to index that.