Closed saloneerege closed 8 years ago
I am trying to crawl weapons images and since the nutch that memex uses has the mimetypes.txt to only accept text/html will I have to add images/* there so that it accepts all images as memex crawls through the seedlist ?
Hi @saloneerege. That sounds right to me. @chrismattmann - do you have some thoughts on this?
correct
I am trying to crawl weapons images and since the nutch that memex uses has the mimetypes.txt to only accept text/html will I have to add images/* there so that it accepts all images as memex crawls through the seedlist ?