matthias-samwald / find-me-evidence

An open-source medical search engine
GNU Affero General Public License v3.0

data_source_name not set for webpages crawled with spider #21

Closed matthias-samwald closed 10 years ago

matthias-samwald commented 10 years ago

It seems that the data_source_name field is left empty (stored as an empty string) for web pages crawled with the spider, e.g., when calling

crawl("Medscape", "http://emedicine.medscape.com/home", "Evidence-based summary", 10)

matthias-samwald commented 10 years ago

This also means that no source is shown in the web interface.

[screenshot: missing data source]

@gpetz could you confirm on your local installation?

gpetz commented 10 years ago

I can confirm it:

[screenshot: untitled]

I think that a call to $crawler->set_site_name($site_name); is missing somewhere around here: https://github.com/matthias-samwald/find-me-evidence/blob/Version-1.X/retrieval_and_indexing/start_crawl.php#L185-L186
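
To illustrate the suspected cause, here is a minimal sketch, not the project's actual code. It assumes the crawler keeps the site name in a property that defaults to an empty string and copies it into data_source_name when a page is indexed. SketchCrawler, build_document() and sketch_crawl() are hypothetical names made up for this example; only set_site_name() and the crawl(...) arguments come from this thread.

```php
<?php
// Hypothetical stand-in for the project's crawler class. Only set_site_name()
// is named in this thread; everything else is assumed for illustration.
class SketchCrawler
{
    // Assumed default: an empty string until the setter is called.
    private string $site_name = '';

    public function set_site_name(string $site_name): void
    {
        $this->site_name = $site_name;
    }

    // Builds the record that would be indexed for one crawled page.
    public function build_document(string $url): array
    {
        return [
            'id' => $url,
            'data_source_name' => $this->site_name,
        ];
    }
}

// Hypothetical helper loosely mirroring the crawl(...) call above.
// $call_setter toggles the call that is suspected to be missing.
function sketch_crawl(string $site_name, string $start_url, bool $call_setter): array
{
    $crawler = new SketchCrawler();
    if ($call_setter) {
        $crawler->set_site_name($site_name);
    }
    return $crawler->build_document($start_url);
}

$without = sketch_crawl('Medscape', 'http://emedicine.medscape.com/home', false);
$with    = sketch_crawl('Medscape', 'http://emedicine.medscape.com/home', true);

var_dump($without['data_source_name']); // string(0) ""
var_dump($with['data_source_name']);    // string(8) "Medscape"
```

If the real code follows a similar pattern, adding the setter call before the crawl starts should be enough to populate the field, but that would need to be checked against start_crawl.php.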