DASISH / TERESAH

DASISH Task 2.3 Tools Registry
European Union Public License 1.1
4 stars 2 forks source link

Enable crawling via the admin interface #34

Open borsna opened 10 years ago

borsna commented 10 years ago

The crawling of "arts and humanities", "Bamboo dirt", "History Online" etc is done via a Python script at the moment. A better solution (to minimize the dependencies) would be to port this code to PHP and enable it to be initiated directly via the admin interface.

PonteIneptique commented 10 years ago

I think the first crawling function would be to be able to crawl other instances of T23 tool registry too. What do you think ?

borsna commented 10 years ago

This sounds like a good startting point, the tasks after this would be to implement other crawlers directly into the admin interface

PonteIneptique commented 10 years ago

I am a bit worried that our python code might be a little hard to convert... That will surely take a lot of time ;)

PonteIneptique commented 10 years ago

Maybe some rdf crawling fonctions as well ? DBPedia or so ?