Open borsna opened 10 years ago
I think the first crawling function would be to be able to crawl other instances of T23 tool registry too. What do you think ?
This sounds like a good startting point, the tasks after this would be to implement other crawlers directly into the admin interface
I am a bit worried that our python code might be a little hard to convert... That will surely take a lot of time ;)
Maybe some rdf crawling fonctions as well ? DBPedia or so ?
The crawling of "arts and humanities", "Bamboo dirt", "History Online" etc is done via a Python script at the moment. A better solution (to minimize the dependencies) would be to port this code to PHP and enable it to be initiated directly via the admin interface.