Open arw36 opened 6 years ago
Yes, this is the project for which I was asking about time-versioned taxonomies over in the taxa repo. ICTV has no API and doesn't deliver the data in bulk, but I have a scraper that uses the website's internal API. They have a generous robots.txt and all the non-data text is CC-BY-SA 4.0.
Probably the thing to do would be to set up a bulk data set of scraped data, maybe have a package for it but also point taxize to the raw data. If so, what format should the raw bulk data be in, @sckott?
thanks @arw36 and @noamross
For bulk data, tabular is of course easy to work with, but we can also parse json or xml here if needed.
Should we keep this open? Is this still going to happen?
This is definitely in the "need an intern/volunteer to make it happen" category.
I'd like to volunteer. Are there any starting points? I've never contributed to open-source projects before, though, so I'd certainly have a hard time understanding the workflow at the beginning.
There are downloadable species lists, historical taxonomies, and open reports. No current API. I think @noamross has ideas on ways to make this into a package, but it may be more appropriate to include it here.