ropensci / taxize

A taxonomic toolbelt for R
https://docs.ropensci.org/taxize
Other
268 stars 60 forks source link

International Committee on Taxonomy of Viruses #695

Open arw36 opened 6 years ago

arw36 commented 6 years ago

There are downloadedable species list, historical taxonomies, and open reports. No current API. I think @noamross has ideas on ways to make into package, but may be more appropriate to include here.

noamross commented 6 years ago

Yes, this is the project for which I was asking about time-versioned taxonomies over in the taxa repo. ICTV has no API and doesn't deliver the data in bulk, but I have a scraper that uses the website's internal API. They have a generous robots.txt and all the non-data text is CC-BY-SA 4.0. Probably the thing to do would be to set up a bulk data set of scraped data, maybe have a package for it but also point taxize to the raw data. If so, what format should the raw bulk data be in, @sckott?

sckott commented 6 years ago

thanks @arw36 and @noamross

For bulk data, tabular is of course easy to work with, but we can also parse json or xml here if needed.

sckott commented 5 years ago

should we keep this open - still going to happen or ?

noamross commented 5 years ago

This is definitely in the "need an intern/volunteer to make it happen" category.

complexgenome commented 4 years ago

I'd like to volunteer, any starting points? But, I've never contributed to opensource projects before. For sure, I'd have difficult time to understand work-flow of controls in the beginning.