Open RichardLitt opened 9 years ago
A lot of this is just webscraping. We might be able to use something like Kimono. This is a problem that's been solved by other companies like Mendeley. Maybe we could something/someone that's already solved this problem.
Which sites do we want to scrape?
I haven't used Kimono yet, that might be a good call. I would say we look around for other people who have solved this using Node - it doesn't make sense to bring in to many other languages to deal with this. If it's in Python or something, we can probably just rewrite it faster by looking at their code.
Start with PLOS or PubMed.