CrossRef / pdfextract

MOVED TO https://gitlab.com/crossref/pdfextract
https://gitlab.com/crossref/pdfextract
MIT License
508 stars 89 forks source link

Running pdfextract from R? #3

Closed sckott closed 6 years ago

sckott commented 12 years ago

Any thoughts on running pdfextract from R? Perhaps an R package? Or is there a hack? This would make an awesome workflow is possible.

Thanks! Scott Chamberlain

kjw commented 12 years ago

Don't know much about R myself, but I'm open to suggestions.

After a quick search the only project I can find that bridges R <-> Ruby is RinRuby [1], but that embeds R in Ruby, the opposite of what you're asking for. Not clear to me if it provides access to Ruby objects from R.

Can R execute commands and parse XML? A solution might be an R package that wraps the pdf-extract command.

[1] https://sites.google.com/a/ddahl.org/rinruby-users/

sckott commented 12 years ago

I develop R packages with other folks to search and retrieve metadata (and full content for open access) from science journals at rOpenSci (https://github.com/ropensci). This is why I think your tool would be a nice addition to our packages. I will try to figure out how to let users call pdf-extract from R.

p.s. we also use the CrossRef API, so thanks for that!