goldideas / pdfetch

Automatically exported from code.google.com/p/pdfetch

Feature request #1

Closed GoogleCodeExporter closed 9 years ago

GoogleCodeExporter commented 9 years ago
This is fantastic. There's another way that I often access PDFs, which is
when I get the table of contents by email. I would love to be able to
use the same or a similar script from the journal's article page, for
example a page like this:
http://www.sciencedirect.com/science?_ob=ArticleURL&_udi=B6WSN-4KV343K-T&_coverDate=09%2F08%2F2006&_alid=476430865&_rdoc=1&_fmt=&_orig=search&_qd=1&_cdi=7051&_sort=d&view=c&_acct=C000050264&_version=1&_urlVersion=0&_userid=1010281&md5=0e7c6e028a817bb8795482e9b7873985
tiny url: http://tinyurl.com/yfthq8

-holly

Original issue reported on code.google.com by hbe...@gmail.com on 27 Oct 2006 at 9:26

GoogleCodeExporter commented 9 years ago
Have you tried the version with ferret (search engine) integration? It indexes
all your PDFs (including their full text) and lets you search them via the
browser. To use it, you need to run 'gem install ferret' and also have
'pdftotext' in the path (e.g., copied into the WINDOWS dir). You can find
pdftotext here: <ftp://ftp.foolabs.com/pub/xpdf/xpdf-3.01pl2-win32.zip>.
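One way to confirm that 'pdftotext' really is reachable before indexing is to walk the search path by hand. This is only a minimal sketch, not part of pdfetch: the helper name find_executable and its arguments are hypothetical, and it assumes the usual PATH and (on Windows) PATHEXT environment variables.

```ruby
# Hypothetical helper (not from pdfetch): return the full path of an
# executable found in the given directories, or nil if it is missing.
# `dirs` defaults to the entries of PATH; `exts` defaults to the entries
# of PATHEXT, which is normally set only on Windows (e.g. ".EXE;.BAT").
def find_executable(name,
                    dirs = ENV.fetch("PATH", "").split(File::PATH_SEPARATOR),
                    exts = ENV.fetch("PATHEXT", "").split(";"))
  dirs.each do |dir|
    # Try the bare name first, then the name with each known extension.
    ([""] + exts).each do |ext|
      candidate = File.join(dir, name + ext)
      return candidate if File.file?(candidate) && File.executable?(candidate)
    end
  end
  nil
end
```

Calling find_executable("pdftotext") should return a path if the tool is installed somewhere on the path, and nil otherwise, in which case the full-text indexing would presumably have nothing to work with.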

Going back to your feature request... it should actually be very easy to
download the PDF reprint from the publisher's page (which is where the script
downloads the PDF from anyway)... but it wouldn't know which PMID to assign
(which is also required by the indexing, since all the bib data come from
PubMed). So I usually look for a PubMed citation link on the publisher's page,
go there and run the script, or I go back and search for the article in PubMed
by pasting the title copied from the publisher's page. Or we should think of
another way of doing it directly, perhaps using some other info from the
publisher's page to uniquely identify the article in PubMed.
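The manual "paste the title into PubMed" step could perhaps be automated with NCBI's E-utilities ESearch service. The sketch below is an assumption about how that might look, not something pdfetch does: the method names are hypothetical, the [Title] field tag and retmax value are illustrative choices, and the response is scanned with a simple regular expression rather than a full XML parser.

```ruby
require "uri"

# NCBI E-utilities ESearch endpoint.
ESEARCH_URL = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/esearch.fcgi"

# Hypothetical helper: build an ESearch URL that searches PubMed for a
# title. The [Title] tag restricts the match to the article title field.
def pubmed_title_search_url(title)
  query = URI.encode_www_form(db: "pubmed", term: "#{title}[Title]", retmax: 5)
  "#{ESEARCH_URL}?#{query}"
end

# Pull every PMID (the <Id> elements) out of an ESearch XML response body.
def extract_pmids(xml)
  xml.scan(%r{<Id>(\d+)</Id>}).flatten
end

# Return a PMID only when the title matched exactly one record, so an
# ambiguous title never gets the wrong bibliographic data assigned.
def unique_pmid(xml)
  ids = extract_pmids(xml)
  ids.length == 1 ? ids.first : nil
end
```

The response body could be fetched with Net::HTTP.get(URI(url)) and passed to unique_pmid; when that returns nil (no match, or several), falling back to the manual PubMed search described above seems the safer choice.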

Original comment by edoardo....@gmail.com on 27 Oct 2006 at 10:11

GoogleCodeExporter commented 9 years ago

Original comment by edoardo....@gmail.com on 20 Jun 2007 at 12:11