jonathansick / ads_bibdesk

(Unmaintained) Mac OS X service for frictionless import of NASA ADS and arXiv publications into BibDesk.
GNU General Public License v3.0
37 stars 20 forks source link

Preprint updating problems #38

Open lmoustakas opened 10 years ago

lmoustakas commented 10 years ago

Hi Jonathan,

I'm getting persistent issues with adsbibdesk -u

  1. 2011arXiv1103.4329A has not changed
  2. 2011arXiv1103.3272N has become 2011MNRAS.416.1486N
  3. 2011arXiv1105.0006R has become 2011MNRAS.416.1983R Updating 91 entries, continue? (y/[n]) y (to prevent ADS flooding, we will wait for a while between each update, so go grab a coffee) Traceback (most recent call last): File "/Users/leonidas/bin/adsbibdesk", line 9, in load_entry_point('adsbibdesk==3.1.1', 'console_scripts', 'adsbibdesk')() File "/Users/leonidas/Library/Python/2.7/site-packages/adsbibdesk.py", line 173, in main update_arxiv(options, prefs) File "/Users/leonidas/Library/Python/2.7/site-packages/adsbibdesk.py", line 465, in update_arxiv process_articles(changed, prefs) File "/Users/leonidas/Library/Python/2.7/site-packages/adsbibdesk.py", line 192, in process_articles process_token(articleToken, prefs, bibdesk) File "/Users/leonidas/Library/Python/2.7/site-packages/adsbibdesk.py", line 245, in process_token pdf = ads.getPDF() File "/Users/leonidas/Library/Python/2.7/site-packages/adsbibdesk.py", line 1066, in getPDF parser.parse(url) File "/Users/leonidas/Library/Python/2.7/site-packages/adsbibdesk.py", line 1275, in parse self.feed(urllib2.urlopen(url).read()) File "/Library/Frameworks/EPD64.framework/Versions/7.3/lib/python2.7/HTMLParser.py", line 114, in feed self.goahead(0) File "/Library/Frameworks/EPD64.framework/Versions/7.3/lib/python2.7/HTMLParser.py", line 158, in goahead k = self.parse_starttag(i) File "/Library/Frameworks/EPD64.framework/Versions/7.3/lib/python2.7/HTMLParser.py", line 305, in parse_starttag attrvalue = self.unescape(attrvalue) File "/Library/Frameworks/EPD64.framework/Versions/7.3/lib/python2.7/HTMLParser.py", line 472, in unescape return re.sub(r"&(#?[xX]?(?:[0-9a-fA-F]+|\w{1,8}));", replaceEntities, s) File "/Library/Frameworks/EPD64.framework/Versions/7.3/lib/python2.7/re.py", line 151, in sub return _compile(pattern, flags).sub(repl, string, count) UnicodeDecodeError: 'ascii' codec can't decode byte 0xe2 in position 14: ordinal not in range(128) [leonidas-1821248:~/Dropbox/bibdesk] leonidas%

Do you have some insight? Thanks! -leonidas

jonathansick commented 10 years ago

I'm now able to ingest the article under the newest ADS to BibDesk. Can you try this?

git clone https://github.com/jonathansick/ads_bibdesk.git
cd ads_bibdesk
python setup.py install
adsbibdesk --debug -u