diging / tethne

Python module for bibliographic network analysis.
http://diging.github.io/tethne/
GNU General Public License v3.0
81 stars 32 forks source link

Error in reading all wos files in a folder #161

Closed kevinbsc closed 7 years ago

kevinbsc commented 7 years ago

I am using a template ipython notebook "1. Working with data from the Web of Science". Several files from Web of Science are located in a folder ("data/ds"). So I write

corpus = wos.read('data/ds/')

This returns an error:

`ValueError Traceback (most recent call last)

in () ----> 1 corpus = wos.read('data/ds/') /home/anaconda2/lib/python2.7/site-packages/tethne/readers/wos.pyc in read(path, corpus, index_by, streaming, parse_only, corpus_class, **kwargs) 393 papers += read(os.path.join(path, sname), 394 corpus=False, --> 395 parse_only=parse_only) 396 else: # A single data file. 397 papers = WoSParser(path).parse(parse_only=parse_only) /home/anaconda2/lib/python2.7/site-packages/tethne/readers/wos.pyc in read(path, corpus, index_by, streaming, parse_only, corpus_class, **kwargs) 395 parse_only=parse_only) 396 else: # A single data file. --> 397 papers = WoSParser(path).parse(parse_only=parse_only) 398 399 if corpus: /home/anaconda2/lib/python2.7/site-packages/tethne/readers/base.pyc in parse(self, parse_only) 123 break 124 --> 125 self.handle(tag, data) 126 self.last_tag = tag 127 return self.data /home/anaconda2/lib/python2.7/site-packages/tethne/readers/base.pyc in handle(self, tag, data) 164 handler = self._get_handler(tag) 165 if handler is not None: --> 166 data = handler(data) 167 168 if tag in self.tags: # Rename the field. /home/anaconda2/lib/python2.7/site-packages/tethne/readers/wos.pyc in handle_PY(self, value) 136 WoS publication years are cast to integers. 137 """ --> 138 return int(value) 139 140 def handle_AU(self, value): ValueError: invalid literal for int() with base 10: 'Z8 0'`
erickpeirson commented 7 years ago

@kevinbsc Did you figure out what went wrong?

kevinbsc commented 7 years ago

I figured it out. Thanks for getting back to me.