I am using a template ipython notebook "1. Working with data from the Web of Science".
Several files from Web of Science are located in a folder ("data/ds"). So I write
corpus = wos.read('data/ds/')
This returns an error:
`ValueError Traceback (most recent call last)
in ()
----> 1 corpus = wos.read('data/ds/')
/home/anaconda2/lib/python2.7/site-packages/tethne/readers/wos.pyc in read(path, corpus, index_by, streaming, parse_only, corpus_class, **kwargs)
393 papers += read(os.path.join(path, sname),
394 corpus=False,
--> 395 parse_only=parse_only)
396 else: # A single data file.
397 papers = WoSParser(path).parse(parse_only=parse_only)
/home/anaconda2/lib/python2.7/site-packages/tethne/readers/wos.pyc in read(path, corpus, index_by, streaming, parse_only, corpus_class, **kwargs)
395 parse_only=parse_only)
396 else: # A single data file.
--> 397 papers = WoSParser(path).parse(parse_only=parse_only)
398
399 if corpus:
/home/anaconda2/lib/python2.7/site-packages/tethne/readers/base.pyc in parse(self, parse_only)
123 break
124
--> 125 self.handle(tag, data)
126 self.last_tag = tag
127 return self.data
/home/anaconda2/lib/python2.7/site-packages/tethne/readers/base.pyc in handle(self, tag, data)
164 handler = self._get_handler(tag)
165 if handler is not None:
--> 166 data = handler(data)
167
168 if tag in self.tags: # Rename the field.
/home/anaconda2/lib/python2.7/site-packages/tethne/readers/wos.pyc in handle_PY(self, value)
136 WoS publication years are cast to integers.
137 """
--> 138 return int(value)
139
140 def handle_AU(self, value):
ValueError: invalid literal for int() with base 10: 'Z8 0'`
I am using a template ipython notebook "1. Working with data from the Web of Science". Several files from Web of Science are located in a folder ("data/ds"). So I write
corpus = wos.read('data/ds/')
This returns an error:
`ValueError Traceback (most recent call last)