aliabbasjp closed this issue 8 years ago
Thanks for the report. I will be able to take a look later today. Do the files in your corpus all have a .txt extension?
NO
Alright. Try making sure they are plain text, with a .txt extension, and go for it again. Your corpus can be either a folder of files, or a folder containing subfolders which contain files.
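If it helps, here is a rough sketch for checking a corpus folder before parsing. It uses only the Python standard library and is not part of corpkit; the function names `find_non_txt_files` and `add_txt_extension` are made up for illustration. It walks the folder (including subfolders), lists files that lack a .txt extension, and can append the extension. It does not check that the contents are actually plain text.

```python
import os


def find_non_txt_files(corpus_dir):
    """Return sorted paths of files under corpus_dir that lack a .txt extension.

    Hypothetical helper, not a corpkit function. Hidden files (names
    starting with a dot) are skipped so they are never renamed.
    """
    offenders = []
    for root, _dirs, files in os.walk(corpus_dir):
        for name in files:
            if name.startswith("."):
                continue
            if not name.lower().endswith(".txt"):
                offenders.append(os.path.join(root, name))
    return sorted(offenders)


def add_txt_extension(corpus_dir):
    """Append .txt to every file found by find_non_txt_files; return new paths."""
    renamed = []
    for path in find_non_txt_files(corpus_dir):
        new_path = path + ".txt"
        os.rename(path, new_path)
        renamed.append(new_path)
    return renamed
```

Running `find_non_txt_files` first and reading its output before calling `add_txt_extension` is probably the safer order, since the rename is done in place.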
Now I get the following error when running the command: parse corpus
ParserAnnotator: 173.3 sec.
NERCombinerAnnotator: 14.6 sec.
DeterministicCorefAnnotator: 8.2 sec.
TOTAL: 197.9 sec. for 45060 tokens at 227.7 tokens/sec.
Pipeline setup: 6.7 sec.
Total time for StanfordCoreNLP pipeline: 211.0 sec.
14:34:22: Parsing finished. Moving parsed files into place ...
Traceback (most recent call last):
  File "/home/domas/anaconda2/lib/python2.7/site-packages/corpkit/env.py", line 2168, in interpreter
    out = run_command(tokens)
  File "/home/domas/anaconda2/lib/python2.7/site-packages/corpkit/env.py", line 1113, in run_command
    out = command(tokens[1:])
  File "/home/domas/anaconda2/lib/python2.7/site-packages/corpkit/env.py", line 1437, in parse_corpus
    parsed = to_parse.parse(**kwargs)
  File "/home/domas/anaconda2/lib/python2.7/site-packages/corpkit/corpus.py", line 930, in parse
    **kwargs
  File "/home/domas/anaconda2/lib/python2.7/site-packages/corpkit/make.py", line 388, in make_corpus
    rename_all_files(outpath)
UnboundLocalError: local variable 'outpath' referenced before assignment
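For anyone unfamiliar with this kind of error: the code in make.py is not shown in this thread, so the following is only a generic illustration of how an UnboundLocalError typically arises, not a reconstruction of the actual bug. The function names and the "-parsed" suffix are invented for the example. The usual pattern is a local variable that is assigned on only some code paths and then used unconditionally.

```python
def last_outpath(paths):
    """Hypothetical example: 'outpath' is assigned only inside the loop."""
    for path in paths:
        outpath = path + "-parsed"
    # If 'paths' was empty, the loop body never ran and 'outpath'
    # was never bound, so this line raises UnboundLocalError.
    return outpath


def last_outpath_safe(paths):
    """Same logic, but the variable is bound on every code path."""
    outpath = None
    for path in paths:
        outpath = path + "-parsed"
    if outpath is None:
        raise ValueError("no input paths were given")
    return outpath
```

Binding the variable up front turns the confusing error into an explicit one that says what was missing.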
Closing; see #41 for the UnboundLocalError
I get the following error on an Ubuntu 14 machine