gkunter / coquery

Coquery is a free corpus query tool for linguists, lexicographers, translators, and anybody who wishes to search and analyse a text corpus.
GNU General Public License v3.0
18 stars 4 forks source link

Corpus builder broken #161

Closed gkunter closed 8 years ago

gkunter commented 8 years ago

Originally reported by: gkunter (Bitbucket: gkunter, GitHub: gkunter)


The corpus builder is broken. If POS tagging is turned on, the build is slow, indicating that the NLTK methods are indeed being used. However, the resulting corpus module doesn't seem to have POS activated.

If POS tagging is turned off, a KeyError(?) excpetion is raised.


gkunter commented 8 years ago

Original comment by gkunter (Bitbucket: gkunter, GitHub: gkunter):


Fix POS tagging. Resolve Issue #161

gkunter commented 8 years ago

Original comment by gkunter (Bitbucket: gkunter, GitHub: gkunter):


Can't replicate. This issue may be related to Issue #160.

gkunter commented 8 years ago

Original comment by gkunter (Bitbucket: gkunter, GitHub: gkunter):


#!python
 corpusbuilder.py, line 2086: build
   corpusbuilder.py, line 1441: build_load_files
     corpusbuilder.py, line 1421: process_file
       corpusbuilder.py, line 1162: process_text_file
         corpusbuilder.py, line 1196: add_token
           corpusbuilder.py, line 446: get_or_insert
             corpusbuilder.py, line 421: add
             > self._add_cache[tuple([row[x] for x in self._row_order])] = (self._current_id, row)