nsaef / text_exploration

Tool for analyzing big unstructured collections of digital text documents. Master's thesis in Digital Humanities.

NER doesn't work on big corpora (java heap out of bounds) #73

Closed: nsaef closed this issue 6 years ago

nsaef commented 6 years ago

Solution: process the corpus in small batches of documents (e.g. 1000 at a time) instead of all at once.
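
A minimal sketch of that batching approach, in case it helps anyone hitting the same error. This is not the code from the repository: `batched`, `tag_corpus`, and the `tag_batch` callable are hypothetical names, and `tag_batch` stands in for whatever function actually sends a list of documents to the NER tagger.

```python
from itertools import islice
from typing import Callable, Iterable, Iterator, List


def batched(items: Iterable[str], batch_size: int = 1000) -> Iterator[List[str]]:
    """Yield successive lists of at most batch_size items."""
    if batch_size < 1:
        raise ValueError("batch_size must be at least 1")
    iterator = iter(items)
    while True:
        # Pull the next batch_size items; an empty list means we are done.
        batch = list(islice(iterator, batch_size))
        if not batch:
            return
        yield batch


def tag_corpus(
    documents: Iterable[str],
    tag_batch: Callable[[List[str]], list],  # hypothetical: wraps the real NER call
    batch_size: int = 1000,
) -> list:
    """Run tag_batch on one batch at a time and collect the results."""
    results = []
    for batch in batched(documents, batch_size):
        # Only one batch of documents is handed to the tagger per call.
        results.extend(tag_batch(batch))
    return results
```

Whether 1000 is a suitable batch size probably depends on document length and on how much heap the JVM is given, so it is worth treating as a tunable parameter.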