dbpedia-spotlight / wikipedia-stats-extractor

Raw Wikipedia counts for entity linking
19 stars 5 forks source link

Tokens from Sfs #3

Open nmadhire opened 9 years ago

nmadhire commented 9 years ago

Use of collect() in building the tokens from the list of surfaceforms. Tokentype needs to be serialized. Below logic should be modified,

val token = getSfs().collect().toList.flatMap( sf => lst.tokenizeUnstemmed(sf) )