brmson / yodaqa

A Question Answering system built on top of the Apache UIMA framework.
http://ailao.eu/yodaqa
Other
619 stars 205 forks source link

I ran the wikipedia extractor and it stopped at folder IF. Did I do something wrong? #18

Closed simonhughes22 closed 8 years ago

simonhughes22 commented 8 years ago

Awesome project btw. I didn't get any errors but the extractor stopped at directory IF. Should it go to ZZ?

pasky commented 8 years ago

I think that's normal, the number of folders corresponds to size of the wiki corpus.

simonhughes22 commented 8 years ago

Ah ok thanks. I thought it was something to do with the topics. Markups comes back very clean. We've been struggling with parsing that, and I don't want to hand roll my own so thanks for sharing.

pasky commented 8 years ago

Glad it helps :)