nltk / nltk_data

NLTK Data

Adding Norwegian Dependency Treebank to nltk #86

Closed Bsmil3y closed 5 years ago

Bsmil3y commented 7 years ago

Hi, I was hoping to add a Norwegian corpus to NLTK.

Full name of corpus: Norwegian Dependency Treebank

source of the corpus: http://www.nb.no/sprakbanken/show?serial=sbr-10

NLTK name for the corpus: ndt

identify an existing corpus reader for the corpus, or else explain how you plan to create one: No existing corpus reader fully exposes the content of the corpus to the user, so I have created one.

document the fact that the corpus is freely redistributable: http://www.nb.no/sbfil/dok/LICENSE_NDT.txt

(Files: corpus: https://github.com/Bsmil3y/nltk_data, reader: https://github.com/Bsmil3y/nltk)

Bob