nltk / nltk_data

NLTK Data
1.44k stars 1.04k forks source link

Update data index #220

Closed ekaf closed 1 month ago

ekaf commented 1 month ago

Rebuilt index.xml, so that the newly added packages can be downloaded.

ekaf commented 1 month ago

NB: @stevenbird, @alvations, @purificant, this PR is critical because it is not possible to test the new packages without an updated package index. I used nltk v. 3.8.1 to _make pkgindex, and it complains that it leaves 3 packages out of the index because they lack an '.xml' file: listing.csv, ptb3 and _tagsetsjson. The first two seem ok to leave out, but I'm not sure about _tagsetsjson. The corresponding pickles in help/tagsets.zip contain only simple types, and load ok with the RestrictedUnpickler. Anyway, adding the missing tagsets_json.xml file can only help.