issues
search
nltk
/
nltk_data
NLTK Data
1.43k
stars
1.03k
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Hindi Stopwords Missing
#223
PreyumKr
opened
1 week ago
0
polish stopwords missing
#222
GabrielaMajstrak
opened
3 weeks ago
0
German Punkt: more time units in `##number## word` pattern
#221
adbar
opened
1 month ago
0
Update data index
#220
ekaf
closed
1 month ago
1
Move the pickles to a special collection
#219
ekaf
opened
1 month ago
7
Store maxent_treebank_pos_tagger as tab files
#218
ekaf
closed
1 month ago
2
maxent_ne chunkers data stored as tab-files
#217
ekaf
closed
1 month ago
3
task: Use forked url for package download url
#216
iandesj
closed
1 month ago
0
PunktParameters stored as tab files
#215
ekaf
closed
1 month ago
2
Fixing the empty zipballs in the previous commit for tagsets_json
#214
alvations
closed
2 months ago
2
Convert the pickle tagsets dictionary to json
#213
alvations
closed
2 months ago
1
Patching classes order for russian tagger
#212
alvations
closed
2 months ago
1
Fix the class for average perceptron tagger list
#211
alvations
closed
2 months ago
1
Changed the sorted to list
#210
alvations
closed
2 months ago
1
Updated the index with the new zips for taggers
#209
alvations
closed
2 months ago
1
Added tagger in json format
#208
alvations
closed
2 months ago
4
Minor error in the RSLP's step0 file
#207
felipovysk
opened
5 months ago
0
some french stopwords are wrong (punkt)
#206
sylvan-ermit
opened
6 months ago
2
Albanian stopwords added
#205
ArditXhaferi
opened
7 months ago
1
Albanian stopwords missing
#204
ArditXhaferi
opened
7 months ago
0
not able to download stopwords
#203
jaggi01
opened
9 months ago
0
I am not able to download punkt.zip file for tokenization purpose
#202
sumitsharmatops
opened
11 months ago
4
Adding our African Stopwords
#201
chrisemezue
opened
11 months ago
0
fix(index.xml): unzip corpora so they are found by nltk.data.find
#200
tuky
closed
2 months ago
2
Added Tamil stopwords
#199
khaleeljageer
opened
1 year ago
0
Tamil stopwords were missing
#198
khaleeljageer
opened
1 year ago
0
Averaged Perceptron Tagger is free for commercial use?
#197
goseaplay
opened
1 year ago
0
Dear Jan Strunk: license of NLTK Data
#196
hiDevman
opened
1 year ago
0
Please rename Slovene stopwords to Slovenian
#195
PrimozGodec
opened
1 year ago
1
Thousands of duplicate entries in OMW packages
#194
ekaf
closed
7 months ago
2
Add Open English Wordnet 2022
#193
ekaf
closed
1 year ago
1
Server Index link is not working
#192
thewebcoder2009
closed
1 year ago
11
Add bcp47 data for handling language tags
#191
ekaf
closed
1 year ago
0
nltk 2.3.4 is not working with the zipped version of wordnet
#190
lipsa-vlad
closed
2 years ago
5
Update license for universal-pos-tags
#189
mrahtz
closed
2 years ago
2
license of punkt in nltk_data
#188
happyMindHaha
opened
2 years ago
1
Some corpora are unnecessarily unzipped
#187
ekaf
closed
2 years ago
0
Afaan Oromoo
#186
teshomegit
closed
2 years ago
1
WordNet key ERROR on ADJ for Resnik Similarity
#185
scramblingbalam
closed
2 years ago
1
Correct link for Project Gutenberg, .org not .net
#184
mattwigway
closed
2 years ago
1
Update Extended OMW
#183
ekaf
opened
2 years ago
8
Add script to automatically build critical collections
#182
tomaarsen
closed
2 years ago
1
packages/corpora/names2.zip, packages/corpora/names2.xml: creation
#181
davidam
closed
2 years ago
9
Add Extended Open Multilingual WordNet (extended_omw)
#180
ExplorerFreda
closed
2 years ago
1
Extended Open Multilingual WordNet
#179
ExplorerFreda
closed
2 years ago
1
Packages missing a corresponding .xml file
#178
ekaf
closed
2 years ago
0
Add wordnet2021.xml
#177
ekaf
closed
2 years ago
1
Add `omw-1.4.xml` to allow OMW 1.4 to be downloaded
#176
tomaarsen
closed
2 years ago
5
OMW compatibility with old NLTK versions
#175
ekaf
closed
2 years ago
0
Resolve critical installation and usage issue of inaugural data
#174
tomaarsen
closed
2 years ago
1
Next