nltk / nltk_data

NLTK Data
1.4k stars 1.03k forks source link

Add bcp47 data for handling language tags #191

Closed ekaf closed 1 year ago

ekaf commented 1 year ago

Add databases for handling language tags according to the BCP-47 specification (https://www.rfc-editor.org/bcp/bcp47.html).

This package is necessary for the bcp47.py CorpusReader from PR #3060 in NLTK (https://github.com/nltk/nltk/pull/3060).