davidjurgens / equilid

Socially-Equitable Language Identification
Other
78 stars 16 forks source link

Missing language data #6

Open DonaldTsang opened 4 years ago

DonaldTsang commented 4 years ago

Where are N-gram or linguistic data from Equilid? Can't find it.

davidjurgens commented 4 years ago

Due to the terms of service for several datasets, we can't officially release the training data via github so there's no n-gram data in it.

On Sat, Nov 23, 2019 at 3:42 PM Donald Tsang notifications@github.com wrote:

Where are N-gram or linguistic data from Equilid? Can't find it.

— You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub https://github.com/davidjurgens/equilid/issues/6?email_source=notifications&email_token=AAHO4LP4NW5MWGMTFSWCNV3QVGIUJA5CNFSM4JQ32ZQ2YY3PNVWWK3TUL52HS4DFUVEXG43VMWVGG33NNVSW45C7NFSM4H3SUPPA, or unsubscribe https://github.com/notifications/unsubscribe-auth/AAHO4LM2D3F2SYF74XNOZ7TQVGIUJANCNFSM4JQ32ZQQ .

DonaldTsang commented 4 years ago

Are there any other datasets that does not have such license restrictions?