globalwordnet / OMW

The Open Multilingual Wordnet
http://compling.hss.ntu.edu.sg/omw/
MIT License
57 stars 9 forks source link

Replace iso-639 dependency with something with a permissive license #88

Closed goodmami closed 4 years ago

goodmami commented 4 years ago

The iso-639 package is licensed under the AGPL, which is copyleft. I think if that fact were noticed earlier we wouldn't have used it. Now we should replace it or we'll need to GPLify this repo, too.

goodmami commented 4 years ago

I found several packages that all use some GPL license, perhaps because they use an underlying data source that is GPL.

The best alternative I could find is https://github.com/LuminosoInsight/langcodes, but it is BCP-47 and not ISO-639-3 (the data is there, I think, but without an API to get the tags).

fcbond commented 4 years ago

I also found a simple table here: https://datahub.io/core/language-codes (Open Data Commons Public Domain Dedication and License (PDDL)). We could just download and read it in ourselves.

But let me see if we can get what we want from langcodes.

On Wed, May 20, 2020 at 10:51 PM Michael Wayne Goodman < notifications@github.com> wrote:

I found several packages that all use some GPL license, perhaps because they use an underlying data source that is GPL.

The best alternative I could find is https://github.com/LuminosoInsight/langcodes, but it is BCP-47 and not ISO-639-3 (the data is there, I think, but without an API to get the tags).

— You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub https://github.com/globalwordnet/OMW/issues/88#issuecomment-631523702, or unsubscribe https://github.com/notifications/unsubscribe-auth/AAIPZRUWX3BHI44DLZH4RPTRSPU6VANCNFSM4NF7RIWQ .

-- Francis Bond http://www3.ntu.edu.sg/home/fcbond/ Division of Linguistics and Multilingual Studies Nanyang Technological University

goodmami commented 4 years ago

ISO-639-3, at least, is maintained by SIL: https://iso639-3.sil.org/

The only place we use the library is in scripts/seed-languages.py:

https://github.com/globalwordnet/OMW/blob/42a2d31e1efff4261fec446acf724c9651db0167/scripts/seed-languages.py#L7

then

https://github.com/globalwordnet/OMW/blob/42a2d31e1efff4261fec446acf724c9651db0167/scripts/seed-languages.py#L45

I think the part3 thing refers to ISO-639-3.