klpp / kurdnet

1 stars 3 forks source link

Would it be possible to add this to the Open Multilingual Wordnet? #1

Open fcbond opened 3 years ago

fcbond commented 3 years ago

Hi,

It is great to see some work on Kurdish. We are gathering data for many wordnets at https://github.com/bond-lab/omw-data. If you could (i) add a license file and (ii) convert to either the OMW 1.0 or 2.0 formats, we would be happy to add this.

We also have some data extracted from wiktionary that you may find useful:

I am afraid I speak no Kurdish so cannot judge it's quality, but evaluation on other languages we found an accuracy of around 85%.

Yours,

sinaahmadi commented 2 years ago

Hi @fcbond , Thanks for your nice suggestion. I am one of the creators of this project. As it was not in my repository, I couldn't see your issue until now!

That'll be great. Is there any tool or script that I can use for the conversion?

Thanks.

fcbond commented 2 years ago

Hi,

nice to hear from you.

We are in the process of moving (very slowly) from OMW 1.0 to OMW 2.0. Ultimately we prefer the OMW 2.0 format, but it might be easier to make the OMW 1.0 format and convert.

There is a script that goes from the OMW 1.0 format to the OMW 2.0 format, which might be the easiest to use.

You can find it here: https://github.com/bond-lab/omw-data, and the format is described here: https://github.com/bond-lab/omw-data/tree/main/wns

Basically it is a tab separated file with the princeton wordnet 3.0 synset, and then the lemma in kurdish.

To map between 2.0 and 3.0, I normally used these mappings. http://www.talp.upc.edu/content/wordnet-mappings-automatically-generated-mappings-among-wordnet-versions

I noticed that the Kurdish wordnet does not currently have an explicit licence. In order to be uploaded into the OMW, we need it to have an open license: we recommend CC BY, but any of the licenses listed here are fine: https://www.luismc.com/omw/omw/doc/metadata.

Sorry if this was a bit of a data dump:

TL:DR;

I am very happy to give advice,

On Mon, Aug 30, 2021 at 11:53 PM Sina Ahmadi @.***> wrote:

Hi @fcbond https://github.com/fcbond , Thanks for your nice suggestion. I am one of the creators of this project. As it was not in my repository, I couldn't see your issue until now!

That'll be great. Is there any tool or script that I can use for the conversion?

Thanks.

— You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub https://github.com/klpp/kurdnet/issues/1#issuecomment-908443538, or unsubscribe https://github.com/notifications/unsubscribe-auth/AAIPZRSXMGXH2VHZBTRRWHTT7OQETANCNFSM4T2XOEAA . Triage notifications on the go with GitHub Mobile for iOS https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675 or Android https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub.

-- Francis Bond http://www3.ntu.edu.sg/home/fcbond/ Division of Linguistics and Multilingual Studies Nanyang Technological University

sinaahmadi commented 2 years ago

Dear Francis (@fcbond), Thanks again for your useful suggestion. I have updated some of the files, provided a license, and created OMW-compatible files for KurdNet. Please check out the new repository: https://github.com/sinaahmadi/kurdnet

Given that the current repository is not active anymore, please keep me posted there for any future notifications. Thanks :-)

fcbond commented 2 years ago

Hi,

Thank you!

I tweaked it a bit, so it now also makes a file for OMW 2.0. Can you let me know if it looks ok, and if so accept the PR?

On Sat, Sep 18, 2021 at 12:01 AM Sina Ahmadi @.***> wrote:

Dear Francis, Thanks again for your useful suggestion. I have updated some of the files, provided a license, and created OMW-compatible files for KurdNet. Please check out the new repository: https://github.com/sinaahmadi/kurdnet

Given that the current repository is not active anymore, please keep me posted there for any future notifications. Thanks :-)

— You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub https://github.com/klpp/kurdnet/issues/1#issuecomment-921899393, or unsubscribe https://github.com/notifications/unsubscribe-auth/AAIPZRTSRUMHKWO5FUWCXY3UCNPFZANCNFSM4T2XOEAA . Triage notifications on the go with GitHub Mobile for iOS https://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675 or Android https://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub.

-- Francis Bond http://www3.ntu.edu.sg/home/fcbond/ Division of Linguistics and Multilingual Studies Nanyang Technological University