itkach / slob

Data store for Aard 2
GNU General Public License v3.0
241 stars 32 forks source link

Update slob download links #42

Closed Tarek-Hasan closed 1 year ago

Tarek-Hasan commented 1 year ago

Please, update these following dictionaries.

MHBraun commented 1 year ago

There is no dedicated maintainer for this area. The files are generated from different enthusiasts.

Pls check Ftp://halifax.rwth-aachen.de/aarddict For updated versions regulary as noted.

You may generate some dictionaries as well and publish thise here.

Markus


From: Tarek Hasan Faruk @.***> Sent: Thursday, January 19, 2023 17:38 To: itkach/slob Cc: Subscribed Subject: [itkach/slob] Update slob download links (Issue #42)

Please, update these following dictionaries. WordNet version is currently 3.1 which is last updated 2011 and now unmaintained. Open English WordNet is an actively maintained fork of Princeton WordNet, which just released its 2022 Edition in 31st December 2022. Simple English Wiktionary link is invalid and also last updated in 2020. Arabic Wiktionary last updated in 2020. Bengali Wiktionary link is invalid and also last updated in 2016. Korean and Japanese Wiktionary last updated in 2015. Wikispecies last updated in 2020 and link also invalid. Collaborative International Dictionary of English (GCIDE) version is 0.52, where latest version is 0.53. CC-CEDICT last updated 2021, newer version has 2,425 more entries. — Reply to this email directly, view it on GitHub, or unsubscribe. You are receiving this because you are subscribed to this thread.Message ID: @.***>

itkach commented 1 year ago

@Tarek-Hasan Thank you for the report and suggestions. I cleaned up the wiki and removed broken links. As @MHBraun said, this is community managed page, no guarantee that folks who compiled dictionaries and shared links at some point are willing or able to do so continuously, so maintenance is "best effort".

Taking a quick look at Open English WordNet, it appears that the data is available in WordNet database format, so perhaps existing converter https://github.com/itkach/wordnet2slob can be applied. I don't know who converted CC-CEDICT originally and how they did it, so can't help here.

Note that this is repository for slob file format and reference implementations, best place for dictionaries related discussions is aarddict Google group at https://groups.google.com/g/aarddict

itkach commented 1 year ago
  • Open English WordNet is an actively maintained fork of Princeton WordNet, which just released its 2022 Edition in 31st December 2022.

here: https://github.com/itkach/slob/wiki/Dictionaries#open-english-wordnet