scribe-org / Scribe-Data

Wikidata, Wiktionary and Wikipedia language data extraction
GNU General Public License v3.0
27 stars 61 forks source link

Generate keyword-emoji relationships in Basque #282

Open KesharwaniArpita opened 2 days ago

KesharwaniArpita commented 2 days ago

Terms

Description

This issue aims to create a file that generates keyword-emoji relationships for a selection of Basque words. This task involves associating relevant emojis with keywords (Basque lexemes) based on their meaning, context, or cultural significance.

KesharwaniArpita commented 2 days ago

@andrewtavis Can you assign me this issue?

andrewtavis commented 2 days ago

Hey @KesharwaniArpita 👋 Before we close this, I'm wondering if you have some suggestions for how to really simplify the emoji keywords process as much as possible 🤔 First, maybe we can get emojis_per_keyword as an argument when the file is called by the get process in a similar way that file_path is? From there, is there any other way we can simplify the generate_emoji_keywords files down that you can think of?

Let me know what you think!

KesharwaniArpita commented 2 days ago

Hey @andrewtavis, thanks for the suggestion, I think just like your suggestion we could pass the language as a command-line argument too. That way, it becomes flexible(robust) for other languages too. Do you think if we will need to make data type more dynamic to accommodate that?

One more small change can be introducing error handling in case the emoji generation (gen_emoji_lexicon) fails due to missing language data. (For example, if the input is a language we may not have the data for,??)

What do you think?