scribe-org / Scribe-Data

Wikidata, Wiktionary and Wikipedia language data extraction
GNU General Public License v3.0
30 stars 69 forks source link

[Deleted] Change translation data table values to averages per language #71

Closed andrewtavis closed 1 month ago

andrewtavis commented 8 months ago

Terms

Description

One element of the translation process that will be split over many issues is that we want to be able to change the amount of translations that we report as being available for each language in data_table.txt into the average of all the languages sources that translate into it. A simple example, if we have 200 English to German pairs and 100 French to German pairs, then we would want the number of German translations to be listed as an average of 150. The reason that we're doing averages based on the target language is so that it's in line with the other numbers that we report: that we have this many nouns for the given keyboard language and this many verbs, etc.

Contribution

This issue will be blocked until the other translation issues have been finished. Happy to support or work on this myself when those are getting closer to completion!

spalominor commented 7 months ago

Can I work on this?

andrewtavis commented 7 months ago

Just assigned, @spalominor :) You'll need to wait a little bit on this for other people to be done with their translation issues, but hopefully you can get started by the end of the week 😊

axif0 commented 3 months ago

Hello, @andrewtavis i can't find the file you mentioned data_table.txt . is it this ? or deleted ?

andrewtavis commented 3 months ago

Hey @axif0, the file was recently deleted :) Let's keep this one open as a reminder that we want to do this, but the work would be in Scribe-Server post the bi-weekly updates we have planned 😊

andrewtavis commented 3 months ago

Thanks for checking here!

andrewtavis commented 1 month ago

Deleting this as it's ultimately going to be remade as a Scribe-Server issue at some point :)