common-voice / common-voice

Common Voice is part of Mozilla's initiative to help teach machines how real people speak.
https://commonvoice.mozilla.org/
Mozilla Public License 2.0
3.31k stars 844 forks source link

[BUG] Incorrect display of statistics at CV website for Catalan language #4200

Open Ecron opened 1 year ago

Ecron commented 1 year ago

Bug description The main CV website displays incorrect data “Recorded/Validated Hours” graph. I've been personally gathering this data since 2020, and now I realized that my data does not match with the one represented on the official website's graph. Specifically, the “Validated hour” data prior to July 2023 is not correct, as it shows Catalan had slightly more than 900 validated hours in April 2nd, but my records show that it had more than 2.000 validated hours back then.

imatge

The Recorded Hours graph seems to be just fine, according to my data.

To Reproduce Steps to reproduce the behavior:

  1. Go to 'Common Voice's website'
  2. Go to the Recorded/Validated graph.
  3. Click on the Language Selector > 'Català'
  4. Check the blue/green curve, and compare it to Catalan's recorded hours' curve, and to other main CV languages (English, Esperanto, German, Belorussian...). It's clear that there's a bug there. I can back this up with my own personal data gathering.

Screenshots imatge imatge imatge imatge

jessicarose commented 1 year ago

Thank you so much for flagging this, I've made a ticket to have the backend team have a look when they're able to make time and I massively appreciate your exceptional dedication and eye for detail.