issues
search
HarikalarKutusu
/
cv-tbox-dataset-compiler
GNU Affero General Public License v3.0
0
stars
0
forks
source link
[PR] Post CV v17.0 work
#35
Closed
HarikalarKutusu
closed
5 months ago
HarikalarKutusu
commented
5 months ago
Major changes:
Adds s5 algorithm
Rework on bins (remove unnecessary 0 count, add intermedia values, allow larger values)
Implement "safe" reader for corrupt rows in reported.tsv to get more accurate results (see
https://github.com/common-voice/common-voice/issues/4429
)
Start to implement pyarrow dtypes
Refactor pack_splits to include statistics & progress bar
Major changes: