HarikalarKutusu / cv-tbox-split-maker

Creates alternative splits for Mozilla Common Voice datasets for further analysis. Supports delta-version upgrades.
Mozilla Public License 2.0

[PR] Feat lib clips #5

Closed HarikalarKutusu closed 1 month ago

HarikalarKutusu commented 7 months ago

Rework extract.py and add "middleware" that expands clips into a hierarchical directory structure, to overcome the OS bottlenecks caused by keeping hundreds of thousands of clips in a single directory.
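To illustrate the idea of spreading clips over nested directories, here is a minimal sketch. It is not the code from this PR: the function name `sharded_clip_path`, the hash-based bucketing, and the `depth`/`width` parameters are all assumptions made for the example, and the actual middleware may use a different layout.

```python
import hashlib
from pathlib import PurePosixPath


def sharded_clip_path(clip_name: str, depth: int = 2, width: int = 2) -> PurePosixPath:
    """Map a flat clip file name to a nested relative path (hypothetical helper).

    The sub-directory names are taken from the hex MD5 digest of the file
    name, so the mapping is deterministic and spreads files roughly evenly.
    With depth=2 and width=2 there are at most 256 * 256 leaf directories.
    """
    digest = hashlib.md5(clip_name.encode("utf-8")).hexdigest()
    # Take `depth` consecutive chunks of `width` hex characters as directory names.
    parts = [digest[i * width:(i + 1) * width] for i in range(depth)]
    return PurePosixPath(*parts, clip_name)


# Example clip name, used here only as a placeholder.
example = sharded_clip_path("common_voice_en_12345678.mp3")
```

Hashing the name, rather than slicing the numeric ID, is just one possible choice: it keeps the buckets balanced regardless of how IDs are distributed, at the cost of directory names that carry no meaning.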

HarikalarKutusu commented 4 months ago

This PR is now obsolete, as we are moving all related repos into a monorepo with proper CLIs and extended functionality. We will close this PR and archive this repo once that project is finished.