issues
search
Speech-to-text-data-collection
/
STT-data-collection
A data engineering pipeline that allows recording millions of Amharic and Swahili speakers reading digital texts on app and web platforms.
1
stars
7
forks
source link
Create a kafka cluster
#3
Open
Bethelsis
opened
3 years ago
Bethelsis
commented
3 years ago
[ ] write a code that can generate an ID for a randomly selected text and its audio equivalent, receives an ID from an API, sends back as JSON the ID + audio to Kafka like URL