KaleidOk invites participants to use a new kind of interactive media tool and take part in an emerging experience which explores speech recognition, media retrieval and visuals generating in a collaborative context (between people, and between people and machines).
The transcription duration could be much better (at least on my machine), but it's about on par with Google's transcription service considering network latency. This may be tweaked with smaller training sets and specialised configuration parameters at the cost of some quality, which seems to be alright at the moment. It looks like a lot of trial and error will be necessary for that.
The transcription duration could be much better (at least on my machine), but it's about on par with Google's transcription service considering network latency. This may be tweaked with smaller training sets and specialised configuration parameters at the cost of some quality, which seems to be alright at the moment. It looks like a lot of trial and error will be necessary for that.
Resources: