scribear / ScribeAR.github.io

Live Transcription for Augmented Reality Glasses
11 stars 16 forks source link

AudioStreamBucket injection for processing diverse audio #175

Open ammpr opened 7 months ago

ammpr commented 7 months ago

The Audio stream bucket data structure is used to feed audio to APIs. Importantly, this means audio is not directly integrated into APIs. Looks like memory is limited to 10kb per comments.

Feeding MP3s into audio stream buckets would allow for both improved testing and appeal. My current audio testing solution operates at the driver level, and unfortunately captures all system-wide audio. Many students have asynchronous lectures or audio based coursework. Students with DRES accommodations are sometimes given lecture recordings.