fixie-ai / ultravox

A fast multimodal LLM for real-time voice
https://ultravox.ai
MIT License

Add adapter for HiSanta data #47

Open mdepinet opened 3 months ago

mdepinet commented 3 months ago

@farzadab It wasn't clear to me whether VoiceDatasetArgs are optional customizations that only some datasets use, or whether some of them must be respected by every Dataset. (I imagine max_audio_duration_secs, at least, is required?) It should be pretty easy to add support for the required ones once I know which they are.

Note to self: need to set up a new service account.

farzadab commented 3 months ago

I believe include_audio, shuffle, max_audio_duration_secs, and split should be respected. The other args can be situational.
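For illustration only, here is a minimal sketch of an adapter that respects those four args. It is not the ultravox implementation: `VoiceDatasetArgsSketch`, `iter_samples`, the record keys (`split`, `duration_secs`, `audio`), and the `seed` parameter are all hypothetical stand-ins. It also assumes that over-long clips are dropped rather than truncated, and that `include_audio=False` means the audio field is omitted from each sample.

```python
import random
from dataclasses import dataclass
from typing import Any, Dict, Iterable, Iterator, Optional


@dataclass
class VoiceDatasetArgsSketch:
    """Hypothetical stand-in holding only the four args named above."""

    include_audio: bool = True
    shuffle: bool = False
    max_audio_duration_secs: Optional[float] = None
    split: str = "train"


def iter_samples(
    records: Iterable[Dict[str, Any]],
    args: VoiceDatasetArgsSketch,
    seed: int = 42,  # assumed: a fixed seed so shuffling is reproducible
) -> Iterator[Dict[str, Any]]:
    """Yield copies of `records`, honoring split, duration, shuffle, and audio."""
    # split: keep only records belonging to the requested split.
    rows = [r for r in records if r["split"] == args.split]

    # max_audio_duration_secs: drop clips longer than the limit (assumption:
    # drop, not truncate).
    if args.max_audio_duration_secs is not None:
        limit = args.max_audio_duration_secs
        rows = [r for r in rows if r["duration_secs"] <= limit]

    # shuffle: reorder deterministically with a local RNG.
    if args.shuffle:
        random.Random(seed).shuffle(rows)

    for row in rows:
        sample = dict(row)  # copy so the caller's records are not mutated
        # include_audio: strip the audio payload for text-only use.
        if not args.include_audio:
            sample.pop("audio", None)
        yield sample
```

Under these assumptions, the remaining args would be handled case by case in each dataset.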

mdepinet commented 3 months ago

Putting this on ice for now.