In the Audio Understanding section of the notebook:
As far as I can tell, the MP3 has 4 distinct voices: the host, the voice that announces the name of the podcast and the host, and the two guests. The transcription, however, lists 5 speakers (A, B, C, D and E), one more than I can hear in the audio.
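A quick way to confirm the speaker count is to tally the distinct labels in the model's transcription. This is only a minimal sketch: the `Speaker X:` label format, the `distinct_speakers` helper, and the sample text are assumptions for illustration, not the notebook's actual output.

```python
import re


def distinct_speakers(transcript: str) -> list[str]:
    """Return the sorted set of speaker labels found in a transcript.

    Assumes (hypothetically) that each turn starts with "Speaker X:",
    where X is a single capital letter.
    """
    labels = re.findall(r"^\s*Speaker ([A-Z]):", transcript, flags=re.MULTILINE)
    return sorted(set(labels))


# Made-up sample text, not taken from the notebook's output.
sample = """Speaker A: Welcome to the show.
Speaker B: This is the podcast intro.
Speaker C: Thanks for having us.
Speaker D: Glad to be here.
Speaker E: Back to the host."""

speakers = distinct_speakers(sample)
print(len(speakers), speakers)
```

If the count printed here is higher than the number of voices in the audio, the model has probably split one voice across two labels.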
Relevant log output
No response
Code of Conduct
[X] I agree to follow this project's Code of Conduct
File Name
intro_gemini_1_5_pro.ipynb