Gemini 1.5 Flash 002 Hallucinates Timestamps when transcribing audio

google-gemini / generative-ai-js

The official Node.js / Typescript library for the Google Gemini API

https://www.npmjs.com/package/@google/generative-ai

Apache License 2.0

721 stars 152 forks source link

Gemini 1.5 Flash 002 Hallucinates Timestamps when transcribing audio #269

Open JamMaster1999 opened 2 weeks ago

JamMaster1999 commented 2 weeks ago

Description of the bug:

The new flash model completely hallucinates timestamps when performing transcription.

Actual vs expected behavior:

The timestamps should be accurate based on when that word or phrase was spoken. The original flash model is excellent at this. The new model completely hallucinates.

Any other information you'd like to share?

Just simply try it. IT's so off it becomes obvious the second you try.

gmKeshari commented 2 weeks ago

Hi @JamMaster1999 ,

I have escalated this to the internal team.

JamMaster1999 commented 6 days ago

Thank you