nvtransfer / RULER

This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models?
Apache License 2.0
646 stars 43 forks source link

Gemini flash 1.5 results #43

Open augusto-rehfeldt opened 2 months ago

augusto-rehfeldt commented 2 months ago

Does anyone have the results for this model? Seems to hallucinate quite a lot in long context prompts, even though it has a context size of a million tokens.

Thanks.

iofu728 commented 1 month ago

Same question. If there are results from the Gemini-1.5-flash, it would be a great help.