Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

Find a needle in a haystack : 지푸라미더미에서 바늘 찾기, 서울에서 김서방 찾기, 모래사장에서 바늘 찾기 다 비슷한 의미

그래서 사람들이 text retrieval 능력을 평가하기 위해서

Place a random fact or statement (the 'needle') in the middle of a long context window (the 'haystack')
Ask the model to retrieve this statement
Iterate over various document depths (where the needle is placed) and context lengths to measure performance

아주 긴 context window 중간에 statement를 심어 놓고 모델에게 찾아보라고 하는 test

결국 multilingual, multimodal 다 하나의 모델에서 가능한 것인가?

paperswithlove / papers-we-read