Open the21st opened 5 months ago
Would love to see results for gpt-4o. There was some claimed improvement in its abilities: http://nian.llmonpy.ai/
We also plan to run evaluation for gpt-4o! Looks like gpt-4o has large improvement to solve lost-in-the-middle issue.
so ? when?
Looking forward to it for the latest versions of 4o and 4o-mini!
Would love to see results for gpt-4o. There was some claimed improvement in its abilities: http://nian.llmonpy.ai/