Closed kgilpin closed 3 weeks ago
Benchmark Google Gemini.
https://docs.google.com/spreadsheets/d/1BJXnulBWL8CA0FTNk82nyBWq_37unpuIk2iT6UlSkoY/edit?gid=1071194749#gid=1071194749
Similar results to gpt-4o on the verified 33% set.
Benchmark Google Gemini.
https://docs.google.com/spreadsheets/d/1BJXnulBWL8CA0FTNk82nyBWq_37unpuIk2iT6UlSkoY/edit?gid=1071194749#gid=1071194749
Similar results to gpt-4o on the verified 33% set.