There is an issue where incorrect models are still appearing in the result list. For instance, the GPT-4 model is listed without specifying its exact version, which leads to inaccurate performance evaluations. These models should be hidden, but they are still visible in the results.
There is an issue where incorrect models are still appearing in the result list. For instance, the GPT-4 model is listed without specifying its exact version, which leads to inaccurate performance evaluations. These models should be hidden, but they are still visible in the results.