Description
This PR warns readers about the data contamination issue in UltraFeedback. Following the recent MistralAI reports on data contamination, AllenAI also reported contamination affecting the TruthfulQA benchmark within UltraFeedback, so the TruthfulQA scores that Zephyr and we obtained are not correct or fair.
In addition, a short explanation has been included so that users know the motivation behind Notus and where it comes from.