咨询下，LLM的数据污染检测（判断数据集是否训练见过）技术方向靠谱吗？有推荐论文吗？ - Githubissues

MLGroupJLU / LLM-eval-survey

The official GitHub page for the survey paper "A Survey on Evaluation of Large Language Models".

https://arxiv.org/abs/2307.03109

1.33k stars 86 forks source link

咨询下，LLM的数据污染检测（判断数据集是否训练见过）技术方向靠谱吗？有推荐论文吗？ #21

Open gongjunjin opened 10 months ago

cyp-jlu-ai commented 9 months ago

No description provided.

Hello, data contamination detection for LLM is an important research area, especially in ensuring the quality and reliability of model training data.

Here are some recommended papers on data contamination detection for LLM: