MLGroupJLU / LLM-eval-survey

The official GitHub page for the survey paper "A Survey on Evaluation of Large Language Models".
https://arxiv.org/abs/2307.03109

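Question: is data contamination detection for LLMs (determining whether a dataset was seen during training) a sound research direction? Are there any recommended papers? #21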

Open gongjunjin opened 10 months ago

cyp-jlu-ai commented 9 months ago


Hello, data contamination detection for LLMs is an important research area, especially for ensuring the quality and reliability of model training data.

Here are some recommended papers on data contamination detection for LLMs:

  1. Time Travel in LLMs: Tracing Data Contamination in Large Language Models
  2. Can we trust the evaluation on ChatGPT?
  3. Large language models are few-shot testers: Exploring LLM-based general bug reproduction
  4. BAMBOO: A Comprehensive Benchmark for Evaluating Long Text Modeling Capacities of Large Language Models
  5. Anatomy of an AI-powered malicious social botnet
  6. Origin Tracing and Detecting of LLMs