Closed lygjwy closed 2 months ago
Hi, I don't understand the function of code 'labels[0, :start_token] = -100' in Line 68 from cherry_seletion/data_analysis.py file.
Sorry for the late reply! The use of = -100 is to make LLMs ignore the calculation of loss on those positions.
= -100
Hi, I don't understand the function of code 'labels[0, :start_token] = -100' in Line 68 from cherry_seletion/data_analysis.py file.