Closed SKDDJ closed 3 months ago
Thank you for your feedback. Firstly, the average and median of linear layer experiment is right. The reason for this is that parameter generation is random and accidental. The average is susceptible to extremely low values, in other words, if one or more examples produce very poor results, the average will drag down.
In the table of the paper, there seems to be a discrepancy between the average and median values reported for the linear layer parameters. Specifically, the average is listed as 47.3, while the median is reported as 71.1.
This disparity raises the question: Is this a typographical or numerical error in the reported values, or does it accurately reflect the underlying data distribution? In the latter case, the lower average compared to the median could potentially indicate that the model generates a significant number(almost half) of low quality parameters, thereby skewing the average downward while the median remains higher.
Could you please clarify whether the values presented are correct, and if so, provide some insight into the possible reasons behind this divergence between the average and median for the linear layer parameters?
Thanks.