Length normalizing and temperature

hkust-zhiyao / RTL-Coder

A new LLM solution for RTL code generation, achieving state-of-the-art performance in non-commercial solutions and outperforming GPT-3.5.

132 stars 16 forks source link

Thanks for nice work! I have two questions. The first one is about length norm in calculating the conditional log probability. According to the paper and common practice, the denominator should be the length of response. However, according to the code: https://github.com/hkust-zhiyao/RTL-Coder/blob/3394cce416fb0d70f76d81f809be5d0c32de0c55/train/mle_scoring.py#L199 the denominator seems to include the padding part. Could you please check it?

The second question I wonder is the proper way to show experiment results. The paper says, Do you mean choosing the best result under each temperature , or choose the best temperature according to Pass@1 or something? Thank you for reply.

hkust-zhiyao / RTL-Coder

Length normalizing and temperature #9