domenicrosati / training-time-domain-authorization

0 stars 1 forks source link

Port over Stability Evaluation code from Previous RepNoise Project [Dom] #3

Open domenicrosati opened 3 months ago

domenicrosati commented 3 months ago

This includes:

ToDo:

domenicrosati commented 3 months ago

I think another thing that is very valuable is somehow develop metrics on the fluency of the method. LM Harness isn't really great at understanding the impact of fluecny and impact of a method on long form generation.