gururise / AlpacaDataCleaned

Alpaca dataset from Stanford, cleaned and curated
Apache License 2.0
1.5k stars 149 forks source link

PIQA dataset's metric #59

Open rattlesnakey opened 1 year ago

rattlesnakey commented 1 year ago

how does PIQA got 78 acc ? I see the eval folder's readme file, it says the metric is not trustworthy ?