gururise / AlpacaDataCleaned

Alpaca dataset from Stanford, cleaned and curated
Apache License 2.0
1.46k stars 146 forks source link

Fixing more URL and Code instructions. #34

Closed HideLord closed 1 year ago

HideLord commented 1 year ago

Fixing more URL and Code instructions. Something I noticed is that code instructions are incredibly repetitive (at least 7-8 'add two numbers' and just as much Fibonacci, factorial and sort instructions). They are also very simplistic.