Kipok / NeMo-Skills

A pipeline to improve skills of large language models
https://kipok.github.io/NeMo-Skills/
Apache License 2.0
185 stars, 41 forks

SFT data processing commands #182

Closed by shtoshni 3 weeks ago

shtoshni commented 4 weeks ago

Added decontamination and SFT data preparation commands.

shtoshni commented 3 weeks ago

Resolved all the comments. Let me know if there are any remaining issues.

shtoshni commented 3 weeks ago

Let me know how it looks now.