Kipok / NeMo-Skills

A pipeline to improve skills of large language models
https://kipok.github.io/NeMo-Skills/
Apache License 2.0
185 stars 41 forks source link

WIP - Llama code interpreter and improved LLM as a judge #89

Closed Kipok closed 1 month ago

Kipok commented 2 months ago

This is still work-in-progress, but want to push to allow other people to provide suggestions and reuse the code while we are ironing out the details.

The current implementation does not have a proper integration for llm-as-a-judge - this requires a larger refactoring which I will try to do next week.

It also breaks openmathinstruct-1 backward compatibility which is probably fine since people can always switch back to the v0.1.1 tag and we can't support that code forever as we move to the new format with llama models

Kipok commented 1 month ago

Merged in igitman/refactoring branch