Remove "context" parameter as it's not really used anymore and is just confusing for people.
Update trtllm arguments in the docs
Add llama3 chat format support to sft and data preparation. This is a bit hacky at the moment and will be fixed when this is properly supported in nemo
Remove system message from llama3 sft config
Fix gpu tests and upgrade finetuning test to llama3