This pull request makes the following changes:
Bug Fix: Correct 'sample.answers' access
Changed cur_answer = sample.answers to cur_answer = sample.answers['value'], so that cur_answer holds the entry stored under the 'value' key rather than the whole answers object.
Bug Fix: Apply 'mistral_custom_template' to llama
Changed custom_chat_template=None to custom_chat_template=mistral_custom_template to resolve template-related errors with the llama3 model.
Docs Update: Improve usage instructions in README.md
Added a missing option to the usage examples in the megatron-energon section.
These changes aim to enhance model stability and usability while improving documentation for developers. Please review and provide feedback. Thank you!
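For reviewers, the 'sample.answers' fix can be illustrated with a minimal sketch. It assumes that sample.answers is a dict-like object with a 'value' key, as the new indexing implies. The Sample class and the two helper functions are hypothetical stand-ins for illustration and are not the project's actual code.

```python
from dataclasses import dataclass
from typing import Any, Dict


@dataclass
class Sample:
    # Hypothetical stand-in for the dataset sample type; the real class
    # is not shown in this PR.
    answers: Dict[str, Any]


def get_answer_before(sample: Sample) -> Any:
    # Old behaviour: returns the whole answers mapping.
    return sample.answers


def get_answer_after(sample: Sample) -> Any:
    # New behaviour: returns only the entry stored under 'value'.
    return sample.answers['value']


sample = Sample(answers={"value": "Paris"})
before = get_answer_before(sample)  # the full dict
after = get_answer_after(sample)    # just the answer entry
```

Under this assumption, any downstream code that expects the answer itself would receive a dict before the fix and the answer entry after it.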