Closed jpil-builder closed 3 months ago
Issue #, if available:
Description of changes: When Merging the model adapters for llama 3 using any other torch dtype besides float16 is giving NAN values in the model.
By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.
Issue #, if available:
Description of changes: When Merging the model adapters for llama 3 using any other torch dtype besides float16 is giving NAN values in the model.
By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.