Closed Abolfazl-kr closed 9 months ago
Model training and fine-tuning
Chinese-LLaMA-2 (7B/13B)
Linux
what should we do to use deep speed stage3? what changes should apply on deep speed config and the codes?
# Please copy-and-paste your dependencies here.
# Please copy-and-paste your logs here.
Check before submitting issues
Type of Issue
Model training and fine-tuning
Base Model
Chinese-LLaMA-2 (7B/13B)
Operating System
Linux
Describe your issue in detail
what should we do to use deep speed stage3? what changes should apply on deep speed config and the codes?
Dependencies (must be provided for code-related issues)
Execution logs or screenshots