Closed wangrx33 closed 5 months ago
Please provide a clear and concise description of what the bug is. If applicable, add screenshots to help explain your problem, especially for visualization related problems.
大佬,看到之前的issue里有关于loss降为0的问题,您提到解决方法是设置torch_dtype=float32,但是我这边设置了之后loss还是会降为0。 使用了deepspeed zero2,deepspeed 版本是0.12.6
Describe the bug
Please provide a clear and concise description of what the bug is. If applicable, add screenshots to help explain your problem, especially for visualization related problems.
大佬,看到之前的issue里有关于loss降为0的问题,您提到解决方法是设置torch_dtype=float32,但是我这边设置了之后loss还是会降为0。 使用了deepspeed zero2,deepspeed 版本是0.12.6