Sylva-Lin closed this issue 8 months ago
I have never encountered errors like this. Please strictly follow the README to build the code and run the experiments.
You are right, thanks!
This happens because some parameters defined in your network are not involved in the final loss computation, so they receive no gradients. This may not be an issue on a single GPU, but it triggers this error when training on multiple GPUs.
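To check this on a single process before launching a multi-GPU run, one option is to run one forward and backward pass and list the parameters whose `.grad` is still `None`. The sketch below is only an illustration: `TinyNet`, its layer names, and the helper `params_without_grad` are hypothetical and not taken from this repository.

```python
import torch
import torch.nn as nn


class TinyNet(nn.Module):
    """Hypothetical model, for illustration only."""

    def __init__(self):
        super().__init__()
        self.used = nn.Linear(4, 2)
        self.unused = nn.Linear(4, 2)  # defined but never called in forward

    def forward(self, x):
        return self.used(x)


def params_without_grad(model):
    """Names of trainable parameters that received no gradient."""
    return [
        name
        for name, p in model.named_parameters()
        if p.requires_grad and p.grad is None
    ]


model = TinyNet()
loss = model(torch.randn(3, 4)).sum()
loss.backward()

missing = params_without_grad(model)
print(missing)  # parameters that did not contribute to the loss
```

Once the offending parameters are known, the usual options are to remove them, to make sure they contribute to the loss, or to pass `find_unused_parameters=True` when wrapping the model in `torch.nn.parallel.DistributedDataParallel`, at the cost of an extra traversal of the autograd graph on each iteration.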
Thanks for your reply. I have solved this problem.
Parameter indices which did not receive grad for rank 1: 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 142 143 144 145 146 147 148 149 150 151 ...
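A long index list like this is easier to read when collapsed into consecutive runs. The following is a minimal sketch in plain Python; the helper names `parse_unused_indices` and `group_runs` are made up for illustration, and the sample line is a shortened stand-in rather than the full log. Non-numeric tokens such as a trailing `...` are skipped rather than guessed at.

```python
import re


def parse_unused_indices(log_line):
    """Return (rank, indices) parsed from a 'did not receive grad' line."""
    match = re.search(r"did not receive grad for rank (\d+):\s*(.*)", log_line)
    if match is None:
        raise ValueError("not a 'did not receive grad' line")
    rank = int(match.group(1))
    # Keep only integer tokens, so a truncation marker is ignored.
    indices = [int(tok) for tok in match.group(2).split() if tok.isdigit()]
    return rank, indices


def group_runs(indices):
    """Collapse ascending indices into inclusive (start, end) runs."""
    runs = []
    for i in indices:
        if runs and i == runs[-1][1] + 1:
            runs[-1] = (runs[-1][0], i)
        else:
            runs.append((i, i))
    return runs


# Shortened sample line, not the full log from this thread.
line = "Parameter indices which did not receive grad for rank 1: 17 18 19 34 42 43 ..."
rank, idx = parse_unused_indices(line)
print(rank, group_runs(idx))
```

Applied to the visible part of the log above, the runs are 17-34, 42-59, 67-84, 92-109, 117-134, and 142-151 (truncated), i.e. 18 parameters repeating every 25 indices, which may point to the same submodule being unused in each repeated block. If the indices follow the order of `model.named_parameters()` restricted to trainable parameters (an assumption worth verifying for your PyTorch version), they can be mapped back to parameter names by position.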