octree-nn / octformer

OctFormer: Octree-based Transformers for 3D Point Clouds
MIT License
259 stars 18 forks source link

Multiple GPUs Error #20

Closed Sylva-Lin closed 8 months ago

Sylva-Lin commented 8 months ago

Parameter indices which did not receive grad for rank 1: 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 67 68 69 70 71 72 73 74 75 76 77 78 79 80 81 82 83 84 92 93 94 95 96 97 98 99 100 101 102 103 104 105 106 107 108 109 117 118 119 120 121 122 123 124 125 126 127 128 129 130 131 132 133 134 142 143 144 145 146 147 148 149 150 151 ...

wang-ps commented 8 months ago

I have not never encountered errors like this. Please strictly follow the readme to build the code and run the experiments

Sylva-Lin commented 8 months ago

I have not never encountered errors like this. Please strictly follow the readme to build the code and run the experiments

You are right, thank!

fyj534609 commented 7 months ago

I have not never encountered errors like this. Please strictly follow the readme to build the code and run the experiments

You are right, thank!

This is because there are some parameters defined in your network that are not involved in the final loss computation, resulting in them having no gradients. While running on a single GPU might not be an issue, it leads to this situation on multiple GPUs

Sylva-Lin commented 7 months ago

I have not never encountered errors like this. Please strictly follow the readme to build the code and run the experiments

You are right, thank!

This is because there are some parameters defined in your network that are not involved in the final loss computation, resulting in them having no gradients. While running on a single GPU might not be an issue, it leads to this situation on multiple GPUs

Thanks for your reply, I have solved this question.