Open Levishery opened 10 months ago
Hi @Levishery
I asked myself the same question. I found a reason : it give to each ref point a favored direction and a favored radius :
h
(h in range(nb_head)
) : preferred direction h *(2*Pi / nb_head)
k+1
for k in range(nb_ref_points)
I mean "favored" because it's independent of the data, it's sampling_offsets.bias
.
But based on the data and sampling_offsets.weight
the real sampling offset might not respect the favored directions and radius.
in class MSDeformAttn reset_parameters line 69
Why the sampling_offset bias don't get updated during training? I saw that in some other implementations, this parameter would be updated.
Thanks!