Wrong dimension for L2 normalization?

When ReduceDim

https://github.com/gabeur/mmt/blob/0d848cd0811dbcbee06a62cf474636e68d728944/model/model.py#L715-L725

is applied to video expert embeddings it do F.normalize on shape (batch_size, num_tokens, embedding_dim), and reduce by default along dim=1 (num_tokens)

https://github.com/gabeur/mmt/blob/0d848cd0811dbcbee06a62cf474636e68d728944/model/model.py#L431-L434

But here it is applied to shape (batch_size, embedding_dim)

https://github.com/gabeur/mmt/blob/0d848cd0811dbcbee06a62cf474636e68d728944/model/model.py#L423-L426

So normalization along embedding_dim axis.

Is it mistake or not?

gabeur / mmt

Wrong dimension for L2 normalization? #1