sashakunitsyn closed 11 months ago
Not sure if this is a bug or a feature, but this kwarg https://github.com/lucidrains/x-transformers/blob/main/x_transformers/x_transformers.py#L985 is set but never used. Should it be used as an additional condition here? https://github.com/lucidrains/x-transformers/blob/main/x_transformers/x_transformers.py#L1137
@sashakunitsyn oh yes, i was initially using that when dealing with the ResiDual paper, which had an exotic pre + post-norm combination
removed it for clarity! thank you!