This pull request fixes a problem with tile sizes below 64 when they otherwise should be allowed. In practice though, it's not expected that these small tile sizes will lead to improved performance except in some edge cases with small sequence length.
This pull request fixes a problem with tile sizes below 64 when they otherwise should be allowed. In practice though, it's not expected that these small tile sizes will lead to improved performance except in some edge cases with small sequence length.