Open neumannjan opened 1 year ago
Since numeric convertor already multiplies the number by a learnable scale weight, shouldn't it also add bias?
Since numeric convertor already multiplies the number by a learnable scale weight, shouldn't it also add bias?