A toolkit for training deep learning models on genotype, tabular, sequence, image, array and binary data.
GNU Affero General Public License v3.0
Add support to reuse transformer weights across long sequences #13
Closed
arnor-sigurdsson closed 2 years ago
The same transformer weights would be reused by sliding them across a longer input sequence, while still encoding each token's global position.
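A minimal sketch of the idea, in plain Python rather than the toolkit's own code. It assumes non-overlapping windows and a sinusoidal positional encoding; the names `sinusoidal_position`, `slide_shared_encoder` and `mean_mixing_encoder` are hypothetical, and the "encoder" is only a stand-in for a real transformer block. The point it illustrates is that one encoder is applied to every window, while positions are computed from the index in the full sequence.

```python
import math
from typing import Callable, List, Sequence

Vector = List[float]


def sinusoidal_position(position: int, dim: int) -> Vector:
    """Sinusoidal encoding for a single absolute position."""
    encoding = []
    for i in range(dim):
        angle = position / (10000 ** (2 * (i // 2) / dim))
        encoding.append(math.sin(angle) if i % 2 == 0 else math.cos(angle))
    return encoding


def slide_shared_encoder(
    embeddings: Sequence[Vector],
    encoder: Callable[[List[Vector]], List[Vector]],
    window_size: int,
) -> List[Vector]:
    """Apply one encoder to consecutive, non-overlapping windows."""
    dim = len(embeddings[0])
    outputs: List[Vector] = []
    for start in range(0, len(embeddings), window_size):
        window = []
        for offset, token in enumerate(embeddings[start:start + window_size]):
            # Use the index in the full sequence, not the index in the window.
            position = sinusoidal_position(start + offset, dim)
            window.append([t + p for t, p in zip(token, position)])
        # The same callable (i.e. the same weights) handles every window.
        outputs.extend(encoder(window))
    return outputs


def mean_mixing_encoder(window: List[Vector]) -> List[Vector]:
    """Hypothetical stand-in for a transformer block: mixes tokens in a window."""
    dim = len(window[0])
    mean = [sum(token[i] for token in window) / len(window) for i in range(dim)]
    return [[x + m for x, m in zip(token, mean)] for token in window]


if __name__ == "__main__":
    sequence = [[0.0] * 4 for _ in range(10)]
    encoded = slide_shared_encoder(sequence, mean_mixing_encoder, window_size=4)
    print(len(encoded))
```

Because positions are global, two tokens at the same offset in different windows receive different encodings, even though the encoder weights are shared. Overlapping windows or learned position embeddings would be equally valid choices; this sketch does not claim to match what was implemented for this issue.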