nverma1/merging-text-transformers
Code for "Merging Text Transformers from Different Initializations"
https://arxiv.org/pdf/2403.00986.pdf
MIT License
19 stars, 0 forks
issues
#3 Why LN doesn't require any P^{T} (daidaiershidi, closed, 3 months ago, 4 comments)
#2 Applying fusion to a ViT from a different package (idankinderman, closed, 7 months ago, 1 comment)
#1 Amazing work. I have a question related to the ROPE embedding. (shamanez, closed, 7 months ago, 3 comments)