issues
search
bminixhofer
/
zett
Code for Zero-Shot Tokenizer Transfer
https://arxiv.org/abs/2405.07883
101
stars
7
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
How to use the hyper network for Llama3-8b
#10
gushu333
opened
3 weeks ago
0
Issue running the tranfer script for Mistral - RAM OOM
#9
elements72
opened
3 weeks ago
0
OOM on training Mistral hypernet
#8
kdcyberdude
opened
4 weeks ago
2
add a minimal implementation of batching without sampling
#7
KathyHaem
closed
4 days ago
2
Error when training a hypernetwork
#6
jubgjf
opened
1 month ago
1
Implement proper batching (without repeats) for hypernetwork inference
#5
KathyHaem
closed
4 days ago
3
Issue in running the transfer script
#4
zaidalyafeai
opened
1 month ago
11
A question
#3
noforit
closed
1 month ago
1
Missing flax_model.msgpack for TinyLlama
#2
LorrinWWW
closed
1 month ago
2
Llama3
#1
zf0x00
opened
1 month ago
3