issues
search
erfanzar
/
EasyDeL
Accelerate your training with this open-source library. Optimize performance with streamlined training and serving options with JAX. 🚀
https://easydel.readthedocs.io/en/latest/
Apache License 2.0
167
stars
19
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
EasyDeL
#164
kuangdao
closed
1 week ago
1
oom when llama2-7b sft
#163
kuangdao
opened
1 week ago
4
Version `0.0.69`
#162
erfanzar
closed
2 weeks ago
0
TPU-v3 Kaggle not working after update
#161
s-smits
closed
3 weeks ago
5
Update base_trainer.py for handling total_batch_size>1
#159
s-smits
closed
4 weeks ago
0
value error using flash attention
#158
mohammad0081
closed
4 weeks ago
1
Logging into wandb.ai
#157
mohammad0081
closed
1 month ago
2
NaN loss in ORPOTrainer with legacy_sharded_vanilla
#156
nyl199310
closed
3 weeks ago
9
Out of Memory issue in new easydel version.
#155
nyl199310
closed
1 month ago
6
Falcon-11B: Dict key mismatch; expected keys: ['input_layernorm', 'mlp', 'self_attention']; dict: {'self_attention': {'query_key_value': {'kernel': Array
#154
s-smits
closed
1 month ago
9
[Feature Request] Add support for tiiuae/falcon-11B
#152
s-smits
closed
1 month ago
4
Import Error
#150
mohammad0081
closed
1 month ago
1
Mosaic kernels cannot be automatically partitioned. Please wrap the call in a shard_map or xmap
#149
nyl199310
closed
1 month ago
3
Can't load checkpoints continue training
#148
IvoryTower800
closed
2 months ago
7
AssertionError: Precision DEFAULT requested together with quantization.
#147
peterniu19
closed
1 month ago
5
training does not start using latest easydel
#146
IvoryTower800
closed
2 months ago
6
'LoraWeight' object has no attribute 'tolist'
#145
defdet
closed
2 months ago
4
Please provide support for LLama3 or provide example on how to serve it using Easydel
#144
jchauhan
closed
2 months ago
4
load_in_8bit doesn't work on Kaggle TPU
#143
IvoryTower800
closed
2 months ago
2
Out of memory for serving example
#142
xu3kev
closed
2 months ago
3
Import Union
#141
xu3kev
closed
2 months ago
1
Kaggle training examples don't work
#140
jcole75
closed
2 months ago
14
Add support for iterable dataset loading
#138
yhavinga
closed
2 months ago
0
Updating Beta Branch
#136
erfanzar
closed
2 months ago
0
Add gradient norm logging, fix metric collection on multi-worker setup
#135
yhavinga
closed
2 months ago
0
checkpoint's size is increasing everytime.
#134
IvoryTower800
closed
1 month ago
3
Unable to Load EasyDeL State
#133
w11wo
closed
3 months ago
6
Error converting easydel checkpoint to huggingface model.
#132
IvoryTower800
closed
3 months ago
2
How to reduce TPU RAM when finetuning?
#131
IvoryTower800
closed
2 months ago
8
Attention Mask for Packed Sequences (via Attention Bias)
#129
xingyaoww
closed
2 months ago
3
Transformers-like API for inference
#128
Froggy111
closed
3 months ago
19
Add save_total_limit argument to delete older checkpoints
#127
yhavinga
closed
3 months ago
0
How to continue training from a previous saved easydel checkpoint?
#126
IvoryTower800
closed
3 months ago
9
a question about how to increase batch size.
#125
IvoryTower800
closed
3 months ago
6
Time whole train loop instead of only call to train step function
#124
yhavinga
closed
3 months ago
0
Ignore token label smooth z loss
#123
yhavinga
closed
3 months ago
0
Model configs pass attributes to PretrainedConfig to prevent override…
#122
yhavinga
closed
3 months ago
0
Docs site is broken https://erfanzar.github.io
#121
nigh8w0lf
closed
3 months ago
1
Training with Ring Attention Failed
#120
IvoryTower800
closed
3 months ago
10
Update Beta branch version to EasyDeL 0.0.55
#119
erfanzar
closed
3 months ago
0
Install from git not working
#118
sr5434
closed
3 months ago
8
Training in kaggle's TPU is failing
#117
saidineshpola
closed
3 months ago
5
Output Differs from Hugging Face Transformer Result and EasyDel Results
#116
jchauhan
closed
4 months ago
4
Updating Beta Branch
#115
erfanzar
closed
4 months ago
0
[Urgent] Exception while load AdaptLLM/medicine-chat, variant of llama
#114
jchauhan
closed
4 months ago
6
Added zephyr prompter
#112
jchauhan
closed
4 months ago
1
Easydel support on TPU v4.8 - getting exception
#111
jchauhan
closed
4 months ago
1
Support HuggingFaceH4/zephyr-7b-beta serving using EasyDel
#110
jchauhan
closed
4 months ago
1
None of the examples scripts works, that used to work earlier. Please test your examples again and update docs
#109
jchauhan
closed
4 months ago
1
GPT2 (150M model) support on Tv2.8. Example scripts goes out of memory
#108
jchauhan
closed
4 months ago
1
Next