-
**Describe the bug**
Out of memory for a relatively small GPT-2 model with 150M parameters
```
jaxlib.xla_extension.XlaRuntimeError: RESOURCE_EXHAUSTED: XLA:TPU compile permanent error. Ran out of memory in me…
```
-
**Describe the bug**
Error while trying to run TinyLlama
```
File "/home/**/research/easydel/.venv/lib/python3.10/site-packages/EasyDel/modules/llama/modelling_llama_flax.py", line 933, in __call__
…
```
-
Hi @erfanzar,
Thanks for the great repo! It looks really useful for training open-source models on TPUs and GPUs!
I wonder if it would be easy to implement a feature that allows users to pass in pack…
-
**Describe the bug**
An error occurs while training TinyLlama on Kaggle
```
/root
/usr/local/lib/python3.10/site-packages/pydantic/_internal/_fields.py:149: UserWarning: Field "model_name" has conflic…
```
-
**Describe the bug**
Hi, I really appreciate your continued commitment to this project and to making it better and better. I'm one of the people who benefit greatly from it. Thank you.
Now, I am trying to fine…
-
**Describe the bug**
Hi, I ran the code below on a Kaggle TPU VM v3-8. When I set the attn_mechanism to "normal", it worked well. However, when I changed the attn_mechanism to "ring", the error below was raised. C…
-
**Describe the bug**
Hi, I tried to fine-tune the gemma-2b model with `sharding_array=(1, 1, 1, -1)` on a Kaggle TPU VM v3-8.
There are two batch-size-related parameters in TrainArguments: total_batch_size, …
-
**To Reproduce**
```
Time Took to Complete Task configure dataloaders (microseconds) : 0.3025531768798828
Time Took to Complete Task configure Model ,Optimizer ,Scheduler and Config (microsec…
```
-
```
ValueError: Loading this model requires you to execute custom code contained in the model repository on your local machine. Please set the option `trust_remote_code=True` to permit loading of thi…
```
-
**Describe the bug**
Getting the following error while running a Llama model after training it with EasyDel and converting it to Hugging Face format.
```
python serve_llama_tpu_easydel.py
Loading checkpoint…
```