easydel Search Results - Githubissues

88 results
for easydel

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

erfanzar/EasyDeL #80

Error while serving model as per documentation,

**Describe the bug** Error while serving a toy model using EasyDel with the following exception. **To Reproduce** Context * Using Google TPU VM * Follow the instructions as suggested here ht…

jchauhan updated 10 months ago
2
erfanzar/EasyDeL #128

Transformers-like API for inference

Is there a versatile transformers-like API (like model.generate()) equivalent for this? I tried JAXServer but it is quite confusing, and I couldnt get flashattention to work. Could you maybe provide s…

Froggy111 updated 8 months ago
19
erfanzar/EasyDeL #87

Error while running a GPT2 model

An exception is generated while running a gpt2 format model as show below **To Reproduce** Prepare to serve a model ``` run examples/serving/causal-lm/artgpt2tox-chat.py --pretrained_model_nam…

jchauhan updated 10 months ago
3
erfanzar/EasyDeL #83

Training on TPU Using Flash Attention

Hi, I tried finetune a model on TPU VM v3-8. when not using flash attention, it works. However, when I set config.use_flash_attention =True, an error occurs: block_q=1024 should be smaller or equal to…

IvoryTower800 updated 10 months ago
9
erfanzar/EasyDeL #71

Help train on tpu v3-32

#34 I read this issue and tried it. but couldn't make it work :( Hi, Thank you for your amazing work. I've been trying few days to make tpu v3-32 to work. I used tpu VM 'tpu-ubuntu220…

StableFluffy updated 10 months ago
12
erfanzar/EasyDeL #78

RESOURCE_EXHAUSTED: XLA:TPU compile permanent error

**To Reproduce** Use a TPU v2_8 with vm architecture ``` Time For configure functions and sharding them (ms) : 2012.7429962158203 Action : Sharding Passed Parameters Model Contain 1.1000483…

jchauhan updated 10 months ago
1
erfanzar/EasyDeL #69

The model generate repeated words.

**Describe the bug** I use the example code provided on the documentation (https://erfanzar.github.io/EasyDeL/Llama2/). But I changed the model to 'hfl/chinese-alpaca-2-7b-16k'. (The model is base…

IvoryTower800 updated 10 months ago
15
erfanzar/EasyDeL #86

Output of Tiny Llama using Easydel vs hugging face transform…

**Describe the bug** A clear and concise description of what the bug is. **To Reproduce** Output from Hugging Face Transformer APIs on local env ``` You are an oracle who knows the anwer…

jchauhan updated 10 months ago
4
erfanzar/EasyDeL #70

Dependency Error on latest version

**Describe the bug** pip install easydel /usr/local/lib/python3.8/dist-packages/pkg_resources/__init__.py:123: PkgResourcesDeprecationWarning: 0.1.36ubuntu1 is an invalid version and will not be sup…

StableFluffy updated 11 months ago
1
erfanzar/EasyDeL #90

Step time increasing as training progresses

In one of the longer training runs that is now running on a tpu-v3-8 I noticed the training ETA kept getting later and later. Also in the step-time wandb log (picture below) the higher the step numbe…

yhavinga updated 10 months ago
8

上一页 1...3 4 5 6 7 8 9...9 下一页

88 results for easydel

88 results
for easydel