-
Hey Team,
I'm trying to use FSDP1/2 with Float8InferenceLinear but seems have some issues (with torch 2.3.1+cu118). Do you suggestion to bump to higher version of torch and have a try or maybe use …
-
Hi,
I have used a similar dataset as image_classification_albumentations.ipynb and reused the notbook code completely but model training failing with Target size (torch.Size([32, 224, 224, 3])) mus…
-
Hey! Great work on this project! I got it t work on a couple of t5 instruction tuned models from huggingface, I was just curious, has anyone been able to get the code to work with quantized modes? Cur…
-
Hi, I am trying out this great framework with a self trained GPT-2.
I wanted to use a custom trained model and the base model as tokenizer.
No matter if I use this approach or solely the base mo…
-
Hi~ could I train the model and update parameters by mutate prompting code?
-
Will there be collection for cvpr 2023?
-
When ran InternLM2 inference , it reported errors as below:
oneAPI :2024.0.1.46
ipex-llm: 2.1.0b2
transformers: 4.37.2 ,4.38.2
----------------------------------------------------------------…
-
**Describe the bug**
Hello. I'm an active user of deepspeed for multi-node training.
I've always used zero3, but this time I tried attaching the hpz feature of zero++ for the first time. The issue…
-
when I run the model,I got the error below.
Traceback (most recent call last):
File "/Users/scs/Desktop/simbert-master/test_longformer.py", line 41, in
output = model(input_ids, attention_…
-