local-and-global-modeling Search Results

1000+ results
for local-and-global-modeling

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

pytorch/ao #704

Question: How to use Float8InferenceLinear with FSDP1/2?

Hey Team, I'm trying to use FSDP1/2 with Float8InferenceLinear but seems have some issues (with torch 2.3.1+cu118). Do you suggestion to bump to higher version of torch and have a try or maybe use …

qingquansong updated 4 weeks ago
15
huggingface/notebooks #185

image_classification_albumentations.ipynb failing with Targe…

Hi, I have used a similar dataset as image_classification_albumentations.ipynb and reused the notbook code completely but model training failing with Target size (torch.Size([32, 224, 224, 3])) mus…

amitkml updated 2 years ago
2
abertsch72/unlimiformer #19

Working with 8bit and 4bit quantized models

Hey! Great work on this project! I got it t work on a couple of t5 instruction tuned models from huggingface, I was just curious, has anyone been able to get the code to work with quantized modes? Cur…

jordancole21 updated 1 year ago
10
shap/shap #2617

Explainer Index out of range

Hi, I am trying out this great framework with a self trained GPT-2. I wanted to use a custom trained model and the base model as tokenizer. No matter if I use this approach or solely the base mo…

LukasFides updated 2 weeks ago
23
infinitylogesh/mutate #2

How to train the model by prompting?

Hi~ could I train the model and update parameters by mutate prompting code?

liyang619 updated 9 months ago
2
tinatiansjz/hmr-survey #2

Will there be collection for cvpr 2023?

Will there be collection for cvpr 2023?

imabackstabber updated 1 year ago
3
intel-analytics/ipex-llm #11744

Run InternLM2 , reports error:TypeError: internlm2_attention…

When ran InternLM2 inference , it reported errors as below: oneAPI :2024.0.1.46 ipex-llm: 2.1.0b2 transformers: 4.37.2 ,4.38.2 ----------------------------------------------------------------…

johnysh updated 1 month ago
5
microsoft/DeepSpeed #4901

[BUG] ZERO++ | AssertionError: ZeRO parameter intra parallel…

**Describe the bug** Hello. I'm an active user of deepspeed for multi-node training. I've always used zero3, but this time I tried attaching the hpz feature of zero++ for the first time. The issue…

dhkim0225 updated 2 months ago
3
allenai/longformer #215

TypeError: forward() takes from 2 to 7 positional arguments …

when I run the model，I got the error below. Traceback (most recent call last): File "/Users/scs/Desktop/simbert-master/test_longformer.py", line 41, in output = model(input_ids, attention_…

SCS2017 updated 1 year ago
3
NVIDIA/apex #773

adding --fp16 to run_language_modeling and increase batch si…

mahdirezaey updated 4 years ago
1

上一页 1...6 7 8 9 10 11 12...100 下一页

1000+ results for local-and-global-modeling

1000+ results
for local-and-global-modeling