-
### Reference code
- Llama-recipes code
[https://github.com/meta-llama/llama-recipes/tree/b7fd81c71239c67345d897c0eb6529eba076e8b8](https://github.com/meta-llama/llama-recipes/tree/b7fd81c71239c…
-
In transformers, as a rule, we always load models as `float32` for stability, even if the weights are stored in `bfloat16`. As a result, loading `llama-3-8B` can't be done lazily via mmap, since we have to …
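A back-of-the-envelope calculation shows why the upcast matters for an 8B-parameter model (the parameter count here is a round approximation for illustration, not the exact checkpoint size):

```python
# Rough memory footprint of an ~8B-parameter model at different dtypes.
# 8e9 is an illustrative approximation of the parameter count.
params = 8_000_000_000

bytes_fp32 = params * 4  # float32: 4 bytes per parameter
bytes_bf16 = params * 2  # bfloat16: 2 bytes per parameter

print(f"float32:  {bytes_fp32 / 1e9:.0f} GB")   # float32:  32 GB
print(f"bfloat16: {bytes_bf16 / 1e9:.0f} GB")   # bfloat16: 16 GB
```

So upcasting roughly doubles the host memory needed, and because the cast produces new tensors, the mmapped `bfloat16` pages can't simply be reused.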
-
Hi, I am very happy to have found a repo that can be used to fine-tune BLIP-2 quickly.
While using it (with LLaVA instruction data), I ran into some issues.
I only have a V100, and the model raises the following …
-
### Feature request
Let's add a new quantization method to LoRA, namely [optimum-quanto](https://github.com/huggingface/optimum-quanto).
There is some more context in [this diffusers issue](https:…
-
Hi All,
I'm trying to build the docker images in https://github.com/IntelAI/he-transformer/tree/master/contrib/docker on an Ubuntu 16.04 machine. When I run the command **make check_gcc**, it gives…
-
```
epoch 1/200
/home/ubuntu/tools/sd-scripts/.venv/lib/python3.10/site-packages/torch/utils/checkpoint.py:31: UserWarning: None of the inputs have requires_grad=True. Gradients will be None
…
```
-
Thanks for releasing your great work. I was wondering if there is a way to run the fine-tuning and zero-shot inference code on a GPU rather than a TPU? What kind of adjustments would I need to make?
Thanks
-
Hello, I have a problem with bandwidth when using GPUs 0,1 versus GPUs 6,7: the measured bandwidth differs between the two pairs.

```
export CUDA_VISIBLE_DEVICES=0,1
./build/all_gather_perf -b 16M -e 1024M -i 16777216 -g 2 -d bfloa…
```
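For context on how nccl-tests reports these numbers: `all_gather_perf` derives the bus bandwidth from the algorithm bandwidth using a `(n-1)/n` scaling factor. A minimal sketch of that arithmetic, with made-up illustrative sizes and timings:

```python
def all_gather_bus_bw(size_bytes: float, time_sec: float, n_ranks: int) -> float:
    """Bus bandwidth in GB/s as nccl-tests computes it for all_gather:
    algBw = size / time, busBw = algBw * (n - 1) / n."""
    alg_bw = size_bytes / time_sec / 1e9
    return alg_bw * (n_ranks - 1) / n_ranks

# Illustrative values only: 1 GiB moved in 50 ms across 2 GPUs.
print(round(all_gather_bus_bw(1 << 30, 0.05, 2), 2))  # 10.74
```

Different GPU pairs often sit on different links (NVLink vs. a PCIe host bridge, or different NUMA nodes), which is the usual cause of pair-to-pair bandwidth differences; `nvidia-smi topo -m` shows which link each pair shares.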
-
### Describe the issue
Hi,
I am using IPEX to apply bf16 to the SpeechT5 model. I use both `ipex.optimize(model, dtype=torch.bfloat16)` and `with torch.cpu.amp.autocast(enabled=True, dtype=torch.…
-
Error occurred when executing PhotoMaker_Zho:

```
cutlassF: no kernel found to launch!
File "E:\ComfyUI\Blender_ComfyUI\ComfyUI\execution.py", line 155, in recursive_execute
output_data, outp…
```