-
## 🚀 Model / language coverage
I'm trying to get a fuller picture of what we need to support NeVA. As such I'm using:
```python
def thunder_backend(gm, args):
    gm.real_recompile()
    from thu…
-
Hello team,
Please, I need help solving this issue; the test is failing:
python lm_inference_test.py --meliad_path=$MELIAD_PATH --data_path=$DATA
I0130 03:37:30.642391 139830854076224 nn_comp…
-
```python
NotImplementedError: No operator found for `memory_efficient_attention_forward` with inputs:
query : shape=(200, 9126, 1, 64) (torch.float32)
key : shape=(200, 912…
-
The softmax in tt-lib will fail if the tensor is in tile layout with a shape of 8,2,2 padded to 8,32,32, using a pad value of zero instead of -inf.
We have a workaround; at the moment we fall back…
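The pad value matters because softmax normalizes over every slot in the padded row: a pad of 0 still contributes exp(0) = 1 to the denominator and dilutes the real probabilities, whereas a pad of -inf contributes exp(-inf) = 0 and drops out. A minimal plain-Python sketch of the difference (illustrative only, not the tt-lib kernel):

```python
import math

def softmax(xs):
    # Numerically stable softmax over a list of floats.
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    s = sum(exps)
    return [e / s for e in exps]

# A "real" row of 2 values, padded out to length 4.
row = [1.0, 2.0]

# Zero padding: the pad slots contribute exp(0) = 1 each to the
# denominator, so the probabilities of the real entries shrink.
zero_padded = softmax(row + [0.0, 0.0])

# -inf padding: exp(-inf) = 0, so the pad slots vanish and the first
# two entries match the softmax of the unpadded row exactly.
inf_padded = softmax(row + [float("-inf"), float("-inf")])

print(softmax(row))     # reference result for the unpadded row
print(zero_padded[:2])  # diluted relative to the reference
print(inf_padded[:2])   # matches the reference
```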
-
### Is there an existing issue for this?
- [X] I have searched the existing issues
### Describe the bug
bfloat16 and float16 vectors do not support the pandas DataFrame data type, and the error is not user-fr…
-
### 🐛 Describe the bug
Repro: https://github.com/pytorch-labs/ao/pull/93
currently the CPU time for running weight-only int4 quantization seems to be slow on x86; it looks the same as the unlowered CPU m…
-
### Reference code
- Llama-recipes code
[https://github.com/meta-llama/llama-recipes/tree/b7fd81c71239c67345d897c0eb6529eba076e8b8](https://github.com/meta-llama/llama-recipes/tree/b7fd81c71239c…
-
Hi. I'm raising this issue as I am experiencing much slower inference times with Gemma-1 models.
> Environment:
> - xformers 0.0.26.post1 pypi_0 pypi
> - unsloth …
-
Hi, I am very happy to find a repo that can be used to fine-tune BLIP-2 quickly.
While using it (with LLaVA instruction data), I ran into some issues.
I only have a V100, but the model produces the following …
-
Hi,
thanks for the hard work!
Would it be possible to unload the model from VRAM after a certain amount of time?
For testing and under VRAM constraints, when using multiple services, that would be really helpful.
…
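For what it's worth, a generic idle-timeout pattern could drive such an unload: restart a countdown on every request, and fire an unload callback once it expires. A minimal sketch, assuming a service that calls `touch()` per request (`IdleUnloader` and the callback are hypothetical names; the callback would be whatever actually frees the model, e.g. moving it to CPU and emptying the CUDA cache):

```python
import threading

class IdleUnloader:
    """Invoke `unload` once no request has arrived for `timeout` seconds.

    Hypothetical helper: the real service would pass a callback that
    actually frees the model from VRAM.
    """

    def __init__(self, unload, timeout):
        self._unload = unload
        self._timeout = timeout
        self._timer = None
        self._lock = threading.Lock()

    def touch(self):
        """Call on every request to restart the idle countdown."""
        with self._lock:
            if self._timer is not None:
                self._timer.cancel()  # a request arrived; cancel the pending unload
            self._timer = threading.Timer(self._timeout, self._unload)
            self._timer.daemon = True
            self._timer.start()


# Usage sketch: mark a stand-in "model" unloaded after 0.2 s of inactivity.
state = {"loaded": True}
unloader = IdleUnloader(lambda: state.update(loaded=False), timeout=0.2)
unloader.touch()  # a request came in; the countdown restarts
```

The lock keeps `touch()` safe when requests arrive concurrently; each call cancels the previous timer so only the last countdown can fire.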