-
Hi.
I tried madlad400, but there is a problem with the output when it is float16:
```
$ python convert.py --model google/madlad400-3b-mt
$ python t5.py --model google/madlad400-3b-mt --prompt "A ta…
```
-
### 🐛 Describe the bug
I used FSDP + ShardedGradScaler to train my model. Compared with apex.amp + DDP, my model's accuracy has decreased.
The DDP version looks like:
```
model, optimizer = amp.initial…
```
-
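For context on the comparison in the excerpt above, here is a minimal sketch of the native-AMP training step that the scaler performs; the model, optimizer, and data here are illustrative stand-ins, and under FSDP the scaler would be `torch.distributed.fsdp.sharded_grad_scaler.ShardedGradScaler`, which exposes the same `scale`/`step`/`update` API:

```python
import torch

# Illustrative stand-ins; under FSDP the model would be wrapped and the
# scaler replaced with ShardedGradScaler (same scale/step/update API).
model = torch.nn.Linear(8, 1)
optimizer = torch.optim.SGD(model.parameters(), lr=0.1)
use_cuda = torch.cuda.is_available()
scaler = torch.cuda.amp.GradScaler(enabled=use_cuda)

x, y = torch.randn(4, 8), torch.randn(4, 1)
with torch.autocast("cuda" if use_cuda else "cpu", enabled=use_cuda):
    loss = torch.nn.functional.mse_loss(model(x), y)

scaler.scale(loss).backward()  # scale the loss to avoid fp16 gradient underflow
scaler.step(optimizer)         # unscales grads; skips the step if inf/nan found
scaler.update()                # adjusts the scale factor for the next iteration
```
-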
Logs showing a failure to configure the toolchain using your automatic discovery code:
```
...
File "D:\appmana\.venv\Lib\site-packages\torch\_dynamo\output_graph.py", line 1465, in _call_user_…
```
-
### 🐛 Describe the bug
Note:
I know that bfloat16 should obviously not be used on a CPU model.
Maybe it's better practice to do `.to(self.device).to(bfloat16)` than `.to(bfloat16).to(self.devi…
-
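To illustrate the call-ordering point in the excerpt above (the module and device names here are illustrative, not from the issue), a minimal sketch:

```python
import torch

device = "cuda" if torch.cuda.is_available() else "cpu"

# Move first, then cast: the dtype conversion runs on the target device.
m_a = torch.nn.Linear(4, 4).to(device).to(torch.bfloat16)

# Cast first, then move: the conversion runs on the CPU before the transfer.
m_b = torch.nn.Linear(4, 4).to(torch.bfloat16).to(device)

# Either order yields the same final dtype and device; the difference is
# where the cast executes (bfloat16 kernel support on CPU can be limited).
assert next(m_a.parameters()).dtype == torch.bfloat16
assert next(m_b.parameters()).dtype == torch.bfloat16
```
-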
### Reminder
- [X] I have read the README and searched the existing issues.
### System Info
- `llamafactory` version: 0.9.1.dev0
- Platform: Linux-6.5.0-28-generic-x86_64-with-glibc2.35
- Python …
-
This is my script:
```
torchrun --nnodes 1 \
--nproc_per_node 8 \
-m open_clip_train.main \
--model RN50 \
--train-data 'datasets/cc3m/cc3m-train-{0000..0575}.tar' \
  --trai…
```
-
### Describe the issue
Issue:
I wanted to run the pre-training script `https://github.com/haotian-liu/LLaVA/blob/main/scripts/v1_5/pretrain.sh`, but it ends with a device-mismatch error. It seems that the…
-
### Description
`jax.nn.dot_product_attention` does the first dot product with `preferred_element_type=jnp.float32` (see [here](https://github.com/jax-ml/jax/blob/7f655972c47658768b6ecce752fa29c3a…
-
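A small sketch of the accumulation-dtype control the excerpt above refers to, shown here with `jnp.dot` for illustration (the attention function in the issue passes the same `preferred_element_type` keyword to its internal contraction):

```python
import jax.numpy as jnp

a = jnp.ones((2, 3), dtype=jnp.bfloat16)
b = jnp.ones((3, 4), dtype=jnp.bfloat16)

low = jnp.dot(a, b)  # result stays in bfloat16
acc = jnp.dot(a, b, preferred_element_type=jnp.float32)  # accumulate/return f32

assert low.dtype == jnp.bfloat16
assert acc.dtype == jnp.float32
```
-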
Has anyone been able to reproduce the results in Table 1 of the paper? Could you please share the inference script?
We use B=50 for each class and var_d16 for evaluation.
- report
|FID|IS|Pre|R…
-
I have been trying to fix this error for a while now, and the ongoing threads are of NO help.
I have checked these (and ALL issues on the HF community page for this model):
* https://github.com/Qwe…