bfloat16 Search Results

1000+ results
for bfloat16

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

dillonhuff/clockwork #163

BFloat implementation

Several of my applications use bfloat types, including applications intended to test the bfloat hardware on the CGRA. What is the plan for supporting bfloat16_t?

jeffsetter updated 3 years ago
2
pytorch/ao #639

Replace flash_4 with FlexAttention

https://github.com/pytorch-labs/segment-anything-fast/ uses [custom Triton code](https://github.com/pytorch-labs/segment-anything-fast/blob/main/segment_anything_fast/flash_4.py) to implement a varian…

cpuhrsch updated 1 month ago
15
cboutsikas/stoch_rounding_iplicit_reg #1

Use StochasticRounding.jl ?

Just came across https://arxiv.org/abs/2403.12278 🎉 And I see that you're actually using Julia. Just to point you to [StochasticRounding.jl](https://github.com/milankl/StochasticRounding.jl) which im…

milankl updated 7 months ago
3
pytorch/vision #8503

Can we add datatype support for examples under references

### 🚀 The feature currently the examples under references only support default datatype (float32), can we support a argument like --data-type to allow user to specify the datatype for the model? ###…

wincent8 updated 4 months ago
2
shenyunhang/APE #14

docker环境问题

1. 使用docker pull keyk13/ape_image:v1 拉取了在以上问题中提供的镜像 2. 但是在容器中没有找到xformers库，pip install xformers 会安装0.0.23版本，自动更新torch版本；如果安装0.0.17版本，会有以下报错 NotImplementedError: No operator found for `memory_efficie…

zhiwenhou1227 updated 5 months ago
11
netease-youdao/QAnything #233

RTX 2080 Ti 22GB 显存不支持 Bfloat16，能不能让 vllm 启动时配置 float16

**Please Describe The Problem To Be Solved** 在 Ubuntu 22.04 运行 ``` bash sudo ./run.sh -c local -i 1 -b vllm -m Qwen-7B-QAnything -t qwen-7b-qanything -p 1 -r 0.85 ``` 报错： qanything-cont…

ghost updated 4 months ago
3
AI-Hypercomputer/maxtext #1005

PGLE doesn't work for Tensor Parallelism

We observed good overlap with FSDP + PGLE: ![Bq7PCuqyJbygSuL](https://github.com/user-attachments/assets/0cff27c4-6499-43d0-b436-ef01a2833ae0). Turning on and off PGLE makes a big difference here. …

wang2yn84 updated 1 week ago
3
Lightning-AI/lightning-thunder #1252

Strength reduction: fold transpose into a subsequent GEMM ca…

## 🚀 Feature The program: ``` class DynamoModule(torch.nn.Module): def forward(self, L_intermediate_parallel_ : torch.Tensor, L_self_modules_dense_4h_to_h_parameters_weight_ : torch.nn. p…

tfogal updated 1 week ago
1
tenstorrent/tt-metal #12705

TTNN `to_layout` does not support UNet Shallow output shape

In UNet Shallow, the output tensor from the final layer is of shape=`[1, 1, 337920, 1[32]]`. It is in TILE layout but it would be faster if I converted it to RM layout (to eliminate the padding) befor…

esmalTT updated 3 weeks ago
2
yangjianxin1/Firefly #279

微调Qwen2-1.5B-Instruct，loss始终是0

如题，请问怎么解决呢

frederichen01 updated 4 months ago
7

上一页 1...94 95 96 97 98 99 100...100 下一页

1000+ results for bfloat16

1000+ results
for bfloat16