bmm Search Results - Githubissues

1000+ results
for bmm

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

iree-org/iree #16128

[Stream] Transient buffer adding extra copies to Llama2 infe…

In the Llama2 model, the concatenation of the growing context is currently getting lowered into a copy to a transient buffer before copying into the global variable. The global_state tensor is origina…

Max191 updated 8 months ago
45
SolidRusT/srt-model-quantizing #3

Llama 3.1 Quantization - Expected all tensors to be on the s…

I wanted to quantize `model_name = "cognitivecomputations/dolphin-2.9.4-llama3.1-8b"` But i am getting an error: ``` import os os.environ['model_name'] = model_name model_name_awq = model_name.sp…

vackosar updated 1 month ago
1
tenstorrent/tt-metal #8112

ttnn.matmul - allow ND tensors to remove reshape

Will need to handle nD tensors for matmul. Tensors of rank 1 should fail since that's not even a matrix. For other tensors, should assume the product of ranks 0..size(ranks)-2 is the batch size.…

bbradelTT updated 4 months ago
6
pytorch/functorch #984

No Batching rules for aten::_linalg_solve_ex, aten::linalg_s…

TL;DR - `torch.linalg.slogdet` is over one order of magnitude slower in computing per-sample gradients in the latest nightly version of PyTorch/FuncTorch (`1.13.0.dev20220721` / ` 0.3.0a0+e8a68f4`) th…

AlphaBetaGamma96 updated 1 year ago
16
MegaMek/megamek #1486

Request: Add option to skip firing turn for mechs/vees that …

Since they can't fire any weapons while sprinting there is no point in controlling them during the weapon phase, so the game could skip them like it does for mechs with no melee targets in the physica…

Tamren updated 5 days ago
9
pytorch/pytorch #84039

[MPS] MPSNDArray error: product of dimension sizes > 2**31

### 🐛 Describe the bug ## Full error message (no traceback): ``` AppleInternal/Library/BuildRoots/20d6c351-ee94-11ec-bcaf-7247572f23b4/Library/Caches/com.apple.xbs/Sources/MetalPerformanceShaders…

junukwon7 updated 6 months ago
36
Xinyu-Yi/TransPose #66

关于论文公式和代码对应问题

paper上说： ![image](https://github.com/Xinyu-Yi/TransPose/assets/11289552/53789574-572b-448f-9489-d78c56f4b630) 这里左乘“旋转矩阵的逆”，相当于变换参考系，我也认为应该这样做很合理，为什么代码里面却不是呢？ ```python def normalize_and_concat(glb…

jiakechong1991 updated 3 months ago
3
bfGraph/STGraph #11

Seastar - RGCN

Seastar's original implementation does not present a vertex centric program for RGCN, it rather uses a handwritten kernel in dgl-hack. Let's try to write a vertex-centric program for RGCN, this issue …

JoelMathewC updated 1 year ago
4
neuraloperator/neuraloperator #289

(<class 'RuntimeError'>, RuntimeError('Unsupported dtype Hal…

I am running on 'MPS' which does not support the datatype Complex64. Initally, I got: ``` RuntimeError: MPS device does not support bmm for non-float inputs ``` So, I tried setting ``` fno_bl…

dbl001 updated 3 months ago
3
bcc-code/bmm-web #416

Design: dynamic cover size

Currently the covers (ItemCard in code) have a fixed size of 208px. But as shown in Figma, the size is supposed to be flexible (158px - 250px) based on how much space there is. CSS Grid sounds like a…

kkuepper updated 3 months ago
3

上一页 1...61 62 63 64 65 66 67...100 下一页

1000+ results for bmm

1000+ results
for bmm