bmm Search Results - Githubissues

1000+ results
for bmm

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

foundation-model-stack/foundation-model-stack #192

Improve vLLM MoE fused kernel

The vLLM [fused moe kernel](https://github.com/vllm-project/vllm/blob/main/vllm/model_executor/layers/fused_moe.py) used for Mixtral uses the standard data parallel parallelization which works well wi…

AdnanHoque updated 4 months ago
2
pytorch/pytorch #106614

Case study of torch.compile / cpp inductor on CPU: min_sum /…

### 🐛 Describe the bug (I'll add actual benchmarking details and logs and output_code.py in a bit) I'm doing min_sum and mul_sum in two setups: 1. (D, ) x (D, ) -> scalar 2. (B, N, 1, D) x (B,…

vadimkantorov updated 6 months ago
17
kakaxi314/GuideNet #5

Question for the kernel design in multi-batch input.

Thank you for your nice work. Since the code is not yet open, I write down my question for your kernel design. In the paper, given the image feature information, you set this feature as the convol…

LifeBeyondExpectations updated 4 years ago
5
graykode/nlp-tutorial #75

Faster attention calculation in 4-2.Seq2Seq?

Thanks for sharing! Just found out `Attention.get_att_weight` is calculating attention in a for-loop? this looks rather slow isn't it? `4-2.Seq2Seq(Attention)/Seq2Seq(Attention).ipynb` ```pyth…

shouldsee updated 5 months ago
1
lkwq007/stablediffusion-infinity #102

[Bug] expected scalar type Half but found Float

**Describe the bug** Hello, After clicking on "Outpaint" in the screenshot below I get the following error: ![image](https://user-images.githubusercontent.com/4301170/197384390-e9e8672e-db9e-48b…

P4l1ndr0m updated 1 year ago
13
Cambricon/mlu-ops #1007

【新算子】- linalg.lu 算子开发

开发计划可参考以下节点： 1. 方案撰写，xx.xx~xx.xx 2. 开发自测，xx.xx~xx.xx 3. 提出 PR/MR，xx.xx~xx.xx 4. review（ 3个赞），xx.xx~xx.xx 6. maintainer 合入

PetrelYy updated 1 week ago
20
lancopku/SGM #16

请问在decode中前向传播的时候，为什么rnn的输入是标签向量和state，而没有经过attention后得到的c(t…

您代码里面的解码的前向传播没怎么看懂 `< def forward(self, inputs, init_state, contexts): if not self.config.global_emb: embs = self.embedding(inputs) outputs, state, attns = [], i…

HaimianYu updated 3 years ago
5
xcmyz/FastSpeech #81

error in new commit

hi @xcmyz after successful run of preprocess.py when i run train.py it gives following error ``` Use FastSpeech Model Has Been Defined Number of TTS Parameters: 25367169 Load data to buffer …

Ahmad-noborders updated 4 years ago
9
aleximmer/Laplace #111

Help for Running Laplace on Image Segmentation Tasks

Hello, I am using a U-Net augmentation (specifically: https://github.com/juntang-zhuang/LadderNet) to perform segmentation of hands. To be specific, I am classifying each pixel of an image to one o…

SouLeo updated 3 months ago
4
pytorch/pytorch #70008

Torch function runtime seemingly dependent on scipy call

### 🐛 Describe the bug From https://discuss.pytorch.org/t/torch-function-runtime-dependent-on-scipy-call/139483: I've noticed that certain PyTorch functions run slower when I make calls to `scipy.…

williamwen42 updated 2 years ago
1

上一页 1...33 34 35 36 37 38 39...100 下一页

1000+ results for bmm

1000+ results
for bmm