-
Repost from the [PyTorch forum](https://discuss.pytorch.org/t/flex-attention-gaps-in-profiler/211917/1)
I have recently been playing with FlexAttention, trying to replace some of my custom Triton …
-
I encountered an issue when attempting to trace a CrossEncoder model using `torch.jit.trace`. The error occurs during tracing when the `forward` method is called. Below is a minimal reproducible…
-
### 🐛 Describe the bug
This issue was raised from tracing Megatron/xlformers, where `torch.distributed.all_reduce` was called in the backward of an `autograd.Function` and then it was rewritten b…
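For context, the pattern in question looks roughly like the following minimal sketch (the `AllReduceGrad` name and the single-tensor signature are illustrative assumptions, not the actual Megatron/xlformers code). The key point is that the collective runs inside `backward`, which is what any tracing or rewriting of the function has to preserve:

```python
import torch
import torch.distributed as dist

class AllReduceGrad(torch.autograd.Function):
    """Identity in forward; all-reduces the gradient in backward."""

    @staticmethod
    def forward(ctx, x):
        return x.clone()

    @staticmethod
    def backward(ctx, grad_output):
        # The collective fires inside backward; guard it so the sketch
        # also runs without an initialized process group.
        if dist.is_available() and dist.is_initialized():
            dist.all_reduce(grad_output)
        return grad_output

x = torch.ones(4, requires_grad=True)
AllReduceGrad.apply(x).sum().backward()
print(x.grad)  # without a process group, the gradient passes through unchanged
```

With a process group initialized, each rank's `x.grad` would instead hold the sum of gradients across ranks.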
-
### 🐛 Describe the bug
After quantizing a ResNet-18 model with PyTorch 2 Export post-training quantization, it is not possible to export the model.
```python
import torch
from torchvision.model…
-
Tested on commit: https://github.com/llvm/llvm-project/commit/6548b6354d1d990e1c98736f5e7c3de876bedc8e
Steps to reproduce:
```shell
mlir-opt test.mlir --gpu-module-to-binary=format=%gpu_compilation_format…
-
### Problem Description
I installed rPD and ran the tracing example following the README.md, but it aborts (fails):
```
root@tw024:/ws/Try_rPD# runTracer.sh python matmult_gpu.py
Creating empty rpd: tra…
```
-
When I try to use `compile_model` with CUDA as the specified device, I encounter the following error. Is there a way to resolve this, or is the `lora.py` code not yet compatible with running on a GPU?…
-
### Summary
On the current `main` (commit 861fb7ef87bf9c20ee7a4c1632e3852681cc8ef4), the single-chip performance of UNet is approx. 329 fps. Running the same test except data parallel on N300 meas…
-
The customer faces an issue with batch inferencing when using the approach proposed in https://github.com/aws-neuron/aws-neuron-sdk/issues/906:
"My output is in `Tuple[List[torch.Tensor]]`, which works well…
-
## Bug Description
The indices are a constant tensor, which gets const-folded into a frozen parameter. The `meta` of the frozen-param node is an empty dict, leading to a converter validation-check failure [here](https:…
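A minimal sketch of the underlying metadata issue (the `Gather` module is hypothetical, not the reported model): freshly created FX nodes carry an empty `meta` dict, and it is only a pass such as `ShapeProp` that fills in `meta["tensor_meta"]`, which is the kind of information a converter's validation check typically expects:

```python
import torch
import torch.fx
from torch.fx.passes.shape_prop import ShapeProp

class Gather(torch.nn.Module):
    def forward(self, x):
        idx = torch.tensor([0, 2])  # constant indices, a const-folding candidate
        return x[idx]

gm = torch.fx.symbolic_trace(Gather())

# Freshly traced nodes have no tensor metadata yet.
print([dict(n.meta) for n in gm.graph.nodes])

# ShapeProp executes the graph with a sample input and populates
# meta["tensor_meta"] on tensor-valued nodes.
ShapeProp(gm).propagate(torch.randn(4))
print([n.meta.get("tensor_meta") is not None for n in gm.graph.nodes])
```

A node created later (e.g. a frozen parameter introduced by const folding) starts from the same empty `meta`, so unless the folding pass copies or recomputes the metadata, downstream validation sees an empty dict.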