simt Search Results - Githubissues

495 results
for simt

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

NVIDIA/TensorRT #4020

TensorRT trtexec Layerwise Compute Precision, Sparsity, Tens…

## Description Previously, I remember I can use `--exportLayerInfo` to dump the comprehensive layerwise info of the engine, including the precision of the layer, and the IO tensor datatype and layo…

leimao updated 3 months ago
3
djgroen/FabFlee #5

rename simulation_period to something shorter?

`simulation_period` is a parameter which users frequently type on the command-line, but it's relatively lengthy to type. Should we use a shorthand phrase instead, such as `T` or `duration`? Please …

djgroen updated 1 year ago
2
NVIDIA/cutlass #1106

[QST] Lots of "incomplete type is not allowed" when changing…

I adapted from the CUTLASS conv2d example, and tried to change it to run with float16 and on sm86: Environment: CUDA Version: 12.0 Device: A10 ``` using ElementAccumulator = float; …

yibolu96 updated 7 months ago
8
celeritas-project/celeritas #127

Analyze performance impact of masking vs partitioning

In some of our kernels (namely, the `Model::interact` kernels in physics) only an arbitrary subset of the threads will be active in that particular kernel. There are two main ways to operate such kern…

sethrj updated 1 year ago
2
NVIDIA/cutlass #1103

[QST] Is it possible to achieve "same padding" like scipy.si…

I tried to preform batch 1D FIR filter by adapting conv2d_fprop_tensorop example, given a 2D continuous row major data layout for (M,N),which has M entry of N length 1D signal, A 1D filter of length S…

artmortal93 updated 1 year ago
7
TUDelft-CNS-ATM/bluesky #481

Getting Time From Bluesky Sim

Hello, I am trying to use the sim time in my plugin but I'm not sure how to access it. Based on the wiki I should use "settings.simtclock" I think, however, the only attribute that I am able to ac…

nl22-nmsu updated 1 year ago
2
hlt-mt/FBK-fairseq #3

Can't get EDATT to work

I cloned the FBK-fairseq repo (https://github.com/hlt-mt/FBK-fairseq.git), installed it following the instructions [here](https://github.com/hlt-mt/FBK-fairseq.git), and tried to run the EDATT model o…

RomanKoshkin updated 1 year ago
6
triton-lang/triton #2658

WSMaterialization generates invalid IR -- modifies module's …

In #5227, I am adding a verifier for an invariant of TritonGPU IR. The invariant is not new, it's just that we didn't check it before. The invariant is, if a tensor has a blocked layout with `warp…

jlebar updated 1 year ago
6
bytedance/byteir #41

[Compiler GPU] LLVM ERROR: operation destroyed but still has…

I'm getting `LLVM ERROR: operation destroyed but still has uses` when running `gpu-opt` pipeline. The erroring pass is `ConvertFuncToGPUPass`. error msg: ``` ../test.mlir:374:13: error: 'scf.for…

zhekunz2 updated 1 year ago
5
traveller59/spconv #575

Can't find algo Simt_**_SKD in prebuilt. compile with nvrtc.…

Hi Yanyan. Thanks for your excellent work! I tried to install the latest Spconv (cu113) with pip and run my task in an image with PyTorch 1.12.1 (PyTorch official image). And I encounter the follow…

Gofinge updated 1 year ago
9

上一页 1...19 20 21 22 23 24 25...50 下一页

495 results for simt

495 results
for simt