-
Hi. Is there a way to use rules_cuda to generate PTX? This would be useful for OptiX programming, for example.
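I'm not certain rules_cuda supports PTX output directly; as a workaround sketch, a plain `genrule` could wrap the `nvcc --ptx` invocation such a rule would need to run. The target and file names below are placeholders, not part of rules_cuda:

```python
# Hedged sketch: a Bazel genrule (names are placeholders) wrapping the
# nvcc invocation that emits PTX instead of object code, e.g. for OptiX.
genrule(
    name = "optix_programs_ptx",
    srcs = ["optix_programs.cu"],
    outs = ["optix_programs.ptx"],
    cmd = "nvcc --ptx -o $@ $(location optix_programs.cu)",
)
```

This assumes `nvcc` is on the PATH of the Bazel action; a proper rule would resolve the CUDA toolchain instead.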
-
Add a lecture based on the [numba_cuda notebook](https://colab.research.google.com/github/cbernet/maldives/blob/master/numba/numba_cuda.ipynb#scrollTo=5U0yngpWU1Sg) written by @jstac with an introduct…
-
I cannot reproduce the results with a 3090 graphics card. I suspect the cause lies in the CUDA programming.
-
There are a number of HIP functions that assume a device has been selected for the current thread and operate on that device, for example `hipModuleLoadDataEx`.
We need to set the correct HIP device duri…
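One possible shape for this is a save/select/restore pattern around the device-sensitive call. This is a hedged sketch, not the actual fix; `target_device` and `image` are placeholders, and it uses `hipModuleLoadData` for brevity:

```cpp
// Hedged sketch: save and restore the caller's HIP device around a call
// that implicitly uses the current device, such as hipModuleLoadData(Ex).
#include <hip/hip_runtime.h>

hipError_t load_module_on_device(int target_device, const void* image,
                                 hipModule_t* module) {
    int previous = 0;
    hipError_t err = hipGetDevice(&previous);   // remember caller's device
    if (err != hipSuccess) return err;

    err = hipSetDevice(target_device);          // select the device the
    if (err != hipSuccess) return err;          // module must live on

    err = hipModuleLoadData(module, image);     // operates on current device

    hipSetDevice(previous);                     // restore caller's device
    return err;
}
```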
-
When discussing the Thrust JIT support with the CCCL team, a question was raised regarding the usage of the `jit.thrust.device` policy in the test suite, e.g.:
https://github.com/cupy/cupy/blob/be5d7f…
-
As far as I can tell, stream priority isn't implemented in PyCUDA right now,
although CUDA makes it possible to assign priorities to streams, as documented here:
https://docs.nvidi…
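For reference, this is a hedged sketch of the underlying CUDA runtime calls that PyCUDA would need to expose (lower numbers mean higher priority in CUDA's convention):

```cpp
// Hedged sketch: query the valid stream priority range, then create a
// stream with the highest available priority. Requires a CUDA device.
#include <cuda_runtime.h>
#include <cstdio>

int main() {
    int least = 0, greatest = 0;
    cudaDeviceGetStreamPriorityRange(&least, &greatest);
    printf("priorities: least=%d greatest=%d\n", least, greatest);

    cudaStream_t high_prio;
    cudaStreamCreateWithPriority(&high_prio, cudaStreamNonBlocking, greatest);
    // ... launch latency-sensitive kernels on high_prio ...
    cudaStreamDestroy(high_prio);
    return 0;
}
```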
-
We should consider whether it is possible and desirable to automatically combine kernels into CUDA graphs to reduce the overhead of launching individual kernels.
Here is the relevant documentation:
- http…
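The mechanism would presumably build on stream capture. As a hedged sketch of the pattern (with `my_kernel` a placeholder), a sequence of launches is recorded once and then replayed with a single graph launch per iteration:

```cpp
// Hedged sketch: record kernel launches into a CUDA graph via stream
// capture, then replay the whole sequence with one launch call.
// Requires a CUDA device; d_data is device memory.
#include <cuda_runtime.h>

__global__ void my_kernel(float* data) { data[threadIdx.x] += 1.0f; }

void run_with_graph(float* d_data, int iterations) {
    cudaStream_t stream;
    cudaStreamCreate(&stream);

    cudaGraph_t graph;
    cudaStreamBeginCapture(stream, cudaStreamCaptureModeGlobal);
    my_kernel<<<1, 256, 0, stream>>>(d_data);   // captured, not executed
    my_kernel<<<1, 256, 0, stream>>>(d_data);
    cudaStreamEndCapture(stream, &graph);

    cudaGraphExec_t exec;
    cudaGraphInstantiate(&exec, graph, nullptr, nullptr, 0);
    for (int i = 0; i < iterations; ++i)
        cudaGraphLaunch(exec, stream);          // one call per whole graph
    cudaStreamSynchronize(stream);

    cudaGraphExecDestroy(exec);
    cudaGraphDestroy(graph);
    cudaStreamDestroy(stream);
}
```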
-
- see https://github.com/ObrienlabsDev/machine-learning/issues/10
## Use Cases
On NVIDIA GPUs, Tensor Cores deliver about 3.5x the performance of CUDA cores
### LLM and Generative AI
- https://github.…
-
I am trying to get TinyLlama working on the GPU with:
```bash
./TinyLlama-1.1B-Chat-v1.0.F32.llamafile -ngl 9999
```
But it seems it is not possible to allocate 66.50 MB of memory on my card, even if I j…
-
I'm not familiar with CUDA programming. Could you explain a little about the key factors in this implementation that bring the performance gain? Thanks a lot!