cuda-programming Search Results

1000+ results
for cuda-programming

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

HazyResearch/ThunderKittens #21

[Question] Supported compute capabilities?

I've been working on porting FlashAttention-2 to pre-SM80 architectures (Turing and Volta) and was wondering if TK supports SM70 and SM75 hardware. Writing 100 lines of TK primitives sounds a lot easi…

bayley updated 4 months ago
3
NVIDIA/Fuser #315

[Feature Request] codegen should be able to produce scalar o…

Opening an issue to track this. Want to get an idea on how easy it is for us to plumb support for codegen to output scalar values. Currently the cases we are looking at are where the input is just …

jjsjann123 updated 1 year ago
12
breandan/kotlingrad #8

Publish roadmap and contributing doc?

I'm curious if your visions include making it a feature-complete NN training framework? What will be the master plan? Integrating with Torch/TF/MXNet or build hardware-level compilation framework f…

tribbloid updated 4 years ago
18
apache/lucene #8796

Explore GPU acceleration [LUCENE-7745]

There are parts of Lucene that can potentially be speeded up if computations were to be offloaded from CPU to the GPU(s). With commodity GPUs having as high as 12GB of high bandwidth RAM, we might be …

asfimport updated 4 years ago
44
microsoft/onnxruntime-genai #833

Certain prompts crash for Phi 3 mini int4 DML (with simpler …

(C# DirectML int4 phi 3 mini onnx) Using genai api. Very specific certain prompts crash. Although I haven't yet found a pattern. It isn't to do with the length of the prompt either since certain sh…

elephantpanda updated 1 week ago
20
facebookresearch/fairo #527

'interaction_loggings.json' does not exist when run the agen…

## Type of Issue Select the type of issue: - [x] Bug report (to report a bug) - [ ] Feature request (to request an additional feature) - [ ] Tracker (I am just using this as a tracker) - [ ] Re…

snyxan updated 3 years ago
1
facebookarchive/caffe2 #1898

error: mismatched argument pack lengths while expanding ‘std…

Hi, I'm trying to build caffe2 with GPU support. cmake configuration runs fine but then when building I get the output as below. Can someone help me with that please ? Thanks a lot ! ### Syste…

elcou updated 5 years ago
18
starpu-runtime/starpu #42

How to run SimGrid simulations if my code just provides Pyth…

Hi! I implement a Python program, that uses StarPU under the hood. The Python program simply calls Python/C++ wrappers, which pass execution to C++ routines which then call StarPU task-related functio…

Muxas updated 6 months ago
14
f1l1b0x/bcryptopencldigest #1

opencl based bcrypt digest needed

I am looking for help creating a poc tool that is rapidly digesting bcrypt hashes based on a list containing password:salt The goal for you is to extract a already working open source GPU based bc…

f1l1b0x updated 4 years ago
5
lllyasviel/stable-diffusion-webui-forge #996

SVD missing

The built-in SVD extension was the reason to switch to Forge, but this and other built-in extensions and tabs are missing. When will they be available again?

akmesb updated 1 week ago
17

上一页 1...90 91 92 93 94 95 96...100 下一页

1000+ results for cuda-programming

1000+ results
for cuda-programming