-
I've been working on porting FlashAttention-2 to pre-SM80 architectures (Turing and Volta) and was wondering if TK supports SM70 and SM75 hardware. Writing 100 lines of TK primitives sounds a lot easi…
-
Opening an issue to track this. I want to get an idea of how easy it would be for us to plumb support for codegen to output scalar values.
Currently the cases we are looking at are where the input is just …
-
I'm curious whether your vision includes making it a feature-complete NN training framework.
What is the master plan? Integrating with Torch/TF/MXNet, or building a hardware-level compilation framework f…
-
There are parts of Lucene that could potentially be sped up if computations were offloaded from the CPU to the GPU(s). With commodity GPUs having as much as 12GB of high-bandwidth RAM, we might be …
-
(C# DirectML int4 phi 3 mini onnx) Using genai api.
Certain very specific prompts crash, although I haven't yet found a pattern. It isn't related to the length of the prompt either, since certain sh…
-
## Type of Issue
Select the type of issue:
- [x] Bug report (to report a bug)
- [ ] Feature request (to request an additional feature)
- [ ] Tracker (I am just using this as a tracker)
- [ ] Re…
-
Hi, I'm trying to build caffe2 with GPU support.
The cmake configuration runs fine, but when building I get the output below. Can someone help me with that, please?
Thanks a lot!
### Syste…
elcou updated
5 years ago
-
Hi! I implemented a Python program that uses StarPU under the hood. The Python program simply calls Python/C++ wrappers, which pass execution to C++ routines, which then call StarPU task-related functio…
Muxas updated
6 months ago
-
I am looking for help creating a PoC tool that rapidly digests bcrypt hashes based on a list containing password:salt.
The goal for you is to extract an already working open-source GPU-based bc…
-
The built-in SVD extension was the reason I switched to Forge, but this and other built-in extensions and tabs are missing. When will they be available again?