spgemm Search Results - Githubissues

274 results for spgemm


kokkos/kokkos-kernels #1062

Work around kk_get_free_total_memory not being available for…

Some backends, e.g. `SYCL`, don't allow querying the amount of free device space. Nevertheless, `KokkosSPGEMM::compute_num_pool_chunks`, https://github.com/kokkos/kokkos-kernels/blob/0a1740890e84a145d6…

masterleinad updated 1 year ago
6
ginkgo-project/ginkgo #1486

NaN Residuals with CUDA GMRES+ParILUT

Hello, I have been testing ParILUT with GMRES with linear systems extracted from my application. I am seeing NaN residuals with the CUDA exec on H100. The versions of the exact same code but wit…

iontcheva updated 9 months ago
11
kokkos/kokkos-kernels #1113

Will the following operations cause errors?

When I want to change the first value, with OpenMP chosen and the number of threads set to be greater than 1 on an Intel CPU, like this: omp_set_num_threads(16); The program will take a long t…

yuanAIhan updated 3 years ago
4
libxsmm/libxsmm #805

Segfault in fsspmdm

I observe that libxsmm_fsspmdm_create is giving a segfault when ldb and ldc are large. The cutoff ldb/ldc value for segfault seems to vary a bit with the size of the A matrix. I managed to recreate…

semi-h updated 1 year ago
6
pytorch/pytorch #68323

sparse.mm: CUDA error: internal error when calling `cusparse…

Consider the following script: ```python A = torch.sparse_coo_tensor( indices=[ [1500, 1505, 1506], [8347, 8347, 8347], ], values = [1., 1., 1.], size = [2523, 13716], dev…

saluto updated 1 year ago
10
kokkos/kokkos-kernels #655

Updating KokkosSparse.hpp?

@srajama1 @ndellingwood @brian-kelley @vqd8a It seems that we might want to add new algorithms that were developed recently to that header: - KokkosSparse_spadd.hpp - KokkosSparse_spiluk.hpp - K…

lucbv updated 4 years ago
2
kokkos/kokkos-kernels #978

KokkosSparse_gauss_seidel_spec.hpp not compiling

With the fixes, kokkos PRs 4014 and 4029, kokkos-kernels PR #958, and trilinos PR 9123, this is the last remaining issue for building the new trilinos stack on Windows-LLVM without CUDA. In fil…

jrobcary updated 3 years ago
2
ginkgo-project/ginkgo #1240

Clean separation between functionality of core and device.

This issue is in reference to the discussion regarding having sequential operations run on the host rather than on the device kernels (reference, openmp, cuda etc). I would propose having a cl…

pratikvn updated 1 year ago
1
GraphBLAS/LAGraph #131

Algebraic Multigrid Solver implemented in GraphBLAS language…

Just noticed this nice community effort on GraphBLAS-based algorithms. I am curious if there are any attempts at, or interest in, translating a complete [AMG solver](https://en.wikipedia.org/wiki/Multigr…

learning-chip updated 2 years ago
3
IST-DASLab/sparsegpt #15

How should I verify the speedup effect of the algorithm?

As shown in the paper, the CUTLASS library is used for speedup, but I did not find any code that relies on it. How should I verify that SparseGPT is faster than dense models when doing inference? Even with end-…

moonlightian updated 1 year ago
4

