TensorBFS/CuTropicalGEMM.jl
The fastest Tropical number matrix multiplication on GPU
MIT License
9 stars, 0 forks
issues
Benchmark image broken in README
#28
GiggleLiu
opened
7 months ago
0
Add support of CuStream control
#27
ArrogantGao
closed
9 months ago
4
Add device detector
#26
ArrogantGao
closed
11 months ago
3
revised the binary dependency, and new version 0.1.1
#25
ArrogantGao
closed
11 months ago
1
Added CUDA version detect
#24
ArrogantGao
closed
11 months ago
6
Silent break on CUDA@v12.2
#23
GiggleLiu
closed
10 months ago
4
Example: generic tensor network
#22
GiggleLiu
opened
11 months ago
7
TagBot trigger issue
#21
JuliaTagBot
closed
11 months ago
3
Using TropicalGemmC_jll.jl and removing ./deps
#20
ArrogantGao
closed
1 year ago
0
Failure of BenchmarkTools
#19
ArrogantGao
opened
1 year ago
3
Optimizations for long and narrow matrices
#18
ArrogantGao
opened
1 year ago
1
Polish README
#17
GiggleLiu
closed
11 months ago
2
Consider adding more tropical number types
#16
GiggleLiu
opened
1 year ago
0
fixed a sync problem
#15
ArrogantGao
closed
1 year ago
4
Setting up a benchmark repo
#14
GiggleLiu
closed
11 months ago
2
Register package (after setting up CI)
#13
GiggleLiu
closed
11 months ago
0
Cleanup repo
#12
GiggleLiu
closed
1 year ago
1
Setup CI/CD
#11
GiggleLiu
closed
11 months ago
0
Unstable result in the GenericTensorNetwork example
#10
GiggleLiu
closed
1 year ago
4
Direct transpose and overloaded `LinearAlgebra.mul!`
#9
ArrogantGao
closed
1 year ago
27
Overwrite `LinearAlgebra.mul!`
#8
GiggleLiu
closed
1 year ago
1
Invoke different routines based on tensor element type
#7
GiggleLiu
closed
1 year ago
1
C cuda
#6
ArrogantGao
closed
1 year ago
11
Max mul, recover the close PR#2
#5
ArrogantGao
closed
1 year ago
0
More semiring algebras
#4
GiggleLiu
closed
1 year ago
1
Added the max-mul operation and benchmark by Nvidia Nsight Compute
#3
ArrogantGao
closed
1 year ago
3
Investigate the performance issues and consider moving to GemmKernels.jl
#2
GiggleLiu
opened
1 year ago
5
Added wrapped C cuda code and runable examples
#1
ArrogantGao
closed
1 year ago
6