issues
search
sarah-quinones
/
gemm
MIT License
76
stars
12
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Performance improvement: multithreaded gemv_rowmajor
#32
bgergely0
opened
3 weeks ago
0
gemm_f16: Build fails in debug mode for AArch64
#31
brunocaballero
opened
5 months ago
5
Build fails on i386: no function or associated item named `new` found for struct `CpuId` in the current scope
#30
yurivict
opened
5 months ago
1
Maybe bump raw-cpuid version?
#29
wisp3rwind
opened
6 months ago
0
Don't crash on Linux machines with L4 cache
#28
dfyz
opened
6 months ago
2
Update `diol` dependency
#27
abrown
closed
6 months ago
1
Provide benchmark with throughput units (GFlops/s TFlops/s)
#26
mratsim
opened
7 months ago
1
Compilation error when compiling to aarch64-apple-ios
#25
santiagomed
opened
7 months ago
2
Low Level API with Pre Allocated Work Space Exposed
#24
RoyiAvital
opened
1 year ago
0
Apple amx
#23
sarah-quinones
closed
12 months ago
1
Add GemmType trait for dispatching gemm fn calls
#22
ivarflakstad
opened
1 year ago
2
try out fcma
#21
sarah-quinones
closed
1 year ago
1
Candle example uses 10% of CPU when fma is active for x86
#20
kstavro
opened
1 year ago
7
Prepacking
#19
sarah-quinones
closed
1 year ago
0
Generic simd
#18
sarah-quinones
closed
1 year ago
0
Gemv example
#17
LaurentMazare
closed
1 year ago
0
[do-not-merge] Faster gemv using simd
#16
LaurentMazare
closed
1 year ago
0
[DUMMY] F16 lane
#15
Narsil
opened
1 year ago
0
F16 intrinsics standalone
#14
Narsil
closed
1 year ago
0
M1 f16 intrinsics
#13
Narsil
opened
1 year ago
0
Fixing large multi-threading by chunking on fewer threads.
#12
Narsil
opened
1 year ago
0
This improves drastically overthreading issue (>48cores)
#11
Narsil
opened
1 year ago
2
Slow parallelism on large number of threads
#10
Narsil
opened
1 year ago
0
Adding SIMD128 for wasm.
#9
Narsil
opened
1 year ago
0
F16 vectorize pack
#8
LaurentMazare
closed
7 months ago
2
`const` misused as `static`
#7
cbeuw
closed
1 year ago
2
Support for Mixed precision f32-f16
#6
mert-kurttutan
opened
1 year ago
1
Regarding Panic Comment in gemm.rs
#5
mert-kurttutan
opened
1 year ago
1
[Question] Suggested way to use Parallelism for libraries using gemm?
#4
coreylowman
closed
1 year ago
3
Neon and Web Assembly Support
#3
mert-kurttutan
opened
1 year ago
3
std::any::TypeId is different for types from different versions of the same library
#2
coreylowman
closed
1 year ago
2
remove packed_rhs
#1
lvtuwjl
closed
1 year ago
1