issues
search
ROCm
/
hipBLASLt
hipBLASLt is a library that provides general matrix-matrix operations with a flexible API and extends functionalities beyond a traditional BLAS library
https://rocm.docs.amd.com/projects/hipBLASLt/en/latest/index.html
MIT License
64
stars
89
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
add BGRADB NT Gridbase
#1393
Jinp800125
opened
5 hours ago
0
Fix UnrollLoopSwapGlobalReadOrder=1 bug for small GRVW.
#1392
hcman2
opened
16 hours ago
2
Fix: abnormal terminal output if the first solution is rejected when printWinnerOnly is enabled
#1391
jichangjichang
opened
17 hours ago
0
Aquavanjaram942 20CU Tune HHS NN and TN GEMM sizes, equality and grid library
#1390
aferoz21
opened
20 hours ago
0
Add extended profile logging along with flush and rotating size
#1389
NaveenElumalaiAMD
opened
1 day ago
0
update gfx942 gridbased bf16 TN/NN
#1388
aazz44ss
closed
19 hours ago
1
Bump rocm-docs-core from 1.8.3 to 1.9.0 in /docs/sphinx
#1387
dependabot[bot]
closed
1 day ago
0
Update hstu bmm logic yaml.
#1386
hcman2
closed
19 hours ago
0
question about FP16 compute_type
#1385
jinz2014
opened
1 day ago
3
PrintSolutionRejectionReason = True doesn't work at 4_LibraryClient stage
#1384
Jay0521
closed
1 day ago
3
update the lock mechanism in the user offline tuning tool
#1383
Jay0521
opened
2 days ago
0
Logic fix to exclude streamk by default
#1382
mahmoodw
opened
4 days ago
0
Find python
#1381
ellosel
closed
1 day ago
0
Fp8 tuning upstream
#1380
fsx950223
opened
4 days ago
0
Fp8 tuning
#1379
fsx950223
closed
4 days ago
0
Update 35 Equality logic yaml sizes.
#1378
geotseng-amd
closed
3 days ago
1
[Experimental] hipBLASLt tensor swizzling integration
#1377
Serge45
opened
4 days ago
0
Add gfx942 xf32 NN/NT/TN Equality yamls for 1105 xf32
#1376
AndySu12
closed
4 days ago
1
update gfx942 xf32 freesize
#1375
aazz44ss
closed
4 days ago
1
Code object compression via bundling
#1374
bstefanuk
opened
5 days ago
0
Avoid divide by 0 when calculating predicted performance with streamk
#1373
daineAMD
closed
3 hours ago
0
update 38 Equality logic yaml sizes
#1372
mengzcai
closed
3 days ago
0
Fix: incorrect required workspace size for singleKernel GSU
#1371
jichangjichang
closed
4 days ago
1
Update 12 Equality logic yamls.
#1370
mengzcai
closed
1 day ago
2
[Sparse] fix sparse kernel generation failure
#1369
Jay0521
closed
1 day ago
0
[Hotfix] Disable setOccupancyLimit for gfx120X
#1368
KKyang
closed
6 days ago
2
Remove PackageLibrary option
#1367
ellosel
opened
6 days ago
1
Skip the very first cold iteration from gpu time measurement
#1366
TomokoKurotobi
closed
4 days ago
2
Library Logic Format Simplification
#1365
b-shi
opened
1 week ago
6
Add setOccupancyLimit
#1364
KKyang
closed
1 week ago
2
Bump rocm-docs-core from 1.8.3 to 1.8.5 in /docs/sphinx
#1363
dependabot[bot]
closed
1 day ago
1
gridbased search for batched gemm
#1362
aazz44ss
closed
4 days ago
1
Remove alias for MirrorDims in logic yaml
#1361
alex391a
closed
6 days ago
0
Fix F32 FMAC Perf Bugs for gfx11/12
#1360
wenchuanchen
opened
1 week ago
0
Fix invalid stream-k test case, make dynamic grid the default
#1359
AlexBrownAMD
closed
4 days ago
4
Refactoy the pack scheduling for scheduleIterAlg = 3.
#1358
vin-huang
opened
1 week ago
0
Bump rocm-docs-core from 1.8.3 to 1.8.4 in /docs/sphinx
#1357
dependabot[bot]
closed
1 week ago
1
Modify to check if alpha is in host memory.
#1356
geotseng-amd
opened
1 week ago
1
Remove Min/Max/TotalVgprNumber in Common.py
#1355
KKyang
closed
1 week ago
2
Regression Tree for ranking solutions
#1354
yenong-amd
opened
1 week ago
0
[OPT] Optimize tail loop
#1353
briannwu
opened
1 week ago
0
[BB] fix build break with ROCM build# < 14361
#1352
cmingch
closed
1 week ago
0
Update gfx942 BBS/S NT/TN/TT GridBased yamls for 1105 MRS Training
#1351
AndySu12
closed
1 week ago
1
Fix CI errors: no DeviceMaxFreq in GroupedGemm test
#1350
jichangjichang
closed
1 week ago
1
Add sgpr occupancy
#1349
KKyang
closed
1 week ago
0
Revert "Use stream-k dynamic grid size model by default"
#1348
jichangjichang
closed
1 week ago
1
Add initial optional stream-k libraries
#1347
AlexBrownAMD
opened
1 week ago
1
Change syntax of Union for earlier python versions
#1346
daineAMD
closed
5 days ago
1
gfx942 38cu F8BS NN TN NT grid tune
#1345
m-kim
closed
4 days ago
1
Set Python_ROOT virtual.env
#1344
ellosel
closed
5 days ago
0
Next