graviton3e Search Results

6 results
for graviton3e

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

OpenMathLib/OpenBLAS #4644

Multi-threaded DGEMM becomes less efficient on many-core CPU…

OpenBLAS DGEMM achieves high efficiency, for example, over 90% of peak performance with 1 thread on Graviton3E, but the efficiency drops to about 73% when running DGEMM with 64 threads. As is known, …

yamazakimitsufumi updated 6 months ago
1
OpenMathLib/OpenBLAS #4580

Openblas sgemm is slower for small size matrices in aarch64

I have built openblas in graviton3E with make USE_OPENMP=1 NUM_THREADS=256 TARGET=NEOVERSEV1. mkl is built in icelake machine. I have used openblas sgemm as `cblas_sgemm(CblasRowMajor, CblasNoTr…

akote123 updated 3 months ago
16
OpenMathLib/OpenBLAS #4590

The parameter GEMM_PREFERED_SIZE is not set for Neoverse V1

Hello. The parameter GEMM_PREFERED_SIZE is set for recent XEON and POWER CPUs, but no specific value is set for Arm CPUs in param.h. Isn’t it better to set the parameter, especially for CPUs with SIM…

tetsuzo-usui updated 7 months ago
2
openvinotoolkit/openvino.genai #438

FullyConnected nodes use slow reference kernel on ARM

@Wovchena This is related to #406 but goes deeper, hence I decided to make this a new issue. ## TL;DR - For `greedy_causal_lm` inference on `arm`, large matmuls (e.g. `1x2x4096:4096x4096` in query…

NishantPrabhuFujitsu updated 5 months ago
18
OSGeo/gdal #8164

Uppercased file extensions become lowercased in osgeo.gdal.D…

## Expected behavior and actual behavior. Given: - multi-file dataset (e.g., Shapefile) w/ one of the files having an uppercased extension (e.g., `.PRJ`) - e.g., "poly" Shapefile dataset from th…

gorloffslava updated 1 year ago
2
OSGeo/gdal #8165

CSV driver doesn't honor CSVT sidecar in Dataset.GetFileList…

## Expected behavior and actual behavior. Given: - CSV dataset w/ both `.csv` index and `.csvt` sidecar. - For example, `testcsvt.csv` and `testcsvt.csvt` from GDAL autotest data: https://github…

gorloffslava updated 1 year ago
5

6 results for graviton3e

Multi-threaded DGEMM becomes less efficient on many-core CPU…

Openblas sgemm is slower for small size matrices in aarch64

The parameter GEMM_PREFERED_SIZE is not set for Neoverse V1

FullyConnected nodes use slow reference kernel on ARM

Uppercased file extensions become lowercased in osgeo.gdal.D…

CSV driver doesn't honor CSVT sidecar in Dataset.GetFileList…

6 results
for graviton3e