linnanwang / BLASX

a heterogeneous multiGPU level-3 BLAS library
45 stars 11 forks source link

CPU Level Parallelism #3

Open romanlarionov opened 8 years ago

romanlarionov commented 8 years ago

Hello!

This library looks great, but I was wondering if it has CPU multi-threading blas capabilities. Reading through the code for some of the *gemm files, it almost appears to be the case.

I'm trying to perform a benchmark on AWS between g2 and c4 instances. I was hoping to find some way of writing a single code base that will perform the same function on the two different instances.

linnanwang commented 8 years ago

For CPU multi-threading, it depends on what CPU BLAS you link, and how you configure them.

Please don't pay too much attention to CPU, this is a multiGPU BLAS. For more details, you can visit cuBLAS-XT. They are pretty same, but different approach.

romanlarionov commented 8 years ago

Alright, good to know and thanks for the lead.

Btw, are there any plans for supporting double complex based operations in the near future?

linnanwang commented 8 years ago

good catch. I just merged ZGEMM a few seconds ago.

romanlarionov commented 8 years ago

Ohh I didn't know that was a valid pull request. Is the current master golden? I pretty much need to support zgemm and would rather use this over cublas-xt.

linnanwang commented 8 years ago

Yes, the current master has ZGEMM.