tbenthompson / cutde

Python CPU and GPU accelerated TDEs, over 100 million TDEs per second!
MIT License
58 stars 15 forks source link

Add CUDA matrix-vector product functions that use the matrices output by `disp_block` and `disp_aca`. #12

Open tbenthompson opened 3 years ago

tbenthompson commented 3 years ago

Iterating over the blocks from Python is quite inefficient. See here: https://tbenthompson.com/book/tdes/hmatrix.html#a-matrix-vector-product