TypeError with Pardiso solver

reed-foster commented 7 months ago

I followed the instructions for installation (installing through PyPI) and ran through the quickstart. I ran the testing suite and all of the tests passed. However, I noticed a TypeError when I tried to change the solver type to use PyPardiso (which I installed using conda install -c conda-forge pypardiso):

Here's the stack trace:

Traceback (most recent call last):
  File "[.../lib/python3.10/site-packages/tdgl/solver/solver.py", line 799](http://localhost:8888/.../lib/python3.10/site-packages/tdgl/solver/solver.py#line=798), in solve
    data_was_generated = runner.run()
  File "[.../lib/python3.10/site-packages/tdgl/solver/runner.py", line 305](http://localhost:8888/.../lib/python3.10/site-packages/tdgl/solver/runner.py#line=304), in run
    success = self._run_stage(
  File "[.../lib/python3.10/site-packages/tdgl/solver/runner.py", line 416](http://localhost:8888/.../lib/python3.10/site-packages/tdgl/solver/runner.py#line=415), in _run_stage
    function_result = self.function(
  File "[.../lib/python3.10/site-packages/tdgl/solver/solver.py", line 675](http://localhost:8888/.../lib/python3.10/site-packages/tdgl/solver/solver.py#line=674), in update
    mu, supercurrent, normal_current = self.solve_for_observables(psi, dA_dt)
  File "[.../lib/python3.10/site-packages/tdgl/solver/solver.py", line 509](http://localhost:8888/.../lib/python3.10/site-packages/tdgl/solver/solver.py#line=508), in solve_for_observables
    mu = pypardiso.spsolve(operators.mu_laplacian, rhs)
  File "[.../lib/python3.10/site-packages/pypardiso/scipy_aliases.py", line 44](http://localhost:8888/.../lib/python3.10/site-packages/pypardiso/scipy_aliases.py#line=43), in spsolve
    solver._check_A(A)
  File "[.../lib/python3.10/site-packages/pypardiso/pardiso_wrapper.py", line 227](http://localhost:8888/.../lib/python3.10/site-packages/pypardiso/pardiso_wrapper.py#line=226), in _check_A
    raise TypeError(msg)
TypeError: PyPardiso requires matrix A to be in CSR or CSC format, but matrix A is: <class 'scipy.sparse._csc.csc_array'>

It looks like when mu_laplacian is generated, it gets generated as a csc_array, but PyPardiso requires a csc_matrix.

If the following section is modified: https://github.com/loganbvh/py-tdgl/blob/ac8b2d9e07b9c681fe6c72fb04fd0dbbbd856840/tdgl/finite_volume/operators.py#L300-L301

if self.sparse_solver is SparseSolver.CUPY:
...
elif self.sparse_solver is SparseSolver.PARDISO:
+ self.mu_laplacian = sp._csc.csc_matrix(self.mu_laplacian)
  self.mu_laplacian_lu = None

Then the simulation runs fine (although it's not any faster than using the default SparseLU solver, but perhaps that's just because of the structure of the example simulation geometry).

loganbvh commented 7 months ago

Hi Reed,

Thanks for reporting this. scipy sparse arrays and scipy sparse matrices should be interchangeable in this context, so it makes sense that your fix worked. I chose to use the sparse array type in pyTDGL because that's what's recommended by scipy (see note here). This check in PyPardiso is unnecessarily restrictive. I opened a PR in PyPardiso to address this https://github.com/haasad/PyPardiso/issues/68.

Other people have also reported that the MKL pardiso solver is not any faster than SuperLU despite being multithreaded, so you're probably better off just using SuperLU. If you have access to an NVIDIA GPU, using the GPU + SuperLU is the fastest combination I have found. If you're interested my testing is here https://github.com/loganbvh/py-tdgl/issues/34#issuecomment-1732427524

reed-foster commented 7 months ago

Hi Logan,

Thanks, that makes sense. After doing some further testing with the quickstart example, it definitely seems like SuperLU is the best choice for this geometry/mesh. Interestingly enough, using my NVIDIA GPU seems to slow things down for the quickstart example (4,671 mesh sites). When I monitor the GPU with nvidia-smi, it seems like the GPU utilization is rather low (typically around 10% utilization; not much more than when loading a webpage e.g., except when everything is solved on the GPU with CUPY where it reaches >90% utilization). I guess this is because the mesh is relatively small?

Here's the mesh information:

{
  'num_sites': 4671,
  'num_elements': 8748,
  'min_edge_length': 0.037649046290826015,
  'max_edge_length': 0.251608033926911,
  'mean_edge_length': 0.14058280371468113,
  'min_area': 0.0008161429472105218,
  'max_area': 0.035097437653613554,
  'mean_area': 0.016480442066212443,
  'coherence_length': 0.5,
  'length_units': 'um',
}