Currently the call to StreamingQueryDNADatabase.py doesn't specify the threads argument. This results in CMash's StreamingQueryDNADatabase.py using the default of multiprocessing.cpu_count() which returns the number of CPUs in the system.
This can cause application performance and memory issues on batch systems (e.g. Slurm) which contain hosts with large core counts.
Currently the call to StreamingQueryDNADatabase.py doesn't specify the threads argument. This results in CMash's StreamingQueryDNADatabase.py using the default of multiprocessing.cpu_count() which returns the number of CPUs in the system.
This can cause application performance and memory issues on batch systems (e.g. Slurm) which contain hosts with large core counts.