Closed catid closed 1 month ago
Ah the bug is because it should be sizeof(float)*nbEle
here. I guess no one is really using this compressor yet.
Thanks for pointing out. This line seems to be in the host pointer function (input on CPU and output on CPU, just use GPU for compression), which is not as well maintained as device pointer functions (input on GPU and output on GPU). I guess you can temporarily try these two device-pointer functions:
Then move data based on your requirements from GPU to CPU. Hope this can help to resolve this issue.
BTW, I am working on a major update to cuSZp, including the 2x~3x throughput and higher compression ratio (by a new lossless compression algorithm) [link]. The issue you mentioned will be updated by then. This will be done within July this year. Thanks again for your interest :)
Thanks for your work! I've released my own compression project here you may find interesting: https://github.com/catid/cuda_float_compress
Examples crash using Nivida 4090 GPU on Linux:
Seems to be on this line:
cudaMemcpy(d_oriData, oriData, sizeof(float)*pad_nbEle, cudaMemcpyHostToDevice);