open-mpi / ompi

Open MPI main development repository
https://www.open-mpi.org

Getting "free(): corrupted unsorted chunks" error while running mpirun #12744

Closed NancyAgarwal013 closed 2 months ago

NancyAgarwal013 commented 3 months ago

Hi Team,

I was trying to run Open MPI on one of my GPU nodes for distributed testing, but I am getting the error below:

[image: screenshot of the error output]

Can anyone tell me what the possible cause could be? I downloaded HPCx onto my node and was trying to run MPI from there.

nancyagarwal1301 commented 3 months ago

While debugging further, I can see this negative size error. Where is it being picked up from, and how can I resolve it?

[image: screenshot showing the negative size error]
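For readers wondering why a negative size is dangerous: C memory routines take their length as an unsigned `size_t`, so a negative signed value that reaches one is reinterpreted as a very large length. The sketch below only illustrates that conversion. It uses Python's `ctypes` to mimic the C behaviour, the helper name `as_size_t` is made up for this example, and nothing in this thread confirms that this is the actual cause here.

```python
import ctypes


def as_size_t(n: int) -> int:
    """Return the value a C size_t would hold after being assigned n."""
    # ctypes integer types do no overflow checking, so a negative input
    # wraps around modulo 2**(number of bits in size_t), as it would in C.
    return ctypes.c_size_t(n).value


if __name__ == "__main__":
    bits = 8 * ctypes.sizeof(ctypes.c_size_t)
    for n in (16, -1, -4096):
        print(f"{n:>6} -> {as_size_t(n)} (size_t is {bits} bits)")
```

A write of that length runs far past the end of the buffer, which is one way heap metadata can get damaged before a later `free()` notices it.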

devreal commented 3 months ago

@NancyAgarwal013 can you please post the full stack trace (and not as screenshots) so we can see how we got to the UCP callsite of memset?
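One possible way to capture a stack trace as text is to start each rank under `gdb` in batch mode, so that the backtrace is printed to the terminal when the process aborts. The sketch below only assembles such a command line. It assumes `gdb` is installed on the node, `./my_app` is a placeholder for the real binary, and the helper name `gdb_backtrace_command` is hypothetical.

```python
import shlex


def gdb_backtrace_command(app, app_args=(), nprocs=2):
    """Build an mpirun command that runs every rank under gdb in batch mode."""
    # gdb: -batch exits after the commands finish, each -ex runs one gdb
    # command ("run" starts the program, "bt" prints the backtrace once it
    # stops), and --args passes the rest of the line to the program.
    gdb = ["gdb", "-batch", "-ex", "run", "-ex", "bt", "--args", app, *app_args]
    # mpirun: -np sets the number of processes to launch.
    return ["mpirun", "-np", str(nprocs), *gdb]


if __name__ == "__main__":
    # Print the command so it can be pasted into a shell on the node.
    print(shlex.join(gdb_backtrace_command("./my_app", nprocs=2)))
```

Output from the ranks may be interleaved, so redirecting it to a file and pasting the text into the issue is usually easier to read than a screenshot.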

jsquyres commented 3 months ago

@NancyAgarwal013 Also, can you post all the information asked for in the bug template?

https://github.com/open-mpi/ompi/blob/main/.github/ISSUE_TEMPLATE/bug_report.md

github-actions[bot] commented 2 months ago

It looks like this issue is expecting a response, but hasn't gotten one yet. If there are no responses in the next 2 weeks, we'll assume that the issue has been abandoned and will close it.

github-actions[bot] commented 2 months ago

Per the above comment, it has been a month with no reply on this issue. It looks like this issue has been abandoned.

I'm going to close this issue. If I'm wrong and this issue is not abandoned, please feel free to re-open it. Thank you!