HDFGroup / hdf5

Official HDF5® Library Repository
https://www.hdfgroup.org/
Other
633 stars 259 forks source link

t_bigio test failure with 1.14.0 and mpich on Fedora rawhide #2510

Open opoplawski opened 1 year ago

opoplawski commented 1 year ago

Describe the bug Test segfault.

Platform (please complete the following information)

Additional context build.log

make[4]: Entering directory '/builddir/build/BUILD/hdf5-1.14.0/mpich/testpar'
============================
Testing: t_bigio 
============================
Test log for t_bigio 
============================
Testing Dataset1 write by ROW
Testing Dataset2 write by COL
Testing Dataset3 write select ALL proc 0, NONE others
Testing Dataset4 write point selection
Read Testing Dataset1 by COL
Read Testing Dataset2 by ROW
Read Testing Dataset3 read select ALL proc 0, NONE others
Read Testing Dataset4 with Point selection
Collective chunk I/O Test #1
Collective chunk I/O Test #2
Collective chunk I/O Test #3
Single Rank Independent I/O
===================================================================================
=   BAD TERMINATION OF ONE OF YOUR APPLICATION PROCESSES
=   PID 418601 RUNNING AT 2f0b397ee90643d0a0fdf03f7af50c55
=   EXIT CODE: 139
=   CLEANING UP REMAINING PROCESSES
=   YOU CAN IGNORE THE BELOW CLEANUP MESSAGES
===================================================================================
YOUR APPLICATION TERMINATED WITH THE EXIT STRING: Segmentation fault (signal 11)
derobins commented 1 year ago

@opoplawski - Is this still a problem with the hdf5_1_14 branch ?

opoplawski commented 1 year ago

Yes, still seen with latest hdf5_1_14 branch on Fedora Rawhide i686 with gcc 13.2.1-4.fc40 and mpich 4.1.2-7.fc40

opoplawski commented 8 months ago

I'm not seeing this anymore with the latest hdf5_1_14 branch and Fedora Rawhide gcc 14.0.1-0.12.fc41 and mpich 4.1.2-14.fc41

opoplawski commented 8 months ago

Hmm, perhaps spoke too soon. It may be intermittent:

make[4]: Entering directory '/builddir/build/BUILD/hdf5-hdf5_1_14/mpich/testpar'
============================
Testing: t_bigio 
============================
Test log for t_bigio 
============================
Testing Dataset1 write by ROW
Testing Dataset2 write by COL
Testing Dataset3 write select ALL proc 0, NONE others
Testing Dataset4 write point selection
Read Testing Dataset1 by COL
Read Testing Dataset2 by ROW
Read Testing Dataset3 read select ALL proc 0, NONE others
Read Testing Dataset4 with Point selection
Collective chunk I/O Test #1
Collective chunk I/O Test #2
Collective chunk I/O Test #3
Single Rank Independent I/O
===================================================================================
=   BAD TERMINATION OF ONE OF YOUR APPLICATION PROCESSES
=   PID 379886 RUNNING AT 5bf7a63e2ec241c9aaa87ccb56c0c91c
=   EXIT CODE: 139
=   CLEANING UP REMAINING PROCESSES
=   YOU CAN IGNORE THE BELOW CLEANUP MESSAGES
===================================================================================
YOUR APPLICATION TERMINATED WITH THE EXIT STRING: Segmentation fault (signal 11)
This typically refers to a problem with your application.
Please see the FAQ page for debugging suggestions
Command exited with non-zero status 139