Closed vbaconnet closed 2 months ago
It looks like the freeze is happening in
fld_file.f90
, here
Is this system specific or always occurs?
I observed issues on my workstation (debian, gnu fortran) and on the nj
computer. Occurs on Dardel GPU too with cray fortran, although the program freezes before the probes file is even created and written.
This is a bug which has to do with the sequential nature of the I/O performed on csv files.
This means that the generic check_exists halts the simulation since inside one issues a broadcast.
I am finalizing my thesis for printing so I don't have that much time to look into it further, but commenting this line helps.
I might be able to have more of a look at the problem late next week.
This is a bug which has to do with the sequential nature of the I/O performed on csv files.
This means that the generic check_exists halts the simulation since inside one issues a broadcast.
I am finalizing my thesis for printing so I don't have that much time to look into it further, but commenting this line helps.
I might be able to have more of a look at the problem late next week.
A quick fix would be to do the check on rank 0, with a barrier afterwards
if (pe_rank .eq. 0) then
call this%check_exists()
end if
call MPI_barrier(NEKO_COMM)
This is a bug which has to do with the sequential nature of the I/O performed on csv files.
This means that the generic check_exists halts the simulation since inside one issues a broadcast.
I am finalizing my thesis for printing so I don't have that much time to look into it further, but commenting this line helps.
I might be able to have more of a look at the problem late next week.
A quick fix would be to do the check on rank 0, with a barrier afterwards
if (pe_rank .eq. 0) then call this%check_exists() end if call MPI_barrier(NEKO_COMM)
No that doesn't fix it, I'll have a look today
The issue is the call to MPI_bcast in check_exists, which must be called on all ranks, otherwise there is a mismatch.
Problem observed
Running the
rayleigh-benard-cylinder
case with more than 1 rank will cause the simulation to freeze here:The probes output file is created and data is written on disk as expected. However the output of
field.fld
is not behaving properly, the file is created but nothing is getting written in it.Steps to reproduce
What I tried
Changing the type of simulation component or removing the simulation components entirely from the case file did not cause the issue to appear.
Running with only 1 rank will also not cause the issue to appear.
Commenting out the header related lines in
probes.F90
(here) did not fix the problem.