ParRes / Kernels

This is a set of simple programs that can be used to explore the features of a parallel platform.
https://groups.google.com/forum/#!forum/parallel-research-kernels
Other
404 stars 106 forks source link

Coarray fortran stencil code fails for certain numbers of images #99

Closed rfvander closed 8 years ago

rfvander commented 8 years ago

@afanfa Even though I request only 5 images, the stencil kernel thinks there are 6.

[rfvander@esgmonster Stencil]$ ./stencil-coarray 3 50 
Parallel Research Kernels version 2.16
CAF stencil execution on 2D grid
Number of images     =        5
Grid size            =       50
Radius of stencil    =        2
Type of stencil      = star
Data type            = double precision
Compact representation of stencil loop body
Untiled
Number of iterations =        3
forrtl: severe (772): Image number 6 is not a valid image number; valid numbers are 1 to 5
In coarray image 5
Image              PC                Routine            Line        Source             
libicaf.so         00007FC70DC73AFA  Unknown               Unknown  Unknown
stencil-coarray    0000000000407233  Unknown               Unknown  Unknown
stencil-coarray    0000000000403F1E  Unknown               Unknown  Unknown
libc.so.6          000000308521ED5D  Unknown               Unknown  Unknown
stencil-coarray    0000000000403DA9  Unknown               Unknown  Unknown

application called MPI_Abort(comm=0x84000000, 3) - process 4
forrtl: error (69): process interrupted (SIGINT)
In coarray image 1
Image              PC                Routine            Line        Source             
stencil-coarray    000000000047DD61  Unknown               Unknown  Unknown
stencil-coarray    000000000047BBC7  Unknown               Unknown  Unknown
stencil-coarray    000000000044C594  Unknown               Unknown  Unknown
stencil-coarray    000000000044C3A6  Unknown               Unknown  Unknown
stencil-coarray    000000000042FCF4  Unknown               Unknown  Unknown
stencil-coarray    0000000000408764  Unknown               Unknown  Unknown
Unknown            0000003085E0F7E0  Unknown               Unknown  Unknown
libpthread.so.0    0000003085E0B68A  Unknown               Unknown  Unknown
libmpi.so.12       00007F46461225A5  Unknown               Unknown  Unknown
libmpi.so.12       00007F4645F8158A  Unknown               Unknown  Unknown
libmpi.so.12       00007F464608CC29  Unknown               Unknown  Unknown
libmpi.so.12       00007F464608D26A  Unknown               Unknown  Unknown
libmpi.so.12       00007F4645F65EEF  Unknown               Unknown  Unknown
libmpi.so.12       00007F4645F65875  Unknown               Unknown  Unknown
libmpi.so.12       00007F4645F656CC  Unknown               Unknown  Unknown
libmpi.so.12       00007F4645F655FC  Unknown               Unknown  Unknown
libmpi.so.12       00007F4645F67482  Unknown               Unknown  Unknown
libicaf.so         00007F4645BF839E  Unknown               Unknown  Unknown
stencil-coarray    00000000004057D4  Unknown               Unknown  Unknown
stencil-coarray    0000000000403F1E  Unknown               Unknown  Unknown
libc.so.6          000000308521ED5D  Unknown               Unknown  Unknown
stencil-coarray    0000000000403DA9  Unknown               Unknown  Unknown

application called MPI_Abort(comm=0x84000000, 3) - process 0
forrtl: error (69): process interrupted (SIGINT)
In coarray image 2
Image              PC                Routine            Line        Source             
stencil-coarray    000000000047DD61  Unknown               Unknown  Unknown
stencil-coarray    000000000047BBC7  Unknown               Unknown  Unknown
stencil-coarray    000000000044C594  Unknown               Unknown  Unknown
stencil-coarray    000000000044C3A6  Unknown               Unknown  Unknown
stencil-coarray    000000000042FCF4  Unknown               Unknown  Unknown
stencil-coarray    0000000000408764  Unknown               Unknown  Unknown
Unknown            0000003085E0F7E0  Unknown               Unknown  Unknown
libpthread.so.0    0000003085E0B68A  Unknown               Unknown  Unknown
libmpi.so.12       00007F528AA355A5  Unknown               Unknown  Unknown
libmpi.so.12       00007F528A89458A  Unknown               Unknown  Unknown
libmpi.so.12       00007F528A99FC29  Unknown               Unknown  Unknown
libmpi.so.12       00007F528A99F922  Unknown               Unknown  Unknown
libmpi.so.12       00007F528A878E55  Unknown               Unknown  Unknown
libmpi.so.12       00007F528A878875  Unknown               Unknown  Unknown
libmpi.so.12       00007F528A8786CC  Unknown               Unknown  Unknown
libmpi.so.12       00007F528A8785FC  Unknown               Unknown  Unknown
libmpi.so.12       00007F528A87A482  Unknown               Unknown  Unknown
libicaf.so         00007F528A50B39E  Unknown               Unknown  Unknown
stencil-coarray    00000000004057D4  Unknown               Unknown  Unknown
stencil-coarray    0000000000403F1E  Unknown               Unknown  Unknown
libc.so.6          000000308521ED5D  Unknown               Unknown  Unknown
stencil-coarray    0000000000403DA9  Unknown               Unknown  Unknown

application called MPI_Abort(comm=0x84000000, 3) - process 1
forrtl: error (69): process interrupted (SIGINT)
In coarray image 3
Image              PC                Routine            Line        Source             
stencil-coarray    000000000047DD61  Unknown               Unknown  Unknown
stencil-coarray    000000000047BBC7  Unknown               Unknown  Unknown
stencil-coarray    000000000044C594  Unknown               Unknown  Unknown
stencil-coarray    000000000044C3A6  Unknown               Unknown  Unknown
stencil-coarray    000000000042FCF4  Unknown               Unknown  Unknown
stencil-coarray    0000000000408764  Unknown               Unknown  Unknown
Unknown            0000003085E0F7E0  Unknown               Unknown  Unknown
libpthread.so.0    0000003085E0B68A  Unknown               Unknown  Unknown
libmpi.so.12       00007F808DDC15A5  Unknown               Unknown  Unknown
libmpi.so.12       00007F808DC2058A  Unknown               Unknown  Unknown
libmpi.so.12       00007F808DD2BC29  Unknown               Unknown  Unknown
libmpi.so.12       00007F808DD2B922  Unknown               Unknown  Unknown
libmpi.so.12       00007F808DC04E55  Unknown               Unknown  Unknown
libmpi.so.12       00007F808DC04875  Unknown               Unknown  Unknown
libmpi.so.12       00007F808DC046CC  Unknown               Unknown  Unknown
libmpi.so.12       00007F808DC045FC  Unknown               Unknown  Unknown
libmpi.so.12       00007F808DC06482  Unknown               Unknown  Unknown
libicaf.so         00007F808D89739E  Unknown               Unknown  Unknown
stencil-coarray    00000000004057D4  Unknown               Unknown  Unknown
stencil-coarray    0000000000403F1E  Unknown               Unknown  Unknown
libc.so.6          000000308521ED5D  Unknown               Unknown  Unknown
stencil-coarray    0000000000403DA9  Unknown               Unknown  Unknown

application called MPI_Abort(comm=0x84000000, 3) - process 2
forrtl: error (69): process interrupted (SIGINT)
In coarray image 4
Image              PC                Routine            Line        Source             
stencil-coarray    000000000047DD61  Unknown               Unknown  Unknown
stencil-coarray    000000000047BBC7  Unknown               Unknown  Unknown
stencil-coarray    000000000044C594  Unknown               Unknown  Unknown
stencil-coarray    000000000044C3A6  Unknown               Unknown  Unknown
stencil-coarray    000000000042FCF4  Unknown               Unknown  Unknown
stencil-coarray    0000000000408764  Unknown               Unknown  Unknown
Unknown            0000003085E0F7E0  Unknown               Unknown  Unknown
libpthread.so.0    0000003085E0B68A  Unknown               Unknown  Unknown
libmpi.so.12       00007FA8A59775A5  Unknown               Unknown  Unknown
libmpi.so.12       00007FA8A57D658A  Unknown               Unknown  Unknown
libmpi.so.12       00007FA8A58E1C29  Unknown               Unknown  Unknown
libmpi.so.12       00007FA8A58E1922  Unknown               Unknown  Unknown
libmpi.so.12       00007FA8A57BAE55  Unknown               Unknown  Unknown
libmpi.so.12       00007FA8A57BA875  Unknown               Unknown  Unknown
libmpi.so.12       00007FA8A57BA6CC  Unknown               Unknown  Unknown
libmpi.so.12       00007FA8A57BA5FC  Unknown               Unknown  Unknown
libmpi.so.12       00007FA8A57BC482  Unknown               Unknown  Unknown
libicaf.so         00007FA8A544D39E  Unknown               Unknown  Unknown
stencil-coarray    00000000004057D4  Unknown               Unknown  Unknown
stencil-coarray    0000000000403F1E  Unknown               Unknown  Unknown
libc.so.6          000000308521ED5D  Unknown               Unknown  Unknown
stencil-coarray    0000000000403DA9  Unknown               Unknown  Unknown

application called MPI_Abort(comm=0x84000000, 3) - process 3
afanfa commented 8 years ago

@rfvander ,

it is a consequence of the code deactivated by #98. I'll see to fix this issue.

jeffhammond commented 8 years ago

Sorry, I guess my bug fix was actually bug transmutation :-(

afanfa commented 8 years ago

Hi, I've got a solution for the bug described by #97; I plan to clean the code and make it more efficient during the long weekend. I think you can also close this issue.

Cheers, Alessandro

jeffhammond commented 8 years ago

@afanfa Glad to hear that, but I want to close the issue when we merge the fixes. I like to follow procedure 😄

jeffhammond commented 8 years ago

We merged, so this will be closed.