Open theobarnhart-USGS opened 6 years ago
@theobarnhart-USGS Sorry about that. I'm a little busy this week, but will look into it as soon as I can.
In the meantime, I've reformatted your post so that the outputs are appropriately formatted. You can do this yourself by highlighting the outputs as pressing the <>
button in the formatting bar, which will convert the outputs to "code" in a way which makes them much easier to read.
This probably has something to do with the file having the Int16 data type.
I should be able to fix this.
In the meantime, do you experience problems on Int32 or Float32/Double32 data?
I'm getting the same errors, with the exception that it seems to be running some of the jobs?
mpirun -n 4 ./parallel_d8_accum.exe one @evict ~/kod/flodesapp/localdata/geodata/DEM_LM/riks_50m_nowater.tif ./out/%n.tif --bwidth 500 --bheight 500
c Program name = RichDEM v2.2.9
c Script compiled at = 2019-10-09 11:09:19 UTC
c Git hash = abc04d81216d7cf5
c Copyright = Richard Barnes © 2018
a Analysis command = ./parallel_d8_accum.exe one @evict /home/johnnie/kod/flodesapp/localdata/geodata/DEM_LM/riks_50m_nowater.tif ./out/%n.tif --bwidth 500 --bheight 500
A Barnes (2017) Parallel Non-divergent Flow Accumulation
C Barnes, R., 2017. Parallel non-divergent flow accumulation for trillion cell digital elevation models on desktops or clusters. Environmental Modelling & Software 92, 202-212. doi:10.1016/j.envsoft.2017.02.022
c Processes = 4
c Many or one = one
c Input file = /home/johnnie/kod/flodesapp/localdata/geodata/DEM_LM/riks_50m_nowater.tif
c Retention strategy = @evict
c Block width = 500
c Block height = 500
c Flip horizontal = 0
c Flip vertical = 0
c Cache compression = FALSE
m Total width = 16000
m Total height = 32000
m Block width = 500
m Block height = 500
m Total cells to be processed = 512000000
t Preparer time = 0.0128314 s
c Input data type = Float32
m Jobs created = 2048
p Jobs remaining = 2047
p Jobs remaining = 2046
p Jobs remaining = 2045
p Jobs remaining = 2044
p Jobs remaining = 2043
p Jobs remaining = 2042
p Jobs remaining = 2041
p Jobs remaining = 2040
p Jobs remaining = 2039
p Jobs remaining = 2038
p Jobs remaining = 2037
p Jobs remaining = 2036
p Jobs remaining = 2035
p Jobs remaining = 2034
p Jobs remaining = 2033
p Jobs remaining = 2032
p Jobs remaining = 2031
p Jobs remaining = 2030
p Jobs remaining = 2029
p Jobs remaining = 2028
p Jobs remaining = 2027
p Jobs remaining = 2026
p Jobs remaining = 2025
p Jobs remaining = 2024
p Jobs remaining = 2023
p Jobs remaining = 2022
p Jobs remaining = 2021
p Jobs remaining = 2020
p Jobs remaining = 2019
p Jobs remaining = 2018
p Jobs remaining = 2017
p Jobs remaining = 2016
p Jobs remaining = 2015
p Jobs remaining = 2014
p Jobs remaining = 2013
p Jobs remaining = 2012
p Jobs remaining = 2011
p Jobs remaining = 2010
p Jobs remaining = 2009
p Jobs remaining = 2008
p Jobs remaining = 2007
p Jobs remaining = 2006
p Jobs remaining = 2005
p Jobs remaining = 2004
p Jobs remaining = 2003
p Jobs remaining = 2002
p Jobs remaining = 2001
p Jobs remaining = 2000
p Jobs remaining = 1999
p Jobs remaining = 1998
parallel_d8_accum.exe: ../../include/richdem/common/Array2D.hpp:552: richdem::Array2D< <template-parameter-1-1> >::i_t richdem::Array2D< <template-parameter-1-1> >::getN(richdem::Array2D< <template-parameter-1-1> >::i_t, uint8_t) const [with T = unsigned char; richdem::Array2D< <template-parameter-1-1> >::i_t = unsigned int; uint8_t = unsigned char]: Assertion `0<=n && n<=8' failed.
[gib:28256] *** Process received signal ***
[gib:28256] Signal: Aborted (6)
[gib:28256] Signal code: (-6)
[gib:28256] [ 0] /lib/x86_64-linux-gnu/libpthread.so.0(+0x12890)[0x7f2e4a729890]
[gib:28256] [ 1] /lib/x86_64-linux-gnu/libc.so.6(gsignal+0xc7)[0x7f2e4a364e97]
[gib:28256] [ 2] /lib/x86_64-linux-gnu/libc.so.6(abort+0x141)[0x7f2e4a366801]
[gib:28256] [ 3] /lib/x86_64-linux-gnu/libc.so.6(+0x3039a)[0x7f2e4a35639a]
[gib:28256] [ 4] /lib/x86_64-linux-gnu/libc.so.6(+0x30412)[0x7f2e4a356412]
[gib:28256] [ 5] ./parallel_d8_accum.exe(+0x2c96d)[0x55dac951296d]
[gib:28256] [ 6] ./parallel_d8_accum.exe(+0x30b62)[0x55dac9516b62]
[gib:28256] [ 7] ./parallel_d8_accum.exe(+0x35f40)[0x55dac951bf40]
[gib:28256] [ 8] ./parallel_d8_accum.exe(+0x406a1)[0x55dac95266a1]
[gib:28256] [ 9] ./parallel_d8_accum.exe(+0x10391)[0x55dac94f6391]
[gib:28256] [10] /lib/x86_64-linux-gnu/libc.so.6(__libc_start_main+0xe7)[0x7f2e4a347b97]
[gib:28256] [11] ./parallel_d8_accum.exe(+0x10d0a)[0x55dac94f6d0a]
[gib:28256] *** End of error message ***
p Jobs remaining = 1997
p Jobs remaining = 1996
p Jobs remaining = 1995
parallel_d8_accum.exe: ../../include/richdem/common/Array2D.hpp:552: richdem::Array2D< <template-parameter-1-1> >::i_t richdem::Array2D< <template-parameter-1-1> >::getN(richdem::Array2D< <template-parameter-1-1> >::i_t, uint8_t) const [with T = unsigned char; richdem::Array2D< <template-parameter-1-1> >::i_t = unsigned int; uint8_t = unsigned char]: Assertion `0<=n && n<=8' failed.
[gib:28257] *** Process received signal ***
[gib:28257] Signal: Aborted (6)
[gib:28257] Signal code: (-6)
[gib:28257] [ 0] /lib/x86_64-linux-gnu/libpthread.so.0(+0x12890)[0x7f3e32a85890]
[gib:28257] [ 1] /lib/x86_64-linux-gnu/libc.so.6(gsignal+0xc7)[0x7f3e326c0e97]
[gib:28257] [ 2] /lib/x86_64-linux-gnu/libc.so.6(abort+0x141)[0x7f3e326c2801]
[gib:28257] [ 3] /lib/x86_64-linux-gnu/libc.so.6(+0x3039a)[0x7f3e326b239a]
[gib:28257] [ 4] /lib/x86_64-linux-gnu/libc.so.6(+0x30412)[0x7f3e326b2412]
[gib:28257] [ 5] ./parallel_d8_accum.exe(+0x2c96d)[0x55ab9b04a96d]
[gib:28257] [ 6] ./parallel_d8_accum.exe(+0x30b62)[0x55ab9b04eb62]
[gib:28257] [ 7] ./parallel_d8_accum.exe(+0x35f40)[0x55ab9b053f40]
[gib:28257] [ 8] ./parallel_d8_accum.exe(+0x406a1)[0x55ab9b05e6a1]
[gib:28257] [ 9] ./parallel_d8_accum.exe(+0x10391)[0x55ab9b02e391]
[gib:28257] [10] /lib/x86_64-linux-gnu/libc.so.6(__libc_start_main+0xe7)[0x7f3e326a3b97]
[gib:28257] [11] ./parallel_d8_accum.exe(+0x10d0a)[0x55ab9b02ed0a]
[gib:28257] *** End of error message ***
p Jobs remaining = 1994
p Jobs remaining = 1993
p Jobs remaining = 1992
p Jobs remaining = 1991
p Jobs remaining = 1990
p Jobs remaining = 1989
p Jobs remaining = 1988
parallel_d8_accum.exe: ../../include/richdem/common/Array2D.hpp:552: richdem::Array2D< <template-parameter-1-1> >::i_t richdem::Array2D< <template-parameter-1-1> >::getN(richdem::Array2D< <template-parameter-1-1> >::i_t, uint8_t) const [with T = unsigned char; richdem::Array2D< <template-parameter-1-1> >::i_t = unsigned int; uint8_t = unsigned char]: Assertion `0<=n && n<=8' failed.
p Jobs remaining = 1987
p Jobs remaining = 1986
p Jobs remaining = 1985
[gib:28255] *** Process received signal ***
[gib:28255] Signal: Aborted (6)
[gib:28255] Signal code: (-6)
[gib:28255] [ 0] /lib/x86_64-linux-gnu/libpthread.so.0(+0x12890)[0x7fb16bf4e890]
[gib:28255] [ 1] /lib/x86_64-linux-gnu/libc.so.6(gsignal+0xc7)[0x7fb16bb89e97]
[gib:28255] [ 2] /lib/x86_64-linux-gnu/libc.so.6(abort+0x141)[0x7fb16bb8b801]
[gib:28255] [ 3] /lib/x86_64-linux-gnu/libc.so.6(+0x3039a)[0x7fb16bb7b39a]
[gib:28255] [ 4] /lib/x86_64-linux-gnu/libc.so.6(+0x30412)[0x7fb16bb7b412]
[gib:28255] [ 5] ./parallel_d8_accum.exe(+0x2c96d)[0x561d86f2b96d]
[gib:28255] [ 6] ./parallel_d8_accum.exe(+0x30b62)[0x561d86f2fb62]
[gib:28255] [ 7] ./parallel_d8_accum.exe(+0x35f40)[0x561d86f34f40]
[gib:28255] [ 8] ./parallel_d8_accum.exe(+0x406a1)[0x561d86f3f6a1]
[gib:28255] [ 9] ./parallel_d8_accum.exe(+0x10391)[0x561d86f0f391]
[gib:28255] [10] /lib/x86_64-linux-gnu/libc.so.6(__libc_start_main+0xe7)[0x7fb16bb6cb97]
[gib:28255] [11] ./parallel_d8_accum.exe(+0x10d0a)[0x561d86f0fd0a]
[gib:28255] *** End of error message ***
--------------------------------------------------------------------------
mpirun noticed that process rank 2 with PID 0 on node gib exited on signal 6 (Aborted).
I've tried digging around the code, but I've never worked with c++ so I'm a bit lost.
@r-barnes here's the DEM if you have the time to have a look: https://drive.google.com/open?id=1-NarHegMQiH6WmhfjR_4NlvB6tuEEsIb
@r-barnes just want to let you know that we're working on this and making progress. It mostly turned out to be mismatches between code and documentation. PR coming up.
Hi Rich,
Thank you for helping me with the other problem. Now when I try to run parallel_d8_accum it produces the following error and I've gdalinfoed the file I'm using below as well. Thank you for all your help!
Output from gdalinfo for the file I'm trying to run the accumulation process on is: