Open ShashankBice opened 7 months ago
I realize that this could also be due to the very high search ranges at steep terrain, I am trying now with search-range limited to reasonable values. Will report on how that goes.
Does this work with a different testcase?
This looks like a filesystem error. It is failing to write a file to disk.
Yes, I have used the same program with data over the same area in the past week without any issues. I will keep looking into it more and see what I find :slightly_smiling_face:
Hmm. @ShashankBice can you confirm that this is not an issue of filling the disk or exceeding hard quota (preventing additional writes to disk)?
@oleg-alexandrov , FYI, I noticed a change to GeoTIFF driver in GDAL 3.8.0 notes (https://github.com/OSGeo/gdal/blob/master/NEWS.md)
Performance improvement: avoid using block cache when writing whole blocks (up to about twice faster in some scenarios)
That performance improvement should be nice, and hopefully should not break anything. We are at GDAL 3.5.3 now.
I can confirm I am well within quota limits both in terms of disk space and file counts. Looking into it more!
I just reran this one tile alone outside of parallel_stereo, and it completed with the same processing parameters. I am curious how I solve this issue now, as I ran (submitted the job) twice before reporting it here, and some tile or the other failed each time after 10 hours of correlation. I have been monitoring the memory usage and that is not an issue.
I think I will report this to NAS folks to see if they have some advice or if this has been some known issue. Wanted to update here about this discovery.
Cheers, Shashank
Describe the bug During the correlation stage of MGM on a stereo pair, the correlation for a few tiles fail with the following error, and the program exits.
Below is the low resolution version of the left image (is rendered when the issue is viewed on github), along with the![image](https://github.com/NeoGeographyToolkit/StereoPipeline/assets/29011666/836949b1-39a5-4732-8059-d40fe945252e)
disparirty_debug
output from the D_sub file.The computation is being run on a broadwell node, with the following version of ASP:
Happy to provide more info if needed, please let me know :)
Cheers, Shashank