Closed adehecq closed 2 years ago
Thank you for raising this. Yeah, issue 1 has been solved in the latest build for a while.
I put a fix for issue 2. This was quite time-consuming to debug, and I also had to get a 10 GB ISO image to make a CentoOS 8 VM on which to reproduce this.
The problem seems to be traceable to the fact that ASP's wrapper shell scripts set up LD_LIBRARY_PATH to ensure our libs are used with our tools. But then GNU parallel is called, which starts ssh, which does not like our libs. The fix is to temporarily hide our libs when running ssh, then get them back in the child processes launched by ssh.
Not too clever, but the fix being done in parallel_stereo itself rather than in our wrapper scripts, it means it will work with ASP's conda distribution too.
Now, this issue is bigger than ASP iself. Conda users will run into this even with other tools (https://github.com/conda/conda/issues/10241).
I put a fix also to parallel_bundle_adjust, mapproject, and parallel_sfs.
The nightly build at https://github.com/NeoGeographyToolkit/StereoPipeline/releases will have this with build date 2022-04-08 or later.
Describe the bug When running
parallel_stereo
with the option--nodes-list $PBS_NODEFILE
, the program fails during correlation with the following two errors:Error 1:
Error 2:
The program runs normally without option
nodes-list
.To Reproduce I can provide the input data if needed, but the full command is
Expected behavior No error should be raised and the program should continue until completion.
Error Logs, Terminal Captures, Screenshots For example:
Your Environment (please complete the following information):
ls -l /usr/lib64/libcrypto.so*
yieldsAdditional context After some email exchanges with Oleg, several things have been tested:
Regarding error 1:
export TCL_LIBRARY=/PATH/TO/ASP/lib/tcl8.6/
solves the first errorRegarding error 2, we tried:
export LD_PRELOAD=/usr/lib64/libcrypto.so.1.1
-> "ERROR: ld.so: object '/usr/lib64/libcrypto.so.1.1' from LD_PRELOAD cannot be preloaded (wrong ELF class: ELFCLASS64): ignored." but the program continues to run...-> "/home/adehecq/bin/StereoPipeline-3.0.0-2021-07-27-x86_64-Linux/libexec/stereo_parse: error while loading shared libraries: libcrypto.so.1.0.0: cannot open shared object file: No such file or directory"
-> "libcrypto.so.1.0.0: wrong ELF class: ELFCLASS32" As of today, the second issue is not fixed.