JCSDA / spack-stack

cprnc on Gaea w/ spack-stack 1.6 reports error #1048

Closed DeniseWorthen closed 6 months ago

DeniseWorthen commented 7 months ago

Describe the bug: cprnc on Gaea using spack-stack (SS) 1.6 fails.

[Gaea] ~: cprnc -m sfcf024.tile3.nc sfcf024.tile6.nc

Please verify that both the operating system and the processor support Intel(R) X87, CMOV, MMX, FXSAVE, SSE, SSE2, SSE3, SSSE3, SSE4_1, SSE4_2, MOVBE, POPCNT, F16C, AVX, FMA, BMI, LZCNT and AVX2 instructions.

To Reproduce

Log in to Gaea with a clean session.

[Gaea] ~: module list

Currently Loaded Modules:
  1) craype-x86-rome                         5) intel-classic/2022.2.1   9) cray-libsci/23.02.1.1  13) CmrsEnv/default
  2) craype-network-ofi                      6) craype/2.7.20           10) PrgEnv-intel/8.3.3     14) TimeZoneEDT/default
  3) perftools-base/23.03.0                  7) cray-dsmml/0.2.2        11) cray-pmi/6.1.10        15) DefApps/default
  4) xpmem/2.6.2-2.5_2.27__gd067c3f.shasta   8) cray-mpich/8.1.25       12) darshan-runtime/3.4.0

[Gaea] ~:  module use /ncrc/proj/epic/spack-stack/spack-stack-1.6.0/envs/unified-env/install/modulefiles/Core
[Gaea] ~: module spider cprnc

----------------------------------------------------------------------------------------------------------------------------------
  cprnc: cprnc/1.0.3
----------------------------------------------------------------------------------------------------------------------------------

    You will need to load all module(s) on any one of the lines below before the "cprnc/1.0.3" module is available to load.

      stack-intel/2023.1.0  stack-cray-mpich/8.1.25

    Help:
      CPRNC is a netcdf file comparison tool used by CESM and other scientific
      programs.

[Gaea] ~: module load stack-intel/2023.1.0  stack-cray-mpich/8.1.25 cprnc/1.0.3
[Gaea] ~: cprnc -m sfcf024.tile3.nc sfcf024.tile6.nc

Please verify that both the operating system and the processor support Intel(R) X87, CMOV, MMX, FXSAVE, SSE, SSE2, SSE3, SSSE3, SSE4_1, SSE4_2, MOVBE, POPCNT, F16C, AVX, FMA, BMI, LZCNT and AVX2 instructions.

Expected behavior: cprnc should report any differences between the files.

System: Gaea


climbfuji commented 7 months ago

Is this a C4 vs C5 problem?

DeniseWorthen commented 7 months ago

Only C5 is left now.

climbfuji commented 7 months ago

I am wondering if that particular version of the stack was still built on C4, but that doesn't make much sense, since C5 is newer and should understand older instruction sets (unless they are entirely different CPUs, AMD vs. Intel for example).
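
One way to test the instruction-set hypothesis is to compare the CPU flags the kernel reports on the node against the list in the error message. The following is a minimal sketch, assuming a Linux node where /proc/cpuinfo is readable. The function name `missing_cpu_flags` is hypothetical and not part of spack-stack or cprnc. Note that the kernel uses its own flag names, which do not always match the names in the Intel message (SSE3, for example, is listed as `pni`).

```shell
# Hypothetical helper: print each requested CPU flag that is absent
# from a cpuinfo-style file.
# Usage: missing_cpu_flags <cpuinfo-file> flag1 flag2 ...
missing_cpu_flags() {
    local cpuinfo="$1"; shift
    # Take the first "flags" line and keep only the space-separated list,
    # padded with spaces so every flag can be matched as a whole word.
    local flags
    flags=" $(grep -m1 '^flags' "$cpuinfo" | cut -d: -f2) "
    local f
    for f in "$@"; do
        case "$flags" in
            *" $f "*) ;;        # flag present, nothing to report
            *) echo "$f" ;;     # flag missing
        esac
    done
}
```

On a login node this could be run as `missing_cpu_flags /proc/cpuinfo avx avx2 fma f16c movbe popcnt sse4_1 sse4_2`. Empty output would suggest the hardware supports those instructions and that the cause lies elsewhere, for example in how the failing binary was built or in which binary is actually being run.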

AlexanderRichert-NOAA commented 6 months ago

I'm unable to reproduce this on gaea57 with the given steps (though with a different netCDF file). @deniseworthen, are you still having this issue, and if so, on what node(s)?

DeniseWorthen commented 6 months ago

I repeated the steps and got the same error message on gaea55 and gaea57. I tried a different set of netCDF files with the same result.

climbfuji commented 6 months ago

Is there anything in your default setup (.bashrc, ulimit, ...) that might cause this difference?

DeniseWorthen commented 6 months ago

I'm cringing. YES. I had my own install of cprnc, which I was still aliasing. I thought I had removed it everywhere already. It is now working. I'll close the issue.
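
For anyone hitting the same symptom, a quick check of what the shell actually runs for `cprnc` can rule this out early. The following is a minimal sketch, assuming a POSIX-style shell such as bash. The function name `check_shadow` is hypothetical, and it must be defined in (or sourced into) the current shell, because a child process cannot see the parent's aliases.

```shell
# Hypothetical helper: report whether a command name is an alias in the
# current shell, or otherwise what the shell resolves it to.
# Usage: check_shadow <command-name>
check_shadow() {
    if alias "$1" >/dev/null 2>&1; then
        # An alias takes precedence over anything a module puts on PATH.
        echo "alias: $(alias "$1")"
    else
        # Not an alias: show what the shell resolves the name to.
        echo "path: $(command -v "$1" || echo 'not found')"
    fi
}
```

Running `check_shadow cprnc` after the `module load` step should print a path under the spack-stack install tree. If it prints an alias instead, `unalias cprnc` clears it for the current session, and the definition still needs to be removed from the startup file that sets it (for example .bashrc). In an interactive bash session, `type -a cprnc` gives a similar picture by listing every alias, function, and executable that matches the name.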

climbfuji commented 6 months ago

Glad that it was such an easy fix! Thanks, @AlexanderRichert-NOAA, for your help on Gaea.