trilinos / Trilinos

Primary repository for the Trilinos Project
https://trilinos.org/
Other
1.2k stars 564 forks source link

AUTOTESTER TRANSITION END OF JUNE #13125

Closed sebrowne closed 2 months ago

sebrowne commented 3 months ago

The Autotester will be undergoing significant configuration changes surrounding the end of June (06/30/2024). We will be modifying many configurations with the transition to RHEL8 from now until the beginning of July.

Please report any issues or ask any questions on this GitHub issue

Per-build status

Build shortname Current status Old Full Config Name New Full Config Name
GCC OpenMPI Transition complete rhel7_sems-gnu-8.3.0-openmpi-1.10.1-serial_debug_shared_no-kokkos-arch_no-asan_no-complex_no-fpic_mpi_no-pt_no-rdc_no-uvm_deprecated-on_no-package-enables rhel8_sems-gnu-8.5.0-openmpi-4.1.6-serial_debug_shared_no-kokkos-arch_no-asan_no-complex_no-fpic_mpi_no-pt_no-rdc_no-uvm_deprecated-on_no-package-enables
GCC OpenMP Transition complete rhel7_sems-gnu-8.3.0-openmpi-1.10.1-openmp_release-debug_static_no-kokkos-arch_no-asan_no-complex_no-fpic_mpi_no-pt_no-rdc_no-uvm_deprecated-on_no-package-enables rhel8_sems-gnu-8.5.0-openmpi-4.1.6-openmp_release-debug_static_no-kokkos-arch_no-asan_no-complex_no-fpic_mpi_no-pt_no-rdc_no-uvm_deprecated-on_no-package-enables
GCC Transition complete rhel7_sems-v2-gnu-8.3.0-serial_release-debug_shared_no-kokkos-arch_no-asan_no-complex_no-fpic_no-mpi_no-pt_no-rdc_no-uvm_deprecated-on_no-package-enables rhel8_sems-gnu-8.5.0-serial_release-debug_shared_no-kokkos-arch_no-asan_no-complex_no-fpic_no-mpi_no-pt_no-rdc_no-uvm_deprecated-on_no-package-enables
Clang Transition complete rhel7_sems-clang-11.0.1-openmpi-1.10.1-serial_release-debug_shared_no-kokkos-arch_no-asan_no-complex_no-fpic_mpi_no-pt_no-rdc_no-uvm_deprecated-on_no-package-enables rhel8_sems-clang-11.0.1-openmpi-4.0.5-serial_release-debug_shared_no-kokkos-arch_no-asan_no-complex_no-fpic_mpi_no-pt_no-rdc_no-uvm_deprecated-on_no-package-enables
Intel Transition complete rhel7_sems-intel-2021.3-sems-openmpi-4.0.5_release-debug_shared_no-kokkos-arch_no-asan_no-complex_fpic_mpi_no-pt_no-rdc_no-uvm_deprecated-on_no-package-enables rhel8_sems-intel-2021.3-sems-openmpi-4.1.4_release-debug_shared_no-kokkos-arch_no-asan_no-complex_fpic_mpi_no-pt_no-rdc_no-uvm_deprecated-on_no-package-enables
CUDA Transition complete rhel7_sems-cuda-11.4.2-sems-gnu-10.1.0-sems-openmpi-4.0.5_release_static_Volta70_no-asan_complex_no-fpic_mpi_pt_no-rdc_no-uvm_deprecated-on_no-package-enables rhel8_sems-cuda-11.4.2-gnu-10.1.0-openmpi-4.1.6_release_static_Volta70_no-asan_complex_no-fpic_mpi_pt_no-rdc_no-uvm_deprecated-on_no-package-enables
CUDA UVM Transition complete rhel7_sems-cuda-11.4.2-sems-gnu-10.1.0-sems-openmpi-4.0.5_release_static_Volta70_no-asan_complex_no-fpic_mpi_pt_no-rdc_uvm_deprecated-on_no-package-enables rhel8_sems-cuda-11.4.2-gnu-10.1.0-openmpi-4.1.6_release_static_Volta70_no-asan_complex_no-fpic_mpi_pt_no-rdc_uvm_deprecated-on_no-package-enables
Python Unit Tests Transition complete rhel7_sems-gnu-7.2.0-anaconda3-serial_debug_shared_no-kokkos-arch_no-asan_no-complex_no-fpic_no-mpi_no-pt_no-rdc_no-uvm_deprecated-on_pr-framework rhel8_aue-gnu-12.1.0-anaconda3-serial_debug_shared_no-kokkos-arch_no-asan_no-complex_no-fpic_no-mpi_no-pt_no-rdc_no-uvm_deprecated-on_pr-framework
sebrowne commented 3 months ago

The Autotester will be down for scheduled transition work starting 5:00 PM MDT 06/24/2024. Expecting resuming normal operations by 12:00 PM MDT 06/25/2024.

Scheduling merge of all ready-to-merge configurations: GCC, Clang, CUDA, CUDA-UVM

EDIT: Maintenance completed, builds appear to be working correctly.

sebrowne commented 3 months ago

The frequently-failing Adelus_vector_random tests are being re-disabled in the new CUDA configuration (#13158). That detail was lost during the transition (the 3-rank one was previously disabled). This should stop the CUDA build from halting PRs.

sebrowne commented 3 months ago

I will update the configuration wiki at https://github.com/trilinos/Trilinos/wiki/Pull-Request-Testing-Interface once the Intel build is transitioned. We will still need to update the Master Merge process, so there is a good chance that will fail this week.

EDIT: PR configurations have been updated on the wiki. Transition is complete other than the Python3 Framework unit tests.

sebrowne commented 3 months ago

There is a critical error with the backing TPL modules for the Intel configuration. A ticket is in to get the issue resolved, but it is likely that the autotester will not allow merges until the module is restored. Trilinos DevOps will mark that build as non-blocking to prevent major interruptions to development if it is not resolved by 9:00PM MDT 07/03/2024.

EDIT: The Intel PR/MM build has been set to non-blocking. SEMS is looking into the issue and has made some progress resolving it, but I doubt it will be completed before the end of the holiday weekend.

sebrowne commented 2 months ago

All transitions are completed, closing.