EESSI / test-suite

A portable test suite for software installations, using ReFrame
GNU General Public License v2.0
3 stars 11 forks source link

Following the steps in the README leads to `undefined parameters` #43

Closed satishskamath closed 1 year ago

satishskamath commented 1 year ago

Command executed:

[satishk@int4 projects]$ PYTHONPATH=$PYTHONPATH:$EBROOTREFRAME:$eessihome reframe -vvvv -C eessi_reframe/settings_example.py -c test-suite/eessi/reframe/eessi_checks/applications/ -t CI -t 1_node  -l

Part of the output:

Looking for tests in '/gpfs/home5/satishk/projects/test-suite/eessi/reframe/eessi_checks/applications'
Validating '/gpfs/home5/satishk/projects/test-suite/eessi/reframe/eessi_checks/applications/__init__.py': not a test file
Validating '/gpfs/home5/satishk/projects/test-suite/eessi/reframe/eessi_checks/applications/gromacs_check.py': OK
WARNING: skipping test 'GROMACS_EESSI': test has one or more undefined parameters
  > Loaded 0 test(s)
Loaded 0 test(s)

Config settings file:

""" This file is a settings file for eessi test suite. """
from os import environ
username = environ.get('USER')

# This is an example configuration file
site_configuration = {
    'systems': [
        {
            'name': 'snellius_eessi',
            'descr': 'example_cluster',
            'modules_system': 'lmod',
            'hostnames': ['tcn*', 'gcn*'],
            # Note that the stagedir should be a shared directory available on
            # all nodes running ReFrame tests
            'stagedir': f'/scratch-shared/{username}/reframe_output/staging',
            'partitions': [
                {
                    'name': 'cpu',
                    'scheduler': 'slurm',
                    'launcher': 'mpirun',
                    'access':  ['-p thin'],
                    'environs': ['default'],
                    'max_jobs': 4,
                    'processor': {
                        'num_cpus': 128,
                        'num_sockets': 2,
                        'num_cpus_per_socket': 64,
                        'arch': 'znver2',
                    },
                    'features': ['cpu'],
                    'descr': 'CPU partition'
                },
                {
                    'name': 'gpu',
                    'scheduler': 'slurm',
                    'launcher': 'mpirun',
                    'access':  ['-p gpu'],
                    'environs': ['default'],
                    'max_jobs': 4,
                    'processor': {
                        'num_cpus': 72,
                        'num_sockets': 2,
                        'num_cpus_per_socket': 36,
                        'arch': 'icelake',
                    },
                    'resources': [
                        {
                            'name': '_rfm_gpu',
                            'options': ['--gpus-per-node={num_gpus_per_node}'],
                        }
                    ],
                    'devices': [
                        {
                            'type': 'gpu',
                            'num_devices': 4,
                        }
                    ],
                    'features': ['cpu', 'gpu'],
                    'descr': 'GPU partition'
                },
            ]
        },
    ],
    'environments': [
        {
            'name': 'default',
            'cc': 'cc',
            'cxx': '',
            'ftn': '',
        },
    ],
    'logging': [
        {
            'level': 'debug',
            'handlers': [
                {
                    'type': 'stream',
                    'name': 'stdout',
                    'level': 'info',
                    'format': '%(message)s'
                },
                {
                    'type': 'file',
                    'name': 'reframe.log',
                    'level': 'debug',
                    'format': '[%(asctime)s] %(levelname)s: %(check_info)s: %(message)s',   # noqa: E501
                    'append': False
                }
            ],
            'handlers_perflog': [
                {
                    'type': 'filelog',
                    'prefix': '%(check_system)s/%(check_partition)s',
                    'level': 'info',
                    'format': (
                        '%(check_job_completion_time)s|reframe %(version)s|'
                        '%(check_info)s|jobid=%(check_jobid)s|'
                        '%(check_perf_var)s=%(check_perf_value)s|'
                        'ref=%(check_perf_ref)s '
                        '(l=%(check_perf_lower_thres)s, '
                        'u=%(check_perf_upper_thres)s)|'
                        '%(check_perf_unit)s'
                    ),
                    'append': True
                }
            ]
        }
    ],
}
satishskamath commented 1 year ago

It was the tcn node part. @boegel was right. 👍 I just had to logout of my shell and login again and it worked on a tcn node.