sap-linuxlab / community.sap_install

Automation for SAP - Collection of Ansible Roles for various SAP software installation
Apache License 2.0
53 stars 56 forks source link

sap_hana_preconfigure: fail saptune verify on SLES4SAP 15 SP5 on Azure E20s_v5 #694

Open Tosch-OED opened 7 months ago

Tosch-OED commented 7 months ago

On roles/sap_hana_preconfigure/tasks/SLES/configuration.yml the task Ensure solution was successful for standalone HANA database fails on SLES4SAP 15 SP5 virtual machine on E20s_v5 in Azure.

The command saptune solution verify HANA returns with 1 because of not compliant parameters of note 2684254 and the the foot note setting is not available on the system.

# saptune solution verify HANA

NOTICE: block device related section settings detected: Traversing all block devices can take a considerable amount of time.

   SAPNote, Version | Parameter                          | Expected                    | Override  | Actual                      | Compliant
--------------------+------------------------------------+-----------------------------+-----------+-----------------------------+-----------
   1771258, 8       | LIMIT_@dba_hard_nofile             | @dba hard nofile 1048576    |           | @dba hard nofile 1048576    | yes
   1771258, 8       | LIMIT_@dba_soft_nofile             | @dba soft nofile 1048576    |           | @dba soft nofile 1048576    | yes
   1771258, 8       | LIMIT_@sapsys_hard_nofile          | @sapsys hard nofile 1048576 |           | @sapsys hard nofile 1048576 | yes
   1771258, 8       | LIMIT_@sapsys_soft_nofile          | @sapsys soft nofile 1048576 |           | @sapsys soft nofile 1048576 | yes
   1771258, 8       | LIMIT_@sdba_hard_nofile            | @sdba hard nofile 1048576   |           | @sdba hard nofile 1048576   | yes
   1771258, 8       | LIMIT_@sdba_soft_nofile            | @sdba soft nofile 1048576   |           | @sdba soft nofile 1048576   | yes
   1868829, 5       | fs.aio-max-nr                      | 18446744073709551615        |           | 18446744073709551615        | yes
   1980196, 7       | vm.max_map_count                   | 2147483647                  |           | 2147483647                  | yes
   2382421, 45      | net.core.rmem_max                  |                             |           | 212992                      | yes [7]
   2382421, 45      | net.core.somaxconn                 | 4096                        |           | 4096                        | yes
   2382421, 45      | net.core.wmem_max                  |                             |           | 212992                      | yes [7]
   2382421, 45      | net.ipv4.ip_local_port_range       |                             |           | 32768 60999                 | yes [7]
   2382421, 45      | net.ipv4.ip_local_reserved_ports   |                             |           |                             | yes [7]
   2382421, 45      | net.ipv4.tcp_max_syn_backlog       | 8192                        |           | 8192                        | yes
   2382421, 45      | net.ipv4.tcp_rmem                  |                             |           | 4096 131072 6291456         | yes [7]
   2382421, 45      | net.ipv4.tcp_slow_start_after_idle | 0                           |           | 0                           | yes
   2382421, 45      | net.ipv4.tcp_syn_retries           | 8                           |           | 8                           | yes
   2382421, 45      | net.ipv4.tcp_timestamps            |                             |           | 1                           | yes [7]
   2382421, 45      | net.ipv4.tcp_tw_reuse              |                             |           | 2                           | yes [7]
   2382421, 45      | net.ipv4.tcp_window_scaling        | 1                           |           | 1                           | yes
   2382421, 45      | net.ipv4.tcp_wmem                  |                             |           | 4096 16384 4194304          | yes [7]
   2534844, 15      | kernel.shmmni                      | 32768                       |           | 32768                       | yes
   2578899, 47      | IO_SCHEDULER_sda                   | none                        |           | none                        | yes
   2578899, 47      | IO_SCHEDULER_sdb                   | none                        |           | none                        | yes
   2578899, 47      | IO_SCHEDULER_sdc                   | none                        |           | none                        | yes
   2578899, 47      | IO_SCHEDULER_sdd                   | none                        |           | none                        | yes
   2578899, 47      | IO_SCHEDULER_sde                   | none                        |           | none                        | yes
   2578899, 47      | systemd:sysstat.service            | start                       |           | start, disable              | yes
   2578899, 47      | systemd:uuidd.socket               | start                       |           | start, enable               | yes
   2578899, 47      | vm.dirty_background_bytes          | 314572800                   |           | 314572800                   | yes
   2578899, 47      | vm.dirty_bytes                     | 629145600                   |           | 629145600                   | yes
   2684254, 23      | KSM                                | 0                           |           | 0                           | yes
   2684254, 23      | THP                                | never                       |           | never                       | yes
   2684254, 23      | energy_perf_bias                   | all:0                       |           | all:none                    |  -  [1]
   2684254, 23      | force_latency                      | 70                          |           | all:none                    |  -  [1]
   2684254, 23      | governor                           | all:performance             |           | all:powersave               | no 
   2684254, 23      | grub:intel_idle.max_cstate         | 1                           |           | NA                          | no  [2] [3] [6]
   2684254, 23      | grub:numa_balancing                | disable                     |           | NA                          | no  [2] [3] [6]
   2684254, 23      | grub:processor.max_cstate          | 1                           |           | NA                          | no  [2] [3] [6]
   2684254, 23      | grub:transparent_hugepage          | never                       |           | NA                          | no  [2] [3] [6]
   2684254, 23      | kernel.numa_balancing              | 0                           |           | 0                           | yes
   2993054, 2       | net.ipv4.tcp_keepalive_intvl       | 75                          |           | 75                          | yes
   2993054, 2       | net.ipv4.tcp_keepalive_probes      | 9                           |           | 9                           | yes
   2993054, 2       | net.ipv4.tcp_keepalive_time        | 300                         |           | 300                         | yes
   941735, 11       | ShmFileSystemSizeMB                | 120722                      |           | 120722                      | yes
   941735, 11       | VSZ_TMPFS_PERCENT                  | 75                          |           | 75                          |  -  [15]
   941735, 11       | kernel.shmall                      | 1152921504606846720         |           | 1152921504606846720         | yes
   941735, 11       | kernel.shmmax                      | 18446744073709551615        |           | 18446744073709551615        | yes

  [1] setting is not available on Azure instances (see SAP Note 2993054).
  [2] setting is not available on the system
  [3] value is only checked, but NOT set
  [6] grub settings are mostly covered by other settings. See man page saptune-note(5) for details
  [7] parameter value is untouched by default
 [15] the parameter is only used to calculate the size of tmpfs (/dev/shm)

Attention for SAP Note 2382421:
Hints or values not yet handled by saptune. So please read carefully, check and set manually, if needed:
# SAP HANA Parameters - all '.ini' file changes - not handled by saptune
# WARNING - on systems with iSCSI devices the setting of 'net.ipv4.tcp_syn_retries = 8'
# may result in deferred faulty message of iSCSI paths.

Attention for SAP Note 2993054:
Hints or values not yet handled by saptune. So please read carefully, check and set manually, if needed:
# Settings from SAP Note 1410736 are integrated in this SAP Note.
#
# Parameters energy_perf_bias, governor and force_latency, which are mentioned
# in the recommended OS settings for the SAP HANA databases, are not supported
# on Azure virtual machines. A footnote will show up during verify operation.

Attention for SAP Note 2684254:
Hints or values not yet handled by saptune. So please read carefully, check and set manually, if needed:
# IBM EnergyScale for POWER8 Processor-Based Systems (applies to IBM Power systems only) - not handled by saptune!
# IBM EnergyScale for POWER9 Processor-Based Systems (applies to IBM Power systems only) - not handled by saptune!
# HANA is not supported neither on Intel Cluster-On-Die (COD) technology nor on sub-NUMA clustering technology.

ERROR: The parameters listed above have deviated from the specified SAP solution recommendations.
Wabri commented 7 months ago

It's a known problem of saptune with newer versions of cpupower command on Azure. We have missed to exclude Azure in one of the 'get' functions saptune is using for the verify. It was not needed with the older versions of cpupower, but now we need to adapt the handling. It's part of the saptune 3.2 development currently WIP.

I'll try to find a way to patch this temporarily

sean-freeman commented 6 months ago

@Wabri If this is an error in saptune, can you please cite the bugzilla/jira etc that is tracking it?

I don't want to keep this open indefinitely in this repository, if there is not much we can do within this repository

Wabri commented 6 months ago

@Tosch-OED May I ask the output of saptune status? And if you have it also the logs!

Tosch-OED commented 6 months ago

saptune status

# saptune status

saptune.service:        enabled/active
saptune package:        '3.0.2'
configured version:     '3'
configured Solution:    HANA
configured Notes:       
order of enabled Notes: 941735 1771258 1980196 2578899 2684254 2382421 2534844 2993054 1656250
applied Notes:          1656250 1771258 1980196 2382421 2534844 2578899 2684254 2993054 941735
staging:                disabled
staging area:           

sapconf.service:        disabled/inactive
tuned.service:          disabled/inactive
system state:           degraded

The system state is NOT ok.
Please call '/usr/sbin/saptune_check' to get guidance to resolve the issues!

saptune.log

If you need further information feel free to ask. It's also possible to plan a short meeting to investigate together.

Wabri commented 5 months ago

Hi @Tosch-OED a new version of saptune is out: https://github.com/SUSE/saptune/releases/tag/3.1.2 , would you mind doing some tests to check if everything is ok now? Thanks!