NetApp / harvest

Open-metrics endpoint for ONTAP and StorageGRID
https://netapp.github.io/harvest/latest
Apache License 2.0
150 stars 37 forks source link

Harvest should document which metrics each dashboard uses #1577

Closed cgrinds closed 1 year ago

cgrinds commented 1 year ago

Thanks to Chris Waltham on Discord for raising

Results for commit 716e111a

bin/harvest grafana metrics
7mode/aggregate7.json
- aggr_inode_files_total
- aggr_inode_files_used
- aggr_inode_inodefile_private_capacity
- aggr_inode_inodefile_public_capacity
- aggr_inode_used_percent
- aggr_labels
- aggr_new_status
- aggr_raid_disk_count
- aggr_snapshot_files_total
- aggr_snapshot_files_used
- aggr_snapshot_inode_used_percent
- aggr_snapshot_maxfiles_available
- aggr_snapshot_maxfiles_possible
- aggr_snapshot_maxfiles_used
- aggr_snapshot_size_available
- aggr_snapshot_size_used
- aggr_snapshot_used_percent
- aggr_space_available
- aggr_space_sis_saved
- aggr_space_sis_saved_percent
- aggr_space_total
- aggr_space_used
- aggr_space_used_percent
- aggr_volume_count_flexvol
- node_labels

7mode/cluster7.json
- aggr_disk_max_busy
- aggr_space_total
- aggr_space_used
- aggr_space_used_percent
- cluster_subsystem_new_status
- cluster_subsystem_outstanding_alerts
- cluster_subsystem_suppressed_alerts
- disk_busy
- node_cpu_busy
- node_labels
- node_new_status
- volume_avg_latency
- volume_read_data
- volume_total_ops
- volume_write_data

7mode/disk7.json
- aggr_disk_busy
- aggr_disk_max_busy
- aggr_disk_max_total_transfers
- aggr_raid_disk_count
- aggr_space_total
- aggr_space_used_percent
- disk_labels
- disk_sectors
- disk_uptime
- flashcache_disk_reads_replaced
- flashcache_hit_percent
- flashpool_read_ops_replaced
- flashpool_read_ops_replaced_percent
- hostadapter_bytes_read
- hostadapter_bytes_written
- node_disk_data_read
- node_disk_data_written
- node_labels
- wafl_cp_count

7mode/lun7.json
- lun_avg_read_latency
- lun_avg_write_latency
- lun_labels
- lun_new_status
- lun_read_align_histo
- lun_read_data
- lun_read_ops
- lun_size
- lun_size_used
- lun_write_align_histo
- lun_write_data
- lun_write_ops
- node_labels
- volume_size_used_percent

7mode/network7.json
- fcp_avg_read_latency
- fcp_avg_write_latency
- fcp_read_data
- fcp_read_ops
- fcp_total_data
- fcp_total_ops
- fcp_util_percent
- fcp_write_data
- fcp_write_ops
- nic_labels
- nic_new_status
- nic_rx_bytes
- nic_rx_total_errors
- nic_tx_bytes
- nic_tx_total_errors
- nic_util_percent
- node_labels

7mode/node7.json
- aggr_new_status
- disk_busy
- fcp_util_percent
- fcp_write_data
- iscsi_lif_avg_latency
- iscsi_lif_iscsi_other_ops
- nic_tx_bytes
- nic_util_percent
- node_avg_processor_busy
- node_cifs_latency
- node_cifs_op_count
- node_cifs_ops
- node_cpu_busy
- node_fcp_ops
- node_iscsi_ops
- node_labels
- node_new_status
- node_nfs_latency
- node_nfs_ops
- node_nfs_read_avg_latency
- node_nfs_read_ops
- node_nfs_total_ops
- node_nfs_write_avg_latency
- node_nfs_write_ops
- node_uptime
- volume_avg_latency
- volume_other_latency
- volume_other_ops
- volume_read_data
- volume_read_latency
- volume_read_ops
- volume_total_ops
- volume_write_data
- volume_write_latency
- volume_write_ops
- wafl_cp_phase_times
- wafl_read_io_type

7mode/shelf7.json
- shelf_fan_rpm
- shelf_labels
- shelf_new_status
- shelf_sensor_reading
- shelf_temperature_reading
- shelf_voltage_reading

7mode/volume7.json
- node_labels
- volume_avg_latency
- volume_labels
- volume_new_status
- volume_read_data
- volume_read_latency
- volume_read_ops
- volume_size_total
- volume_size_used
- volume_size_used_percent
- volume_total_ops
- volume_write_data
- volume_write_latency
- volume_write_ops

cmode/aggregate.json
- aggr_disk_busy
- aggr_inode_files_total
- aggr_inode_files_used
- aggr_inode_inodefile_private_capacity
- aggr_inode_inodefile_public_capacity
- aggr_inode_used_percent
- aggr_labels
- aggr_logical_used_wo_snapshots
- aggr_logical_used_wo_snapshots_flexclones
- aggr_new_status
- aggr_physical_used_wo_snapshots
- aggr_physical_used_wo_snapshots_flexclones
- aggr_raid_disk_count
- aggr_snapshot_files_total
- aggr_snapshot_files_used
- aggr_snapshot_inode_used_percent
- aggr_snapshot_maxfiles_available
- aggr_snapshot_maxfiles_possible
- aggr_snapshot_maxfiles_used
- aggr_snapshot_reserve_percent
- aggr_snapshot_size_available
- aggr_snapshot_size_used
- aggr_snapshot_used_percent
- aggr_space_available
- aggr_space_capacity_tier_used
- aggr_space_data_compaction_saved
- aggr_space_data_compaction_saved_percent
- aggr_space_physical_used
- aggr_space_sis_saved
- aggr_space_sis_saved_percent
- aggr_space_total
- aggr_space_used
- aggr_space_used_percent
- aggr_total_logical_used
- aggr_total_physical_used
- aggr_volume_count_flexvol
- node_labels

cmode/cdot.json
- aggr_space_total
- aggr_space_used
- node_cifs_ops
- node_cpu_busy
- node_labels
- node_nfs_ops
- node_volume_avg_latency
- svm_labels
- svm_vol_avg_latency
- svm_vol_read_data
- svm_vol_total_ops
- svm_vol_write_data
- volume_avg_latency
- volume_labels
- volume_read_data
- volume_size_total
- volume_size_used
- volume_total_ops
- volume_write_data

cmode/cluster.json
- aggr_disk_busy
- aggr_logical_used_wo_snapshots
- aggr_logical_used_wo_snapshots_flexclones
- aggr_physical_used_wo_snapshots
- aggr_physical_used_wo_snapshots_flexclones
- aggr_space_total
- aggr_space_used
- aggr_space_used_percent
- aggr_total_logical_used
- aggr_total_physical_used
- cluster_new_status
- cluster_subsystem_new_status
- cluster_subsystem_outstanding_alerts
- cluster_subsystem_suppressed_alerts
- environment_sensor_average_ambient_temperature
- environment_sensor_average_fan_speed
- environment_sensor_average_temperature
- environment_sensor_max_fan_speed
- environment_sensor_max_temperature
- environment_sensor_min_ambient_temperature
- environment_sensor_min_fan_speed
- environment_sensor_min_temperature
- environment_sensor_power
- node_avg_processor_busy
- node_cpu_busy
- node_disk_busy
- node_disk_max_busy
- node_labels
- node_new_status
- node_volume_avg_latency
- node_volume_read_data
- node_volume_read_latency
- node_volume_total_ops
- node_volume_write_data
- node_volume_write_latency
- svm_vol_avg_latency
- svm_vol_read_data
- svm_vol_total_ops
- svm_vol_write_data
- volume_avg_latency
- volume_read_data
- volume_total_ops
- volume_write_data

cmode/compliance.json
- cluster_peer_labels
- cluster_peer_non_encrypted
- ntpserver_labels
- security_account_activediruser
- security_account_certificateuser
- security_account_labels
- security_account_ldapuser
- security_account_localuser
- security_account_samluser
- security_audit_destination_status
- security_certificate_labels
- security_labels
- security_login_labels
- security_ssh_labels
- support_labels
- svm_labels
- svm_ldap_encrypted
- svm_ldap_signed

cmode/data_protection_snapshot.json
- snapshot_policy_total_schedules
- volume_labels
- volume_snapshot_count
- volume_snapshot_reserve_size
- volume_snapshots_size_used

cmode/disk.json
- aggr_disk_busy
- aggr_disk_max_busy
- aggr_disk_max_total_transfers
- aggr_disk_max_user_read_chain
- aggr_disk_max_user_write_chain
- aggr_raid_disk_count
- aggr_space_total
- aggr_space_used_percent
- disk_labels
- disk_sectors
- disk_stats_average_latency
- disk_stats_io_kbps
- disk_uptime
- flashcache_disk_reads_replaced
- flashcache_hit_percent
- flashpool_read_ops_replaced
- flashpool_read_ops_replaced_percent
- hostadapter_bytes_read
- hostadapter_bytes_written
- node_disk_data_read
- node_disk_data_written
- node_labels
- node_vol_write_latency
- wafl_cp_count

cmode/headroom.json
- headroom_aggr_current_latency
- headroom_aggr_current_ops
- headroom_aggr_current_utilization
- headroom_aggr_optimal_point_latency
- headroom_aggr_optimal_point_ops
- headroom_aggr_optimal_point_utilization
- headroom_cpu_current_latency
- headroom_cpu_current_ops
- headroom_cpu_current_utilization
- headroom_cpu_optimal_point_latency
- headroom_cpu_optimal_point_ops
- headroom_cpu_optimal_point_utilization
- volume_labels

cmode/lun.json
- lun_avg_read_latency
- lun_avg_write_latency
- lun_caw_reqs
- lun_labels
- lun_new_status
- lun_read_align_histo
- lun_read_data
- lun_read_ops
- lun_remote_bytes
- lun_remote_ops
- lun_size
- lun_size_used
- lun_unmap_reqs
- lun_write_align_histo
- lun_write_data
- lun_write_ops
- lun_writesame_reqs
- lun_writesame_unmap_reqs
- lun_xcopy_reqs
- node_labels
- qos_detail_volume_resource_latency
- volume_sis_compress_saved_percent
- volume_sis_dedup_saved_percent
- volume_size_used_percent
- volume_snapshot_reserve_used_percent

cmode/mcc_cluster.json
- aggr_disk_max_busy
- aggr_new_status
- fcvi_rdma_write_avg_latency
- fcvi_rdma_write_ops
- fcvi_rdma_write_throughput
- hostadapter_bytes_read
- hostadapter_bytes_written
- node_avg_processor_busy
- node_labels
- path_read_data
- path_read_iops
- path_read_latency
- path_write_data
- path_write_iops
- path_write_latency
- plex_disk_busy
- plex_disk_user_read_latency
- plex_disk_user_reads
- plex_disk_user_write_latency
- plex_disk_user_writes
- volume_avg_latency

cmode/metadata.json
- metadata_collector_api_time
- metadata_collector_calc_time
- metadata_collector_metrics
- metadata_collector_parse_time
- metadata_collector_poll_time
- metadata_component_count
- metadata_component_status
- metadata_exporter_count
- metadata_exporter_time
- metadata_target_goroutines
- metadata_target_ping
- metadata_target_status
- poller_cpu
- poller_cpu_percent
- poller_fds
- poller_io
- poller_memory
- poller_memory_percent
- poller_net
- poller_status
- poller_threads

cmode/network.json
- fcp_avg_read_latency
- fcp_avg_write_latency
- fcp_discarded_frames_count
- fcp_int_count
- fcp_invalid_crc
- fcp_invalid_transmission_word
- fcp_isr_count
- fcp_link_down
- fcp_link_failure
- fcp_loss_of_signal
- fcp_loss_of_sync
- fcp_nvmf_avg_read_latency
- fcp_nvmf_avg_write_latency
- fcp_nvmf_read_data
- fcp_nvmf_read_ops
- fcp_nvmf_total_data
- fcp_nvmf_total_ops
- fcp_nvmf_write_data
- fcp_nvmf_write_ops
- fcp_prim_seq_err
- fcp_queue_full
- fcp_read_data
- fcp_spurious_int_count
- fcp_threshold_full
- fcp_total_data
- fcp_util_percent
- fcp_write_data
- nic_labels
- nic_new_status
- nic_rx_alignment_errors
- nic_rx_bytes
- nic_rx_crc_errors
- nic_rx_length_errors
- nic_rx_total_errors
- nic_tx_bytes
- nic_tx_hw_errors
- nic_tx_total_errors
- nic_util_percent
- node_labels

cmode/nfs4storePool.json
- nfs_diag_storePool_ByteLockAlloc
- nfs_diag_storePool_ByteLockMax
- nfs_diag_storePool_ClientAlloc
- nfs_diag_storePool_ClientMax
- nfs_diag_storePool_ConnectionParentSessionReferenceAlloc
- nfs_diag_storePool_ConnectionParentSessionReferenceMax
- nfs_diag_storePool_CopyStateAlloc
- nfs_diag_storePool_CopyStateMax
- nfs_diag_storePool_DelegAlloc
- nfs_diag_storePool_DelegMax
- nfs_diag_storePool_DelegStateAlloc
- nfs_diag_storePool_DelegStateMax
- nfs_diag_storePool_LayoutAlloc
- nfs_diag_storePool_LayoutMax
- nfs_diag_storePool_LayoutStateAlloc
- nfs_diag_storePool_LayoutStateMax
- nfs_diag_storePool_LockAlloc
- nfs_diag_storePool_LockMax
- nfs_diag_storePool_LockStateAlloc
- nfs_diag_storePool_LockStateMax
- nfs_diag_storePool_OpenAlloc
- nfs_diag_storePool_OpenMax
- nfs_diag_storePool_OpenStateAlloc
- nfs_diag_storePool_OpenStateMax
- nfs_diag_storePool_OwnerAlloc
- nfs_diag_storePool_OwnerMax
- nfs_diag_storePool_SessionAlloc
- nfs_diag_storePool_SessionConnectionHolderAlloc
- nfs_diag_storePool_SessionConnectionHolderMax
- nfs_diag_storePool_SessionHolderAlloc
- nfs_diag_storePool_SessionHolderMax
- nfs_diag_storePool_SessionMax
- nfs_diag_storePool_StateRefHistoryAlloc
- nfs_diag_storePool_StateRefHistoryMax
- nfs_diag_storePool_StringAlloc
- nfs_diag_storePool_StringMax
- node_labels

cmode/nfs_clients.json
- nfs_clients_idle_duration
- volume_labels

cmode/node.json
- aggr_new_status
- fcp_lif_avg_latency
- fcp_lif_total_ops
- fcp_lif_write_data
- fcp_nvmf_read_data
- fcp_nvmf_write_data
- fcp_read_data
- fcp_util_percent
- fcp_write_data
- iscsi_lif_avg_latency
- iscsi_lif_iscsi_other_ops
- iscsi_lif_write_data
- nic_rx_bytes
- nic_tx_bytes
- nic_util_percent
- node_avg_processor_busy
- node_cifs_connections
- node_cifs_established_sessions
- node_cifs_latency
- node_cifs_op_count
- node_cifs_open_files
- node_cifs_ops
- node_cifs_signed_sessions
- node_cpu_busy
- node_cpu_domain_busy
- node_disk_busy
- node_disk_max_busy
- node_failed_fan
- node_failed_power
- node_fcp_ops
- node_iscsi_ops
- node_labels
- node_new_status
- node_nfs_latency
- node_nfs_ops
- node_nfs_read_avg_latency
- node_nfs_read_ops
- node_nfs_read_throughput
- node_nfs_throughput
- node_nfs_total_ops
- node_nfs_write_avg_latency
- node_nfs_write_ops
- node_nfs_write_throughput
- node_nvmf_ops
- node_uptime
- nvme_lif_avg_latency
- nvme_lif_total_ops
- nvme_lif_write_data
- volume_avg_latency
- volume_other_latency
- volume_other_ops
- volume_read_data
- volume_read_latency
- volume_read_ops
- volume_total_ops
- volume_write_data
- volume_write_latency
- volume_write_ops
- wafl_cp_phase_times
- wafl_read_io_type

cmode/power.json
- environment_sensor_average_ambient_temperature
- environment_sensor_average_fan_speed
- environment_sensor_average_temperature
- environment_sensor_max_fan_speed
- environment_sensor_max_temperature
- environment_sensor_min_ambient_temperature
- environment_sensor_min_fan_speed
- environment_sensor_min_temperature
- environment_sensor_power
- node_labels
- shelf_average_ambient_temperature
- shelf_average_fan_speed
- shelf_average_temperature
- shelf_disk_count
- shelf_labels
- shelf_max_fan_speed
- shelf_max_temperature
- shelf_min_ambient_temperature
- shelf_min_fan_speed
- shelf_min_temperature
- shelf_new_status
- shelf_power

cmode/qtree.json
- qtree_cifs_ops
- qtree_internal_ops
- qtree_labels
- qtree_nfs_ops
- qtree_total_ops
- quota_disk_used
- quota_files_used
- volume_labels

cmode/quotaReport.json
- quota_disk_limit
- quota_disk_used
- quota_disk_used_pct_disk_limit
- quota_file_limit
- quota_files_used
- quota_files_used_pct_file_limit
- quota_soft_disk_limit
- quota_soft_file_limit
- volume_labels

cmode/security.json
- security_account_activediruser
- security_account_certificateuser
- security_account_ldapuser
- security_account_localuser
- security_account_samluser
- security_certificate_labels
- svm_labels
- volume_labels

cmode/shelf.json
- shelf_average_ambient_temperature
- shelf_average_fan_speed
- shelf_average_temperature
- shelf_disk_count
- shelf_fan_rpm
- shelf_labels
- shelf_max_fan_speed
- shelf_max_temperature
- shelf_min_ambient_temperature
- shelf_min_fan_speed
- shelf_min_temperature
- shelf_new_status
- shelf_power
- shelf_psu_power_drawn
- shelf_psu_power_rating
- shelf_sensor_reading
- shelf_temperature_reading
- shelf_voltage_reading

cmode/snapmirror.json
- snapmirror_break_failed_count
- snapmirror_break_successful_count
- snapmirror_labels
- snapmirror_lag_time
- snapmirror_last_transfer_duration
- snapmirror_last_transfer_size
- snapmirror_resync_failed_count
- snapmirror_resync_successful_count
- snapmirror_update_failed_count
- snapmirror_update_successful_count
- volume_labels

cmode/svm.json
- copy_manager_kb_copied
- fcp_lif_avg_latency
- fcp_lif_avg_other_latency
- fcp_lif_avg_read_latency
- fcp_lif_avg_write_latency
- fcp_lif_other_ops
- fcp_lif_read_data
- fcp_lif_read_ops
- fcp_lif_total_ops
- fcp_lif_write_data
- fcp_lif_write_ops
- iscsi_lif_avg_latency
- iscsi_lif_avg_other_latency
- iscsi_lif_avg_read_latency
- iscsi_lif_avg_write_latency
- iscsi_lif_iscsi_other_ops
- iscsi_lif_iscsi_read_ops
- iscsi_lif_iscsi_write_ops
- iscsi_lif_read_data
- iscsi_lif_write_data
- lif_recv_data
- lif_sent_data
- nvme_lif_avg_latency
- nvme_lif_avg_other_latency
- nvme_lif_avg_read_latency
- nvme_lif_avg_write_latency
- nvme_lif_other_ops
- nvme_lif_read_data
- nvme_lif_read_ops
- nvme_lif_total_ops
- nvme_lif_write_data
- nvme_lif_write_ops
- qos_detail_resource_latency
- qos_latency
- qos_ops
- qos_read_data
- qos_read_latency
- qos_read_ops
- qos_sequential_reads
- qos_sequential_writes
- qos_write_data
- qos_write_latency
- qos_write_ops
- svm_cifs_connections
- svm_cifs_latency
- svm_cifs_op_count
- svm_cifs_open_files
- svm_cifs_read_latency
- svm_cifs_read_ops
- svm_cifs_write_latency
- svm_cifs_write_ops
- svm_nfs_latency
- svm_nfs_ops
- svm_nfs_read_avg_latency
- svm_nfs_read_ops
- svm_nfs_read_throughput
- svm_nfs_read_total
- svm_nfs_throughput
- svm_nfs_write_avg_latency
- svm_nfs_write_ops
- svm_nfs_write_throughput
- svm_nfs_write_total
- svm_read_total
- svm_vol_avg_latency
- svm_vol_other_latency
- svm_vol_other_ops
- svm_vol_read_data
- svm_vol_read_latency
- svm_vol_read_ops
- svm_vol_total_ops
- svm_vol_write_data
- svm_vol_write_latency
- svm_vol_write_ops
- svm_vscan_connections_active
- svm_vscan_dispatch_latency
- svm_vscan_scan_latency
- svm_vscan_scan_noti_received_rate
- svm_vscan_scan_request_dispatched_rate
- svm_write_total
- volume_labels
- volume_read_data
- volume_read_latency
- volume_read_ops
- volume_sis_compress_saved
- volume_sis_compress_saved_percent
- volume_sis_dedup_saved
- volume_sis_dedup_saved_percent
- volume_sis_total_saved
- volume_size_used_percent
- volume_snapshot_reserve_used_percent
- volume_write_data
- volume_write_latency
- volume_write_ops

cmode/volume.json
- fabricpool_cloud_bin_op_latency_average
- fabricpool_cloud_bin_operation
- qos_detail_volume_resource_latency
- qos_volume_read_data
- qos_volume_read_latency
- qos_volume_read_ops
- qos_volume_sequential_reads
- qos_volume_sequential_writes
- qos_volume_write_data
- qos_volume_write_latency
- qos_volume_write_ops
- volume_avg_latency
- volume_labels
- volume_new_status
- volume_read_data
- volume_read_latency
- volume_read_ops
- volume_sis_compress_saved
- volume_sis_dedup_saved
- volume_size
- volume_size_total
- volume_size_used
- volume_size_used_percent
- volume_snapshot_reserve_available
- volume_snapshot_reserve_percent
- volume_snapshot_reserve_size
- volume_snapshot_reserve_used_percent
- volume_snapshots_size_available
- volume_snapshots_size_used
- volume_space_logical_used
- volume_space_logical_used_percent
- volume_space_physical_used
- volume_space_physical_used_percent
- volume_total_ops
- volume_write_data
- volume_write_latency
- volume_write_ops

storagegrid/tenant.json
- bucket_bytes
- bucket_objects
- tenant_labels
- tenant_logical_quota
- tenant_logical_used
- tenant_objects
- tenant_used_percent
cgrinds commented 1 year ago

Verified in 23.02

cgrinds commented 5 months ago

Set of metrics used by dashboards as of 24.05

7mode/aggregate7.json

7mode/cluster7.json

7mode/disk7.json

7mode/lun7.json

7mode/network7.json

7mode/node7.json

7mode/shelf7.json

7mode/volume7.json

cmode/aggregate.json

cmode/cdot.json

cmode/changelogMonitor.json

cmode/cluster.json

cmode/compliance.json

cmode/data_protection_snapshot.json

cmode/datacenter.json

cmode/details/volumeBySVM.json

cmode/details/volumeDeepDive.json

cmode/disk.json

cmode/external_service_op.json

cmode/flexcache.json

cmode/flexgroup.json

cmode/fsa.json

cmode/headroom.json

cmode/health.json

cmode/lun.json

cmode/mcc_cluster.json

cmode/metadata.json

cmode/namespace.json

cmode/network.json

cmode/nfs4storePool.json

cmode/nfsTroubleshooting.json

cmode/nfs_clients.json

cmode/node.json

cmode/power.json

cmode/qtree.json

cmode/quotaReport.json

cmode/s3ObjectStorage.json

cmode/security.json

cmode/shelf.json

cmode/smb.json

cmode/snapmirror.json

cmode/svm.json

cmode/volume.json

cmode/workload.json

storagegrid/fabricpool.json

storagegrid/overview.json

storagegrid/tenant.json