canonical / hotsos

Software analysis toolkit. Define checks in high-level language and leverage library to perform analysis of common Cloud applications.
Apache License 2.0

microceph support #903

Open nishant-dash opened 2 weeks ago

nishant-dash commented 2 weeks ago

Currently, running hotsos with the --storage plugin gives me:

storage:
  ceph:
    release:
      name: unknown
      days-to-eol: null
    services:
      systemd:
        disabled:
          - ceph-volume
        mds:
          - snap.microceph.mds
        mgr:
          - snap.microceph.mgr
        mon:
          - snap.microceph.mon
        osd:
          - snap.microceph.osd
        rgw:
          - snap.microceph.rgw
      ps:
        - ceph-mds (1)
        - ceph-mgr (1)
        - ceph-mon (1)
        - ceph-osd (2)
    dpkg:
      - ceph-volume 17.2.7-0ubuntu0.22.04.1
    snaps:
      - microceph 0+git.4a608fc
    status: HEALTH_OK
    versions:
      mon:
        - 17.2.6
      mgr:
        - 17.2.6
      osd:
        - 17.2.6
      mds:
        - 17.2.6

It would be great if we could add more to this to bring it up to parity with regular ceph.
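As a rough illustration of where the output above looks unpopulated, here is a minimal sketch that walks a summary and reports fields that are null or "unknown". It is not part of hotsos: the `find_gaps` helper is hypothetical, and the `summary` dict is a hand-copied subset of the output above rather than anything produced by a hotsos API.

```python
# Hypothetical helper, not part of hotsos: list summary fields that look
# unpopulated (None, or the literal string "unknown").

# Hand-copied subset of the summary shown above (assumption: the YAML maps
# to plain dicts/lists, with "null" becoming None).
summary = {
    "release": {"name": "unknown", "days-to-eol": None},
    "status": "HEALTH_OK",
    "versions": {
        "mon": ["17.2.6"],
        "mgr": ["17.2.6"],
        "osd": ["17.2.6"],
        "mds": ["17.2.6"],
    },
}


def find_gaps(node, path=""):
    """Return dotted paths of values that are None or "unknown"."""
    gaps = []
    if isinstance(node, dict):
        for key, value in node.items():
            child_path = f"{path}.{key}" if path else key
            gaps.extend(find_gaps(value, child_path))
    elif node is None or node == "unknown":
        gaps.append(path)
    return gaps


print(find_gaps(summary))  # ['release.name', 'release.days-to-eol']
```

For this output, the only gaps it reports are the release name and days-to-eol.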

pponnuvel commented 2 weeks ago

@nishant-dash

The EOL needs to be updated as the release cadence is still being decided (how long a version would be supported, how many versions backwards, etc). There's also one missing piece of info: https://github.com/canonical/hotsos/issues/804

Otherwise hotsos should have parity with regular ceph already. It's also possible that hotsos hasn't caught up with microceph development since we added support in hotsos.

Is there anything specific that's missing currently?

nishant-dash commented 2 weeks ago

There are some errors in my ceph logs at /var/snap/microceph/common/logs/*, and I'm not sure whether hotsos considers them important enough to catch. If the hotsos --storage plugin is indeed going through /var/snap/microceph/common/logs/*, then I think this should be OK.

pponnuvel commented 2 weeks ago

Yes, /var/snap/microceph/common/logs are already looked at by hotsos.

What are the errors you noticed? They might be candidates for a new scenario (for both regular ceph & microceph).
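For gathering the lines in question, a minimal sketch along these lines could pull candidate errors out of the microceph log directory. This is not how hotsos itself searches logs. The `collect_matches` helper is hypothetical, and the `[ERR]`/`[WRN]` pattern is an assumption about how the lines of interest are tagged, so adjust it to match what actually appears in the logs.

```python
import re
from pathlib import Path

# Log location taken from the discussion above.
LOG_DIR = Path("/var/snap/microceph/common/logs")

# Assumption: interesting lines carry an "[ERR]" or "[WRN]" tag.
PATTERN = re.compile(r"\[(ERR|WRN)\]")


def collect_matches(log_dir, pattern=PATTERN):
    """Return (file name, line) pairs for every line matching the pattern."""
    log_dir = Path(log_dir)
    if not log_dir.is_dir():
        return []
    matches = []
    for log_file in sorted(log_dir.glob("*")):
        if not log_file.is_file():
            continue
        # errors="replace" avoids crashing on undecodable bytes, e.g. in
        # compressed rotated logs, which this sketch does not decompress.
        with log_file.open(errors="replace") as handle:
            for line in handle:
                if pattern.search(line):
                    matches.append((log_file.name, line.rstrip("\n")))
    return matches


if __name__ == "__main__":
    for name, line in collect_matches(LOG_DIR):
        print(f"{name}: {line}")
```

Reading these logs may require root privileges, depending on how the snap sets file permissions.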

dosaboy commented 1 week ago

I think this is a dup of https://github.com/canonical/hotsos/issues/775