thoth-station / core

Using Artificial Intelligence to analyse and recommend Software Stacks for Artificial Intelligence applications.
https://thoth-station.github.io/
GNU General Public License v3.0
28 stars 25 forks source link

Kebechet overview dashboard and web page metrics #322

Open pacospace opened 2 years ago

pacospace commented 2 years ago

Is your feature request related to a problem? Please describe. As User of Kebechet,

I would like to have an overview of the use of Kebechet:

As Maintaer of Kebechet,

I would like to have a look at a dashboard with all metrics related to Kebechet

High-level Goals

Describe the solution you'd like

Collect the following metrics:

Usage

Operational

The above metrics should be combined to allow the managers to set the following SLO:

Additional context

Acceptance Criteria

cc @KPostOffice @xtuchyna

goern commented 2 years ago

Will this provide all the data required to fill in the blanks at https://goern.github.io/kebechet-universe/ ?

pacospace commented 2 years ago

Will this provide all the data required to fill in the blanks at https://goern.github.io/kebechet-universe/ ?

Yes it is part of the acceptance criteria!

pacospace commented 2 years ago

Related-To: https://github.com/thoth-station/slo-reporter/issues/210

goern commented 2 years ago

@KPostOffice @xtuchyna any update on this?!

xtuchyna commented 2 years ago

sorry for delay, preparing test for daily data&metrics aggregation https://github.com/thoth-station/thoth-application/pull/1954

pacospace commented 2 years ago

Related-To: https://github.com/thoth-station/kebechet/issues/679 https://github.com/thoth-station/kebechet/issues/546

goern commented 2 years ago

ping, any decision on this?

pacospace commented 2 years ago

ping, any decision on this?

We are working on it with @hemajv :) we are waiting also for grafana to be back in the clusters, cc @harshad16

codificat commented 2 years ago

/lifecycle active

codificat commented 2 years ago

We are working on it

/triage accepted

pacospace commented 2 years ago

The initial dashboard is available at: https://grafana.operate-first.cloud/d/bBFI9MJnk/kebechet-monitoring?orgId=1 cc @hemajv

pacospace commented 2 years ago

We need to extend dashboard with Kebechet github metrics cc @xtuchyna

xtuchyna commented 2 years ago

Waiting for https://github.com/thoth-station/thoth-application/issues/2333 to be resolved

KPostOffice commented 2 years ago

Hey @pacospace, I added a comment to an issue here: https://github.com/thoth-station/kebechet/issues/825. I feel like it doesn't fit here as it is more of an operational metric, but I figured I'd link it in case it is something we might want to include.

sesheta commented 2 years ago

Issues go stale after 90d of inactivity. Mark the issue as fresh with /remove-lifecycle stale. Stale issues rot after an additional 30d of inactivity and eventually close.

If this issue is safe to close now please do so with /close.

/lifecycle stale

sesheta commented 1 year ago

Stale issues rot after 30d of inactivity. Mark the issue as fresh with /remove-lifecycle rotten. Rotten issues close after an additional 30d of inactivity.

If this issue is safe to close now please do so with /close.

/lifecycle rotten