coiled / feedback

A place to provide Coiled feedback
14 stars 3 forks source link

Feature request: Show number of used cores #108

Closed FabioRosado closed 3 years ago

FabioRosado commented 3 years ago

Val mentioned that it would be good to check the number of cores that a user has used so far, something similar to the message we send when a user requests more cores than the ones that are available.

worker_count_requested={'message': 'Requested core count is beyond user limits', 'request': '96', 'active': '18', 'limit': '100'}

We could add a command to the API to get this count and perhaps add it to the frontend?

jrbourbeau commented 3 years ago

That seems like a reasonable request. We might provide something like a coiled.info() method which prints out the number of used cores, core limit, number of GPUs, GPU limit, coiled version, dask & distributed version, etc. This could also be a good opportunity to utilize rich for nice outputting.

FabioRosado commented 3 years ago

That seems like a great idea, I was looking at the rich docs and I'm curious about the tables and text formatting! If everybody thinks this is a good idea, I'll add it to my to-do list when I touch the CLI

jrbourbeau commented 3 years ago

I'll add it to my to-do list

That sounds great, thanks @FabioRosado! FWIW when thinking about what content to add, I'm hoping coiled.info() can provide both

I was looking at the rich docs and I'm curious about the tables and text formatting!

🎉

ian-r-rose commented 3 years ago

That seems like a great idea, I was looking at the rich docs and I'm curious about the tables and text formatting!

This sounds like a great idea. I'd add that if you are in a notebook setting, you can produce an HTML table/other markup for nicer reading.

dantheman39 commented 3 years ago

There should probably also be a spot for this on the clusters dashboard in the frontend

FabioRosado commented 3 years ago

Do we want to show how many cores a user has used only or do you think it's useful to add a separate count as well? One for all active clusters and other for all the active jobs?

jrbourbeau commented 3 years ago

I can see both clusters and jobs both being useful. Though honestly I'm not sure how jobs tie into our current resource limit system. Is it 100 cores total, 100 cores for clusters and jobs each, or something else?

necaris commented 3 years ago

:+1: to showing both. Right now we give them 100 cores total (with a 10% graceful extension factor on clusters).

FabioRosado commented 3 years ago

Awesome thank you for the help, I'll add both 👍

dantheman39 commented 3 years ago

Excited for this and can see it being very useful.

But could we solicit more explicit method / api names? "info" is very generic.

dantheman39 commented 3 years ago

Closed by https://github.com/coiled/cloud/pull/1721