BharatSahAIyak / tensor.hub

[DMP 2024]: Fractional GPU Utilisation #1

Open singhalkarun opened 5 months ago

singhalkarun commented 5 months ago

Ticket Contents

Description

Create a Linux package with an exposed interface that allows a process to use a fraction of a GPU, similar to how cgroups limit CPU and memory. This package will let users allocate and manage GPU resources efficiently for their applications.

Reference 1: https://docs.run.ai/v2.15/Researcher/Walkthroughs/walkthrough-fractions/

Reference 2: https://github.com/AliyunContainerService/gpushare-scheduler-extender

Goals & Mid-Point Milestone

Mid Point Milestone

Linux package with the following APIs/functionalities:

  1. Reserve GPU for a process
  2. Get Free GPU
  3. Get a list of Current Processes
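
The three functionalities above could be sketched roughly as follows. This is only an illustrative, in-memory model of the bookkeeping, not the package's actual design: the class `GpuLedger`, its method names, and the choice to express a fraction as a share of GPU memory in MiB are all assumptions made for the example. It does not talk to a real GPU or enforce any limit; enforcement would need a separate mechanism.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Tuple


@dataclass
class GpuLedger:
    """Hypothetical bookkeeping for one GPU's memory, tracked in MiB."""

    total_mib: int
    # Maps a process ID to the amount of memory reserved for it (MiB).
    reservations: Dict[int, int] = field(default_factory=dict)

    def reserve(self, pid: int, fraction: float) -> int:
        """Reserve a fraction (0 < fraction <= 1) of the GPU for a process."""
        if not 0 < fraction <= 1:
            raise ValueError("fraction must be in (0, 1]")
        if pid in self.reservations:
            raise ValueError(f"pid {pid} already holds a reservation")
        wanted = int(self.total_mib * fraction)
        if wanted > self.free_mib():
            raise RuntimeError("not enough free GPU memory")
        self.reservations[pid] = wanted
        return wanted

    def release(self, pid: int) -> None:
        """Drop the reservation held by a process, if any."""
        self.reservations.pop(pid, None)

    def free_mib(self) -> int:
        """Return the memory that is not reserved by any process."""
        return self.total_mib - sum(self.reservations.values())

    def processes(self) -> List[Tuple[int, int]]:
        """Return (pid, reserved MiB) pairs, sorted by pid."""
        return sorted(self.reservations.items())


if __name__ == "__main__":
    ledger = GpuLedger(total_mib=16384)
    ledger.reserve(101, 0.25)
    ledger.reserve(202, 0.5)
    print(ledger.free_mib())
    print(ledger.processes())
```

A real package would additionally have to persist this state across processes and back each reservation with an actual limit on the device, which this sketch leaves out.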

Goals

Setup/Installation

No response

Expected Outcome

Acceptance Criteria

Implementation Details

Mockups/Wireframes

No response

Product Name

BharatSahAIyak

Organisation Name

SamagraX

Domain

Open Source Library

Tech Skills Needed

Docker, Linux (internals, networking), GPU (good starter)

Mentor(s)

@singhalkarun

Category

AI

RohanHBTU commented 5 months ago

Hi @singhalkarun, I already have WSL on my machine and have past experience with Docker too. I am very interested in GPUs and Linux. Could you give me some guidance on which approach to consider, so that I can think about it further?

Thank you for your time and consideration.

AbhimanyuSamagra commented 5 months ago

Do not ask process-related questions about how to apply and who to contact in the above ticket. The only questions allowed are about technical aspects of the project itself. If you want help with the process, you can refer to the instructions listed on Unstop, and any further queries can be taken up on our Discord channel titled DMP queries. Here's a Video Tutorial on how to submit a proposal for a project.

Bre-31 commented 5 months ago

Hello @singhalkarun, I will be actively working on this project.

H4R5H1T-007 commented 5 months ago

I found the genv library from run.ai. Maybe we can use it, but the problem is that its Kubernetes integration is still in progress and not yet complete.

Saswatsusmoy commented 5 months ago

@singhalkarun @AbhimanyuSamagra

I have a few questions regarding the project and how I should approach it.