kubeedge / ianvs

Distributed Synergy AI Benchmarking
https://ianvs.readthedocs.io
Apache License 2.0
110 stars 45 forks source link

Cloud-edge collaborative speculative decoding for LLM based on KubeEdge-Ianvs #126

Open hsj576 opened 1 month ago

hsj576 commented 1 month ago
kairveeehh commented 1 month ago

Hi @hsj576 I am kairvee and would like to take up this project under LFX mentorship as it aligns with my interests and skills as well , could you assign it and let me know the further details?

hsj576 commented 1 month ago

Hi @hsj576 I am kairvee and would like to take up this project under LFX mentorship as it aligns with my interests and skills as well , could you assign it and let me know the further details?

The details of the project will be discussed in the weekly SIG ai meeting. Feel free to join us in https://zoom.us/j/4167237304 every Thursday 16:30 UTC+8.

kairveeehh commented 1 month ago

Hi @hsj576 I am kairvee and would like to take up this project under LFX mentorship as it aligns with my interests and skills as well , could you assign it and let me know the further details?

The details of the project will be discussed in the weekly SIG ai meeting. Feel free to join us in https://zoom.us/j/4167237304 every Thursday 16:30 UTC+8.

okk sure , thankyou

Ytemiloluwa commented 1 month ago

Hello @hsj576 my name is Temi I am an open source developer. I have genuine interest to work on this project this fall.

Ytemiloluwa commented 1 month ago

Hi @hsj576 I am kairvee and would like to take up this project under LFX mentorship as it aligns with my interests and skills as well , could you assign it and let me know the further details?

The details of the project will be discussed in the weekly SIG ai meeting. Feel free to join us in https://zoom.us/j/4167237304 every Thursday 16:30 UTC+8.

it seems I missed the meeting. I wasn't familiar with the time zone

kairveeehh commented 1 month ago

Hi @hsj576, Need your reviews and the way forward to contributing

Proposed Approach

  1. Speculative Decoding Implementation:

    • I plan to set up a basic speculative decoding pipeline using a draft model to parallelize the decoding process, thus improving the inference speed of the LLM.
  2. Cloud-Edge Architecture:

    • For cloud-edge collaboration, I will deploy the draft model at the edge to handle initial predictions and the full model in the cloud for verification and refinement. This setup aims to optimize resource usage and reduce latency.
  3. Testing and Optimization:

    • I will benchmark the system to evaluate performance improvements and ensure that the solution does not compromise accuracy. I will also explore possible optimizations to enhance the efficiency of the cloud-edge collaboration.
lazyperson1020 commented 1 month ago

@hsj576 Are there any pre-tests to submit? I am interested to contribute.

aryan0931 commented 1 month ago

@hsj576 I'm Aryan , I would like to take his project under LFX mentorship as it aligns perfectly with my skills, are there any pretest to submit?

Sid260303 commented 1 month ago

@hsj576, Hello sir, I am Siddhant. I would like to work on this project under LFX mentorship as working on LLMs have been my aim and this opportunity is best to kick-start my journey in open source. I also missed the weekly meeting and would like to know more about the project and how can I help. Also, it would be very kind of you if you could share some resources to make ourselves more prepared for the project

hsj576 commented 1 month ago

I will release a pretest in next week.

Ytemiloluwa commented 1 month ago

I will release a pretest in next week.

Hello @hsj576 the application for mentees close on 13th August (Tuesday).

hsj576 commented 1 month ago

I will release a pretest in next week.

Hello @hsj576 the application for mentees close on 13th August (Tuesday).

Ok, I will release the pretest as soon as possible (before 9th August).

FuryMartin commented 1 month ago

Hi, I hope to take on this project. I would like to highlight my advantages and the contributions I can make to the community.

I have a relatively deep understanding of the concept of Edge Computing and LLMs, and I am familiar with the main strategies for LLM cloud-edge collaboration, as well as the principles and implementation methods of Speculative Decoding.

Additionally, I have strong programming skills, particularly in Python and PyTorch. Last year, I interned at a large AI company specializing in LLMs.

What's more, I am trying to conduct a research about LLM cloud-edge collaboration strategy using Ianvs. After two months of learning, I have gained a good understanding of Ianvs's architecture, interfaces, and features. Now I am attempting to introduce a new feature which is relevant to the issue. If I am accepted, the integration of my insights could lead to a more coherent and rational architectural design.

Eagerly await consideration of my taking on this task!

sanggusti commented 1 month ago

Hi @hsj576

I would like to know more on your weekly meet. Do you have calendar link that I can add myself into? I would like to know more about this project.

And for the Collaborative Speculative Decoding, do you mind to share the paper of this technique? I just find a Collaborative Decoding paper online on Arxiv(https://arxiv.org/html/2406.12295v1) and can't find the Collaborative Speculative Decoding one.

hsj576 commented 1 month ago

Details of the pretest are released in #130.