ARCLab-MIT / kspdg

ARCLab-MIT participation in the KSPDG challenge
MIT License
4 stars 0 forks source link

Check how the API response time varies now between business and personal accounts #23

Open vrodriguezf opened 10 months ago

vrodriguezf commented 10 months ago

Do it for different models

OhhTuRnz commented 9 months ago

I'll be adding some context given our Slack convo:

vrodriguezf commented 9 months ago

My thought right now is that, in the approach of replicating human behaviour, RAG (customGPTs) or prompt engineering themselves, without fine tuning, are not going to get us anywhere here because of the latency, especially RAG.

In other paths different than human behaviour cloning, such as the one that @DumplingLife is purusing of trying to forecast the future state of each object and make a plan based on that, these approaches could be more useful, despite being slower.

Anyway, as they say in this (fantastic talk)[https://www.youtube.com/watch?v=ahnGLM-RC1Y&t=1429s] by OpenAI to optimize LLMs, the best approach to explore LLM optimal usage is not exclusively one path (RAG, prompt engineering, fine tuning), but it can include multiple aspects. image

vrodriguezf commented 9 months ago

So the question, related to this issue is: Are fine-tuned models with the arclab business account slower than fine-tuned models in a personal account? This is extremely relevant