run-llama / LlamaIndexTS

Data framework for your LLM applications, with a focus on server-side solutions.
https://ts.llamaindex.ai
MIT License

Cloud deployment #528

Open austinmw opened 9 months ago

austinmw commented 9 months ago

Hi, are there any directions available for cloud deployment of create-llama applications?

logan-markewich commented 9 months ago

We have a blog post for that! (It assumes you are using API-based models like OpenAI, though; hosting your own models is another beast on its own.)

https://blog.llamaindex.ai/shipping-your-retrieval-augmented-generation-app-to-production-with-create-llama-7bbe43b6287d

austinmw commented 9 months ago

Thanks! I see that it's using OpenAI and deployed to Vercel. I think you'd gain a large audience if you had another blog post on deploying to AWS with Amazon Bedrock + Claude! :)

logan-markewich commented 9 months ago

@austinmw that's as simple as changing the LLM and embeddings in the service context to use Bedrock (which we support! In Python at least)

https://docs.llamaindex.ai/en/stable/examples/embeddings/bedrock.html
https://docs.llamaindex.ai/en/stable/examples/llm/bedrock.html#bedrock

```python
from llama_index import ServiceContext, set_global_service_context

service_context = ServiceContext.from_defaults(llm=llm, embed_model=embed_model)
set_global_service_context(service_context)
```

logan-markewich commented 9 months ago

(Just realized this is the TS repo, not python lol)

austinmw commented 9 months ago

Thanks! Do you happen to know the best way to deploy Next.js apps on AWS? I'm not very familiar with React frameworks 😅

logan-markewich commented 9 months ago

Hmm, I'm not sure. I've only ever used Vercel.

marcusschiesser commented 9 months ago

@austinmw, here are your deployment options for NextJS: https://nextjs.org/docs/pages/building-your-application/deploying#self-hosting. You can even do a static HTML export if you're using just the NextJS frontend (and Python or Express as the backend).
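The static HTML export mentioned above is a one-line config change. A minimal sketch (note: `output: 'export'` is the Next.js 13.3+ mechanism; older versions use the separate `next export` command instead):

```javascript
// next.config.js (minimal sketch of a static-export config)
// `output: 'export'` makes `next build` emit static HTML/CSS/JS into `out/`,
// which can then be served from S3 + CloudFront or any static host.
// This only works if the app avoids server-only features (API routes, SSR).
/** @type {import('next').NextConfig} */
const nextConfig = {
  output: 'export',
};

module.exports = nextConfig;
```

With this in place, running `next build` produces the `out/` directory you upload to your static host; the Python or Express backend is deployed separately.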

LlamaIndexTS already supports using Claude models (see https://github.com/run-llama/LlamaIndexTS/blob/dd054137bf16a043d9581bc70432ff1129640516/packages/core/src/llm/LLM.ts#L617) - you would just have to modify the code generated by create-llama.
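To illustrate the swap described above, here is a hedged sketch of what the change in the create-llama-generated backend might look like. The `Anthropic` class is the one linked above; `serviceContextFromDefaults` and the exact constructor options are assumptions based on the LlamaIndexTS API of that era, so check them against your installed version:

```typescript
// Hypothetical edit to the LLM setup generated by create-llama:
// replace the default OpenAI LLM with Anthropic's Claude.
import { Anthropic, serviceContextFromDefaults } from "llamaindex";

const llm = new Anthropic({
  model: "claude-2", // assumed model name; use one your API key can access
});

// Pass the Claude LLM into the service context used by the index/query engine.
const serviceContext = serviceContextFromDefaults({ llm });
```

The rest of the generated code (index construction, chat route) can stay as-is, since it reads the LLM from the service context rather than hard-coding OpenAI.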

marcusschiesser commented 7 months ago

This also looks like an interesting option to deploy NextJS to AWS: https://docs.sst.dev/start/nextjs