RafalWilinski / cloudflare-rag

Fullstack "Chat with your PDFs" RAG (Retrieval Augmented Generation) app built fully on Cloudflare
https://rwilinski.ai
458 stars 59 forks source link

Costs tracking #3

Open RafalWilinski opened 1 month ago

RafalWilinski commented 1 month ago

It would be fun to see how much each interaction costs in terms of LLM inference.

RihanArfan commented 1 month ago

The new pricing model should make it a lot easier to calculate :) https://blog.cloudflare.com/workers-ai-bigger-better-faster/#:~:text=New%20Workers%20AI%20pricing