explodinggradients / ragas

Evaluation framework for your Retrieval Augmented Generation (RAG) pipelines
https://docs.ragas.io
Apache License 2.0
5.66k stars 529 forks

[R-263] Roadmap - v0.2 #1009

Open jjmachan opened 1 month ago

jjmachan commented 1 month ago

image

From SyncLinear.com | R-263

rajib76 commented 1 month ago

We lack chunk quality metrics as of today. It would be good to see some chunk quality evaluation metrics.

jjmachan commented 4 weeks ago

hey @rajib76, thanks for chipping in 🙂

could you explain a bit more about how you're measuring quality here? maybe an example too if possible?

rajib76 commented 4 weeks ago

One of the hard problems in RAG today is determining the right chunk size. If a chunk talks about multiple concepts, it is very difficult to find the most relevant chunk for a question. I was looking for a metric that tells whether a chunk is atomic, i.e. that it talks about only one concept. The semantic chunking approach did not work, as the embedding model itself has a semantic dissonance.
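
To make the idea of an "atomic" chunk concrete, here is a minimal sketch of one possible proxy. It is not an existing ragas metric: the names `sentence_tokens` and `atomicity_score` are hypothetical, and the approach (mean pairwise word overlap between the sentences of a chunk) is only an assumption about how such a score could be defined. It deliberately uses plain lexical overlap rather than embeddings, since the comment above reports that embedding-based chunking did not work well.

```python
import re
from itertools import combinations


def sentence_tokens(chunk: str) -> list[set[str]]:
    """Split a chunk into sentences and return each sentence's set of lowercase word tokens."""
    sentences = [s for s in re.split(r"(?<=[.!?])\s+", chunk.strip()) if s]
    token_sets = [set(re.findall(r"[a-z0-9]+", s.lower())) for s in sentences]
    # Drop sentences that contain no word tokens at all.
    return [tokens for tokens in token_sets if tokens]


def atomicity_score(chunk: str) -> float:
    """Hypothetical proxy for chunk atomicity (an assumption, not a ragas API).

    Returns the mean pairwise Jaccard overlap between the sentences of the
    chunk, in [0.0, 1.0]. Higher values suggest the sentences share vocabulary
    and so may be about a single concept.
    """
    token_sets = sentence_tokens(chunk)
    if len(token_sets) < 2:
        # Fewer than two sentences: treated as trivially atomic.
        return 1.0
    overlaps = [len(a & b) / len(a | b) for a, b in combinations(token_sets, 2)]
    return sum(overlaps) / len(overlaps)


focused = "The cache stores recent query results. The cache evicts old query results."
mixed = "The cache stores recent query results. Bananas grow in tropical climates."

print(atomicity_score(focused))
print(atomicity_score(mixed))
```

A chunk whose sentences share vocabulary scores higher than one that mixes unrelated topics. This is a crude signal: stop words inflate the overlap, and paraphrases of the same concept can score low, so a usable metric would likely need something stronger (for example, an LLM judging how many distinct concepts a chunk covers).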