Future-House / paper-qa

High accuracy RAG for answering questions from scientific documents with citations
Apache License 2.0

Pinpoint raw evidence chunks #464

Open athena-yao opened 3 weeks ago

athena-yao commented 3 weeks ago

I want to be able to pinpoint the exact sentence(s) in a document that data was extracted from so that I can cross-check for accuracy. answer.context gives summarized chunks (text that has already been run through the summary LLM) but not the raw source text. I've also tried calling await docs.aget_evidence and the gather_evidence tool without success. How can I get the raw evidence chunks associated with answers?
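
For reference, a minimal sketch of what I'm doing now (assuming a Docs object already populated with papers):

answer = await docs.aquery("Your question here")
for c in answer.contexts:
    print(c.context)  # summarized chunk produced by the summary LLM, not the raw source text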

dosubot[bot] commented 3 weeks ago

Hey @athena-yao! I'm here to help you with any bugs, questions, or contributions you have. Let's tackle this together!

To get the raw evidence chunks associated with an answer, run the evidence-gathering step and then read each Context's text attribute: every Context in answer.contexts keeps a reference to the original Text chunk it was summarized from, so the raw chunk remains available even though the context string itself is produced by the summary LLM. Here is an example using the GatherEvidence tool:

from paperqa import Docs, Settings
from paperqa.agents.env import EnvironmentState
from paperqa.agents.tools import GatherEvidence
from paperqa.llms import EmbeddingModel, LiteLLMEmbeddingModel, LiteLLMModel
from paperqa.types import Answer, Text

async def get_raw_evidence(docs: Docs, query: str, settings: Settings, embedding_model: EmbeddingModel, summary_llm_model: LiteLLMModel) -> list[Text]:
    state = EnvironmentState(docs=docs, answer=Answer(question=query))
    gather_evidence_tool = GatherEvidence(settings=settings, embedding_model=embedding_model, summary_llm_model=summary_llm_model)
    await gather_evidence_tool.gather_evidence(query, state)
    # Each Context keeps a reference to the original Text chunk it was summarized from
    return [context.text for context in state.answer.contexts]

# Usage
docs = Docs()  # add your documents with await docs.aadd(...) first
query = "Your question here"
settings = Settings()
embedding_model = LiteLLMEmbeddingModel()
summary_llm_model = LiteLLMModel(name=settings.summary_llm)
raw_evidence_chunks = await get_raw_evidence(docs, query, settings, embedding_model, summary_llm_model)
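
Each returned Text is the exact chunk that was retrieved, so you can print or search it directly when cross-checking a sentence. A minimal follow-up sketch (using the raw_evidence_chunks variable from above):

for text in raw_evidence_chunks:
    print(text.name)  # which document and chunk the evidence came from
    print(text.text)  # the raw chunk, before any summarization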

In other words, text.text holds the unsummarized chunk and text.doc points back to the source Doc, so you can trace each piece of evidence to the exact passage it was extracted from [1][2].
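
If you'd rather not construct the tool yourself, a simpler sketch of the same idea goes through Docs.aget_evidence directly (assuming docs is already populated via await docs.aadd(...); argument names may differ slightly across paper-qa versions):

answer = await docs.aget_evidence(Answer(question="Your question here"), settings=settings)
for context in answer.contexts:
    print(context.score)      # relevance score assigned during evidence gathering
    print(context.context)    # summarized chunk
    print(context.text.text)  # raw chunk the summary was generated from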

To continue talking to Dosu, mention @dosu.