run-llama / llama_index

LlamaIndex is a data framework for your LLM applications
https://docs.llamaindex.ai
MIT License

[Question]: How are citations generated in the response of CitationQueryEngine? #13536

Open mw19930312 opened 2 months ago

mw19930312 commented 2 months ago

Question

I understand that in a CitationQueryEngine, we retrieve a set of reference nodes from a vector index, and these nodes are then passed into a synthesizer to generate the final response. How are the citations themselves generated in that response?

dosubot[bot] commented 2 months ago

Hey there @mw19930312! Great to see you diving into the intricacies of the CitationQueryEngine again. 😄 I remember our last brainstorming session here; always a pleasure to assist a fellow knowledge seeker. I'm on it - gonna unravel the citation generation mystery for you in just a jiffy!

logan-markewich commented 2 months ago

The nodes are labeled as Source 1, Source 2, etc., and the LLM is then prompted to cite which sources it used.
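In rough terms, the labeling step looks like the sketch below. This is a hypothetical illustration of the idea, not the actual llama_index implementation; the function name `format_sources` is an assumption.

```python
# Hypothetical sketch (not llama_index's actual code) of how retrieved
# chunks are numbered before being sent to the LLM.
def format_sources(chunks):
    """Label each chunk "Source N:" so the LLM can cite it by number."""
    return "\n\n".join(
        f"Source {i}:\n{text}" for i, text in enumerate(chunks, start=1)
    )

context = format_sources(
    ["LlamaIndex is a data framework.", "It supports citation query engines."]
)
print(context.splitlines()[0])  # prints "Source 1:"
```

The numbered context is then placed into a citation prompt that asks the LLM to reference sources by these labels.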

mw19930312 commented 2 months ago

Would you mind pointing me to where the LLM is prompted? I'm wondering how to pass citation URLs, if there are any. Also, if I understand correctly, the nodes are formatted through a metadata conversion template, something like the following, right?

```
[source 1]
context:
metadata:
    xxx: xxx
    url: url
```

mw19930312 commented 2 months ago

@logan-markewich ^^

dosubot[bot] commented 2 months ago

In the CitationQueryEngine, the process of prompting the LLM for citations involves a few key steps:

  1. Retrieving and Formatting Nodes: Nodes are retrieved from a vector index and formatted as citation sources. Each node is assigned a label (e.g., Source 1, Source 2) alongside its text and any relevant metadata, such as URLs, giving the LLM a detailed, numbered context.

  2. Synthesizing the Response: The formatted nodes are passed to the LLM together with a citation prompt template, which instructs the LLM to reference the sources by their assigned labels. The final response therefore includes citations that make clear which parts are supported by which sources.

  3. Metadata and URLs: Whether URLs and other metadata appear in the citation text is controlled by the metadata_mode parameter, which determines how much of a node's metadata is included when it is converted into a citation source. This allows flexible control over citation detail, including citing URLs directly in the response.

This approach ensures that the final response from the LLM is not only informative but also accurately cites the sources it used, providing a clear and traceable path back to the original information.
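As a hypothetical illustration of the metadata-inclusion step described above (the function name and output format here are assumptions, not llama_index's actual node-serialization code):

```python
# Sketch of folding a node's metadata (e.g. a URL) into its citation
# text, loosely mirroring what a metadata_mode setting controls.
def node_to_citation_text(text, metadata, include_metadata=True):
    """Prepend "key: value" metadata lines to the node text when enabled."""
    if not include_metadata or not metadata:
        return text
    meta = "\n".join(f"{k}: {v}" for k, v in metadata.items())
    return f"{meta}\n\n{text}"

src = node_to_citation_text(
    "LlamaIndex supports citation query engines.",
    {"url": "https://docs.llamaindex.ai"},
)
# src now starts with "url: https://docs.llamaindex.ai"
```

With metadata included this way, the LLM sees the URL as part of the numbered source and can surface it in its answer.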

#### Details

**Sources**

- [llama-index-core/llama_index/core/query_engine/citation_query_engine.py](https://github.com/run-llama/llama_index/blob/main/llama-index-core/llama_index/core/query_engine/citation_query_engine.py)
- [llama-index-core/llama_index/core/schema.py](https://github.com/run-llama/llama_index/blob/main/llama-index-core/llama_index/core/schema.py)
- [llama-index-legacy/llama_index/legacy/schema.py](https://github.com/run-llama/llama_index/blob/main/llama-index-legacy/llama_index/legacy/schema.py)
- [docs/docs/examples/metadata_extraction/MetadataExtractionSEC.ipynb](https://github.com/run-llama/llama_index/blob/main/docs/docs/examples/metadata_extraction/MetadataExtractionSEC.ipynb)
- [llama-index-legacy/llama_index/legacy/query_engine/citation_query_engine.py](https://github.com/run-llama/llama_index/blob/main/llama-index-legacy/llama_index/legacy/query_engine/citation_query_engine.py)
- [llama-index-legacy/tests/postprocessor/test_metadata_replacement.py](https://github.com/run-llama/llama_index/blob/main/llama-index-legacy/tests/postprocessor/test_metadata_replacement.py)
- [llama-index-legacy/llama_index/legacy/response_synthesizers/google/generativeai/base.py](https://github.com/run-llama/llama_index/blob/main/llama-index-legacy/llama_index/legacy/response_synthesizers/google/generativeai/base.py)
- [llama-index-legacy/llama_index/legacy/indices/struct_store/sql_retriever.py](https://github.com/run-llama/llama_index/blob/main/llama-index-legacy/llama_index/legacy/indices/struct_store/sql_retriever.py)
- [llama-index-legacy/llama_index/legacy/postprocessor/metadata_replacement.py](https://github.com/run-llama/llama_index/blob/main/llama-index-legacy/llama_index/legacy/postprocessor/metadata_replacement.py)

logan-markewich commented 2 months ago

@mw19930312

prompt: https://github.com/run-llama/llama_index/blob/287c3df887792035c9605ae87234903c2ca68795/llama-index-core/llama_index/core/query_engine/citation_query_engine.py#L32

code entry point: https://github.com/run-llama/llama_index/blob/287c3df887792035c9605ae87234903c2ca68795/llama-index-core/llama_index/core/query_engine/citation_query_engine.py#L266
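Once the LLM answers with inline markers like [1] and [2], mapping them back to the numbered source nodes is straightforward. A hypothetical helper (not part of llama_index, where the returned response object already carries its source_nodes):

```python
import re

def cited_sources(response_text):
    """Extract the distinct source numbers cited as "[N]" in a response."""
    return sorted({int(m) for m in re.findall(r"\[(\d+)\]", response_text)})

print(cited_sources("LlamaIndex is a data framework [1], with citations [2]."))
# [1, 2]
```

Each extracted number can then be used to look up the corresponding retrieved node, including any URL stored in its metadata.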