Open mboret opened 7 months ago
I would filter on the indexer, personally. Maybe an optional setting to include the status? The call on the confluence_client can take a list of statuses so we could support different types of filters if you wanted to include the other pages as separate knowledge sets.
I tried it with a list of statuses for a quick test and it didn't like the list, if status argument is set to the string "current" the archived pages won't be included.
Hi,
Danswer considers the Confluence archive page to be a "standard" one. So, to answer a question, it can use it. It's not a good thing IMO.
I don't know if the best way is to filter the archive pages on the indexer side or downrank the document from the sources selection...