Closed sunank200 closed 10 months ago
Is there good data in the database? —> Use Weaviate client directly or streamlit app Is the frontend retrieving it properly? → The current approach is fine where we as questions to slack directly Scenarios both are positive both negative → problem with documents we are embedding or ingestion —> log issue for this DB is good but not front end → problem with conversation retrival ??????? —> log issue for this Frontend is good but DB is bad → log a bug 70% positive - good enough for us to go ahead. % for postive response % with postive result in top 3 document sources Average of both More relevant questions from langsmith - with correctness check (bad and good answers)
70% positive - good enough for us to go ahead.
More relevant questions from langsmith - with correctness check (bad and good answers)
@vatsrahul1001 has created this notion doc with test plan. @vatsrahul1001 please add more details as required
Test pipeline for this is at https://github.com/mpgreg/ask-astro/blob/add_baseline_test/airflow/dags/monitor/test_baseline.py with test logic at https://github.com/mpgreg/ask-astro/blob/add_baseline_test/airflow/include/tasks/utils/retrieval_tests.py
Latest Test results : https://docs.google.com/spreadsheets/d/13cVqNikix82YjCPA4t0XaULg3XccBnvrQUmQa9VwgC0/edit#gid=1200545478
This can be closed right @vatsrahul1001 ?
summary Total Test Cases: 52 Passed Test Cases: 45 Failed Test Cases: 7
More context