astronomer / ask-astro

An end-to-end LLM reference implementation providing a Q&A interface for Airflow and Astronomer
https://ask.astronomer.io/
Apache License 2.0

Test and evaluate the Ask-Astro after hybrid search #148

Closed sunank200 closed 8 months ago

sunank200 commented 10 months ago

Test and evaluate the Ask-Astro results after the following:

Depends on: #147

vatsrahul1001 commented 10 months ago

Results after the hybrid search implementation: Total tests - 27, Pass - 13, Fail - 14

Results sheet

As per the results, responses are very inconsistent, and the same question sometimes gets opposite answers.

Raised the following issues:

  1. https://github.com/astronomer/ask-astro/issues/167
  2. https://github.com/astronomer/ask-astro/issues/168
  3. https://github.com/astronomer/ask-astro/issues/169

sunank200 commented 10 months ago

Conclusion from testing by @vatsrahul1001 @pankajkoti :

  1. We should consider rolling back the hybrid approach, as both the sources and the responses have degraded in quality. The results are in the link. Hence, we are rolling back the deployment to the state with Airflow docs.

vatsrahul1001 commented 10 months ago

Closing this task: as per the above results, we are not going with hybrid search for the 28th release.