Closed sunank200 closed 10 months ago
@Lee-W you can start the discussion on this on the ask-astro-dev channel. Input from Michael, Steven and Julian is useful. The goal is to scope this task and implement that.
Let's just start with number of traces (i.e. root runs = user requests) broken down by success/failure, and the Avg. correctness score, by day. We don't need historical data. Totally fine to just start collecting this now. I'd like to keep LangSmith around for troubleshooting.
@Lee-W please contact Steven for where those metrics should land in snowflake.
Discussed with the data team and concluded that we will create a separate DB, waiting on IT to create that
@Lee-W to follow up with Josh Fell and make progress on this
Thanks to Josh. We already have snowflake DB created but will still need IT's help creating snowflake account
As informed earlier today, we'll need to rewrite it into airflow DAG.
Feature Description: We are currently tracking various metrics through Langsmith, but we plan to transition away from Langsmith in the future. To facilitate this shift, we need to start ingesting all metrics currently tracked by Langsmith into Snowflake. This will allow us to maintain and analyze our data more effectively. The goal is to establish a more formal and centralized system for tracking metrics, including counts of successes and tests.
Proposed Solution: