Closed NathanHB closed 1 month ago
It is currently cumbersome to log details of what is happening in metric functions, log judge prompt in llm as judge metric for example. Passing it the evaluation tracker would greatly simply this.
It is currently cumbersome to log details of what is happening in metric functions, log judge prompt in llm as judge metric for example. Passing it the evaluation tracker would greatly simply this.