kalanchan closed this 7 months ago
We should roll this into the retention testing work.
Summary: I suspect that user preference comes into play with the "best-in-class" models. We should test which default model setting yields the greatest user retention. The added cost of a better model can be weighed against the ROI of providing a better experience. If a cheaper model gives the same user retention, we should minimize cost. If a better model is more expensive but improves UX enough to raise retention to a level we think is worth the cost, great.
At a minimum, we should upgrade to the Claude 3.0 Sonnet model as the baseline (no A/B needed). Then we should test the two best-in-class models against that baseline.
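To make the cost vs. retention tradeoff above concrete, here is a rough sketch of the decision rule. Everything in it is an assumption for illustration: the `Variant` fields, the `value_per_retained_user` figure, and the idea of collapsing the comparison into a single net-value number are hypothetical and not something we have agreed on. Real retention and cost numbers would come from the A/B test.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Variant:
    """One default-model arm of the test (all fields are hypothetical)."""
    name: str
    retention: float      # fraction of users retained over the test period
    cost_per_user: float  # model cost per user over the same period


def net_value(variant: Variant, value_per_retained_user: float) -> float:
    """Expected value per user: retention value minus model cost."""
    return variant.retention * value_per_retained_user - variant.cost_per_user


def pick_default(variants: list[Variant], value_per_retained_user: float) -> Variant:
    """Pick the arm with the highest net value; on a tie, prefer the cheaper one."""
    return max(
        variants,
        key=lambda v: (net_value(v, value_per_retained_user), -v.cost_per_user),
    )
```

With equal retention the cheaper arm wins, and a pricier arm only wins if its retention lift is worth more than its extra cost at the assumed `value_per_retained_user`. This ignores statistical significance, which the actual test would need to account for.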
closing this in favour of claude3
@beyang we chatted about this during the grooming session. Could you provide a summary of what you're thinking here? We should also loop in @chillatom, as he's working on things that affect retention and this could be one of them.