[ENHANCEMENT] build golden datasets or manual evals

Is your feature request related to a problem? Please describe.

I would like to annotate spans directly in the Phoenix app to build golden datasets or run manual evals.

Describe the solution you'd like

Is span or trace annotation for dataset creation and/or evaluation on the Phoenix roadmap at all? We already use Phoenix for a lot of the heavy lifting with tracing and visualizing traces. However, we still export these traces and manually convert them into datasets for evals, example optimization, etc.

It would be awesome if there were a way to add manual annotations, rewrite the expected output, etc. when reviewing a span (see screenshot below). Our current plan is to load Phoenix traces offline and annotate them via doccano.

PS: Is this available in managed Arize?

[Screenshot: CleanShot 2024-05-20 at 12 38 20]
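To make the manual export-and-convert step above concrete, here is a minimal sketch of turning already-exported span records into a JSONL file ready for annotation. It does not use the Phoenix API. The record keys (`span_id`, `input`, `output`) and the empty `expected_output` / `label` fields are assumptions made for illustration, not Phoenix's or doccano's actual schema.

```python
import json


def spans_to_examples(spans):
    """Convert exported span records into dataset rows awaiting annotation.

    `spans` is a list of dicts with hypothetical keys `span_id`, `input`,
    and `output`; a real export will likely need its keys mapped first.
    """
    examples = []
    for span in spans:
        # Skip spans without an input/output pair (e.g. orchestration-only spans).
        if not span.get("input") or not span.get("output"):
            continue
        examples.append(
            {
                "span_id": span["span_id"],
                "input": span["input"],
                "output": span["output"],
                # Left empty on purpose: a reviewer fills these in by hand.
                "expected_output": None,
                "label": None,
            }
        )
    return examples


def to_jsonl(examples):
    """Serialize examples as JSON Lines, one example per line."""
    return "\n".join(json.dumps(example) for example in examples)


if __name__ == "__main__":
    exported = [
        {"span_id": "a1", "input": "What is Phoenix?", "output": "A tracing tool."},
        {"span_id": "b2", "input": "", "output": ""},  # dropped: no content
    ]
    print(to_jsonl(spans_to_examples(exported)))
```

In-app annotation would remove the need for this step: the `expected_output` and `label` fields would be edited while reviewing the span instead of in a separate tool.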

Describe alternatives you've considered

Doccano

Arize-ai / phoenix

[ENHANCEMENT] build golden datasets or manual evals #3249