Closed teetone closed 9 months ago
Assigning to Percy as reminder/placeholder until we find someone else to take on.
@Tiiiger Maybe you could post updates here, so we can update this as needed. Will keep Percy as the assignee for now, but can switch over at some point if it makes sense.
@teetone @percyliang @teetone Status:
Try to evaluate the perplexity of common benchmark test/validation set of AI21 models.
A lot of test sets are too big to evaluate directly so I am looking into the convergence of subsampling.
@Tiiiger Just to track the status so everyone's on board:
Is this right/anything else to add (incl. the experiments/figures whenever you get a chance)?
(Note: Moving this to P2 for now.)
transferring to @fladhak
also i just realized i was a complete idiot pushing these to main
directly. Should have created a separate branch and do PR. sorry for this.
Closing because contamination tracking is deprecated.
Pilot: