Open cloudronin opened 1 week ago
We need to check in the benchmarking code as a tool/notebook that can be run against the local pythia deployment to measure how well Pythia is doing on various benchmark datasets
We need to check in the benchmarking code as a tool/notebook that can be run against the local pythia deployment to measure how well Pythia is doing on various benchmark datasets