evals Search Results - Githubissues

1000+ results
for evals

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

axolotl-ai-cloud/axolotl #1225

Wrong eval stages distribution when using `evals_per_epoch` …

### Please check that this issue hasn't been reported before. - [X] I searched previous [Bug Reports](https://github.com/OpenAccess-AI-Collective/axolotl/labels/bug) didn't find any similar reports. …

sadaisystems updated 1 month ago
1
openai/evals #632

Idea for Evals: Complex, multi-turn instruction-following Ev…

Hello everyone, thank you for contributions so far, I've been working through them and these tasks are forming a challenging a comprehensive benchmark for modern LLMs and LLM programs. We worked on [C…

andrew-openai updated 1 year ago
2
Arize-ai/phoenix #4289

[ENHANCEMENT] Improvements to workflow when Adding Custom LL…

**Is your feature request related to a problem? Please describe.** I can't modify the existing template + rails on the evaluator object to customize for my use case. **Describe the solution you'd…

AparnaDhinakaran updated 1 week ago
3
ComputerScienceHouse/conditional-backend #9

API: /evals/batch

Batch view / definition routes **7 Routes** --- **POST** /evals/batch _accessible to all upperclassmen_ Creates a new batch with either specified criteria or specific members If created by the eval…

joeoneil updated 12 months ago
1
shap/shap #3654

Questions: question about SamplingExplainer

### Problem Description Hi, everyone! I check the code of function `sampling_estimate`. Assume we have a data instance `x` with `M` features. - We keep the 1st to j-th feature as original, replace …

vectorsss updated 4 months ago
1
mindsdb/mindsdb #5732

[feat] Support for multiple acc evals

### Short description and motivation for the proposed feature Idea from @Ricram2: doing `EVALUATE * FROM` would yield a table with all compatible accuracy metrics for the model being evaluated. ### …

paxcema updated 3 months ago
2
understanding-search/maze-transformer #165

Add logit-based evals

Perplexity to start

valedan updated 9 months ago
2
facebookresearch/jepa #43

KeyError when running evals.main

Thanks for your brilliant work! Having downloaded K400 pretrained checkpoint file(k400-probe.pth.tar) and modified the config yaml file for the corresponding dataset(specifying datapath), I ran evals.…

JPerAsperaadAstra updated 6 months ago
1
stanfordnlp/dspy #1060

Data Extraction of multiple fields with suggestions from Typ…

I have been trying to extract data (title, question answered, entities, summary) from documents chunks. I believed typed predictors would be good for this, but I keep running into "Too many retrie…

BlueKiji77 updated 1 month ago
1
sangoma/switchy #50

Is MultiEval.evals insane?

The following is posted verbatim from @dtkerrs review of #49 with regard to the `switchy.distribute.MultiEval` interface and implementation: > So some feedback here on `evals` is that it might be ni…

goodboy updated 7 years ago
2

上一页 1...3 4 5 6 7 8 9...100 下一页

1000+ results for evals

1000+ results
for evals