-
**Is your feature request related to a problem? Please describe.**
Annotate via phoenix app to build golden datasets or manual evals
**Describe the solution you'd like**
Was wondering if span o…
-
**Describe the bug**
The Similarity Evaluator returns NaN because the first token of the model response is 'Text' and promptflow is not able to convert the text to an integer.
**How To Reproduce the bug**
St…
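The failure mode above (a numeric score prefixed by words, which breaks a naive `int(response)` conversion) can be worked around with a more tolerant parser. This is a generic sketch, not promptflow's actual implementation; the helper name `parse_score` is hypothetical.

```python
import math
import re


def parse_score(response: str) -> float:
    """Extract the first numeric token from an LLM evaluation response.

    Hypothetical helper: models sometimes prefix the score with words
    (e.g. "Text: 4"), which makes a direct int()/float() cast fail.
    """
    match = re.search(r"-?\d+(?:\.\d+)?", response)
    if match is None:
        # No number anywhere in the response: surface NaN explicitly
        # rather than raising, so downstream aggregation can skip it.
        return float("nan")
    return float(match.group())


print(parse_score("Text: 4"))  # 4.0
```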
-
Each panel in the dashboard has a hard-coded eval to extract as human-readable time from epoch time. Build a macro that can accomplish this and update the dashboards to use it.
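The epoch-to-human-readable conversion the proposed macro would centralize can be sketched as follows. This is a generic Python illustration of the conversion itself, not the dashboard macro syntax; the function name `epoch_to_human` and the output format are assumptions.

```python
from datetime import datetime, timezone


def epoch_to_human(epoch: float) -> str:
    """Convert a Unix epoch timestamp to a human-readable UTC string."""
    return datetime.fromtimestamp(epoch, tz=timezone.utc).strftime(
        "%Y-%m-%d %H:%M:%S"
    )


print(epoch_to_human(0))  # 1970-01-01 00:00:00
```

Centralizing this in one macro (or helper) means a format change touches a single definition instead of every panel.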
-
Hi, thank you for the wonderful paper and codebase! I had one clarification question: it looks like there is an extra set of forward passes for the SigLIP ViT blocks - is this intentional for the sigl…
-
I suspect that I need to specify evaluator_config for evaluate in order to map the data from the target response, but there's no example of it in the docstring or in https://pypi.org/project/promptflo…
-
I have been trying to extract data (title, question answered, entities, summary) from document chunks.
I believed typed predictors would be good for this, but I keep running into "Too many retrie…
-
### Short description and motivation for the proposed feature
Idea from @Ricram2: doing `EVALUATE * FROM` would yield a table with all compatible accuracy metrics for the model being evaluated.
### …
-
get_qa_with_reference always returns None, whereas get_retrieved_documents works fine.
I always get "No spans found."
File "C:\anaconda3\Lib\site-packages\phoenix\evals\classify.py", line 354, …
-
In the config, if I've defined two prompts or two providers, I see the side-by-side results in the Web UI.
What about the situation where I or someone else has run an eval on a single prompt or prov…
-
Details here: https://github.com/openai/evals
Products that implement evals get priority access to GPT-4.