Striveworks valor issues

Striveworks / valor

Valor is a centralized evaluation store which makes it easy to measure, explore, and rank model performance.

https://striveworks.github.io/valor/

Other

38 stars 4 forks

issues

Add as_dict to object detection

#774 czaloom closed 1 month ago
0
Lite Semantic Segmentation

#773 czaloom closed 1 month ago
0
Add label maps for detection tasks to `lite`

#772 ntlind closed 1 month ago
0
Small Fixes for Lite ObjDet

#771 czaloom closed 1 month ago
0
ENH: ability to add datum metadata after datum creation

#770 MattMcClainStrive opened 1 month ago
0
BUG: `lite` doesn't count true positives if there is a false positive with an equal score

#769 ntlind closed 1 month ago
1
ObjDet Confusion Matrices

#768 czaloom closed 1 month ago
0
BugFix Valor Lite Filtering

#767 czaloom closed 1 month ago
0
ENH: Data filtering should affect returned metrics.

#766 czaloom closed 1 month ago
0
Add Metrics to return to Lite

#765 czaloom closed 1 month ago
0
valor lite objdet detailed perf

#764 czaloom closed 1 month ago
0
BUG: Add test to `lite` checking for classification bug

#763 ntlind closed 1 month ago
1
Allow users to pre-populate joint_df for the Valor text generation streaming manager

#762 bnativi closed 2 months ago
0
Add Bounding Box Examples to Detailed Metrics

#761 czaloom closed 2 months ago
0
Added Obj Det tests to Lite

#760 czaloom closed 2 months ago
0
Enable Bitmasks and Polygons in `lite`

#759 ntlind closed 1 month ago
0
Add rasters and polygons to `lite`

#758 ntlind closed 2 months ago
0
BUG: `lite` Counts are incorrect for a core test case

#757 ntlind closed 2 months ago
1
BUG: `lite` AR calculations don't match the API for one test

#756 ntlind closed 2 months ago
1
bugfix for ranked pairing in lite

#755 czaloom closed 2 months ago
0
BUG: `lite` fails an AP test from `core`, likely due to an issue with selecting the "best pairs"

#754 ntlind closed 2 months ago
0
Fix type issue with valor-core text gen metrics

#753 bnativi closed 2 months ago
0
Make OpenAI and Mistral optional dependencies of valor core

#752 bnativi closed 2 months ago
0
Fix Detailed Counts in Valor-Lite

#751 czaloom closed 2 months ago
0
ENH: ability to search models and datasets

#750 MattMcClainStrive opened 2 months ago
0
Lite Classification

#749 czaloom closed 1 month ago
0
Numpy-based Object Detection for Bounding Boxes

#748 czaloom closed 2 months ago
0
Add Text Generation Metrics to Valor Core

#747 bnativi closed 2 months ago
0
Numpy Implementation

#746 czaloom closed 2 months ago
0
BUG: valor_core Average Recall metric does not report correct IoU thresholds.

#745 czaloom closed 1 month ago
0
Fix DetailedPRCurve examples for classification tasks

#744 ntlind closed 2 months ago
0
Handle bad llm response with retries (llm-guided metrics)

#743 bnativi closed 1 month ago
1
Add BLEU Smoothing Function

#742 bnativi opened 2 months ago
0
Add smoothing function as a metric parameter for the BLEU text generation metric

#741 bnativi opened 2 months ago
1
Fix bug in `valor_core` where some true positives weren't being correctly deassigned

#740 ntlind closed 2 months ago
0
BUG: Valor Core Counting Datums with other label keys as true negative

#739 jqu-striveworks closed 1 month ago
2
BUG: Valor Core Classification Fix Future Warning

#738 jqu-striveworks closed 1 month ago
0
BUG: Valor Core Classification Mislabels Prediction FP, FN, TN for examples.

#737 jqu-striveworks closed 2 months ago
0
BUG: Valor Core mAP is not calculated correctly when there are predictions for a class, but no ground truths

#736 jqu-striveworks closed 1 month ago
2
BUG: Valor Core Detection Incorrectly Assigns True Positives

#735 jqu-striveworks closed 2 months ago
2
WIP Optimized Detection Metrics

#734 jqu-striveworks closed 2 months ago
2
Fix GitHub workflows

#733 ntlind closed 2 months ago
0
improvements to bias external integration tests

#732 bnativi closed 2 months ago
0
Vectorize IOU calcs for axis-aligned bboxes

#731 ntlind closed 2 months ago
0
improve toxicity external integration tests

#730 bnativi closed 2 months ago
0
Update API Benchmarks

#729 czaloom closed 2 months ago
0
Add Retries for LLM-Guided Metrics

#728 bnativi closed 1 month ago
0
Unaccounted Time in Valor-Core Benchmark

#727 czaloom closed 2 months ago
0
Fix OD benchmarks and disallow calculating DetailedPRCurves when using AnnotationType.RASTER

#726 ntlind closed 2 months ago
0
ignore me

#725 jqu-striveworks closed 2 months ago
0