issues
search
Striveworks
/
valor
Valor is a centralized evaluation store which makes it easy to measure, explore, and rank model performance.
https://striveworks.github.io/valor/
Other
38
stars
4
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Add as_dict to object detection
#774
czaloom
closed
1 month ago
0
Lite Semantic Segmentation
#773
czaloom
closed
1 month ago
0
Add label maps for detection tasks to `lite`
#772
ntlind
closed
1 month ago
0
Small Fixes for Lite ObjDet
#771
czaloom
closed
1 month ago
0
ENH: ability to add datum metadata after datum creation
#770
MattMcClainStrive
opened
1 month ago
0
BUG: `lite` doesn't count true positives if there is a false positive with an equal score
#769
ntlind
closed
1 month ago
1
ObjDet Confusion Matrices
#768
czaloom
closed
1 month ago
0
BugFix Valor Lite Filtering
#767
czaloom
closed
1 month ago
0
ENH: Data filtering should affect returned metrics.
#766
czaloom
closed
1 month ago
0
Add Metrics to return to Lite
#765
czaloom
closed
1 month ago
0
valor lite objdet detailed perf
#764
czaloom
closed
1 month ago
0
BUG: Add test to `lite` checking for classification bug
#763
ntlind
closed
1 month ago
1
Allow users to pre-populate joint_df for the Valor text generation streaming manager
#762
bnativi
closed
2 months ago
0
Add Bounding Box Examples to Detailed Metrics
#761
czaloom
closed
2 months ago
0
Added Obj Det tests to Lite
#760
czaloom
closed
2 months ago
0
Enable Bitmasks and Polygons in `lite`
#759
ntlind
closed
1 month ago
0
Add rasters and polygons to `lite`
#758
ntlind
closed
2 months ago
0
BUG: `lite` Counts are incorrect for a core test case
#757
ntlind
closed
2 months ago
1
BUG: `lite` AR calculations don't match the API for one test
#756
ntlind
closed
2 months ago
1
bugfix for ranked pairing in lite
#755
czaloom
closed
2 months ago
0
BUG: `lite` fails an AP test from `core`, likely due to an issue with selecting the "best pairs"
#754
ntlind
closed
2 months ago
0
Fix type issue with valor-core text gen metrics
#753
bnativi
closed
2 months ago
0
Make OpenAI and Mistral optional dependencies of valor core
#752
bnativi
closed
2 months ago
0
Fix Detailed Counts in Valor-Lite
#751
czaloom
closed
2 months ago
0
ENH: ability to search models and datasets
#750
MattMcClainStrive
opened
2 months ago
0
Lite Classification
#749
czaloom
closed
1 month ago
0
Numpy-based Object Detection for Bounding Boxes
#748
czaloom
closed
2 months ago
0
Add Text Generation Metrics to Valor Core
#747
bnativi
closed
2 months ago
0
Numpy Implementation
#746
czaloom
closed
2 months ago
0
BUG: valor_core Average Recall metric does not report correct IoU thresholds.
#745
czaloom
closed
1 month ago
0
Fix DetailedPRCurve examples for classification tasks
#744
ntlind
closed
2 months ago
0
Handle bad llm response with retries (llm-guided metrics)
#743
bnativi
closed
1 month ago
1
Add BLEU Smoothing Function
#742
bnativi
opened
2 months ago
0
Add smoothing function as a metric parameter for the BLEU text generation metric
#741
bnativi
opened
2 months ago
1
Fix bug in `valor_core` where some true positives weren't being correctly deassigned
#740
ntlind
closed
2 months ago
0
BUG: Valor Core Counting Datums with other label keys as true negative
#739
jqu-striveworks
closed
1 month ago
2
BUG: Valor Core Classification Fix Future Warning
#738
jqu-striveworks
closed
1 month ago
0
BUG: Valor Core Classification Mislabels Prediction FP, FN, TN for examples.
#737
jqu-striveworks
closed
2 months ago
0
BUG: Valor Core mAP is not calculated correctly when there are predictions for a class, but no ground truths
#736
jqu-striveworks
closed
1 month ago
2
BUG: Valor Core Detection Incorrectly Assigns True Postives
#735
jqu-striveworks
closed
2 months ago
2
WIP Optimized Detection Metics
#734
jqu-striveworks
closed
2 months ago
2
Fix GitHub workflows
#733
ntlind
closed
2 months ago
0
improvements to bias external integration tests
#732
bnativi
closed
2 months ago
0
Vectorize IOU calcs for axis-aligned bboxes
#731
ntlind
closed
2 months ago
0
improve toxicity external integration tests
#730
bnativi
closed
2 months ago
0
Update API Benchmarks
#729
czaloom
closed
2 months ago
0
Add Retries for LLM-Guided Metrics
#728
bnativi
closed
1 month ago
0
Unaccounted Time in Valor-Core Benchmark
#727
czaloom
closed
2 months ago
0
Fix OD benchmarks and disallow calculating DetailedPRCurves when using AnnotationType.RASTER
#726
ntlind
closed
2 months ago
0
ignore me
#725
jqu-striveworks
closed
2 months ago
0
Previous
Next