NASA-IMPACT/evalem
An evaluation framework for your large model pipelines
0 stars, 0 forks
issues
Newest
Update README with documentation
#38
NISH1001
closed
8 months ago
0
Add sentence similarity metric using Sentence Transformers
#37
NISH1001
opened
11 months ago
0
[WIP] Experiments with BART-based Generative QA
#36
NISH1001
opened
1 year ago
1
[result] Askathon QA comparison table for distilbert and nasa-v6
#35
NISH1001
opened
1 year ago
0
Build comparison table
#34
NISH1001
closed
1 year ago
1
Addition of onnx runtime to evalem.nlp and minor bug fixes
#33
NISH1001
closed
1 year ago
0
Add "score" key to metric result dict
#32
NISH1001
closed
1 year ago
0
Addition of onnx runtime for huggingface language models
#31
NISH1001
closed
1 year ago
1
render RTD (readthedocs) documentation for evalem
#30
NISH1001
opened
1 year ago
1
onnx runtime for Question Answering in evalem.nlp.models
#29
NISH1001
closed
1 year ago
4
add CV metrics to evalem
#28
NISH1001
opened
1 year ago
1
CV and NLP submodule refactor
#27
rbavery
closed
1 year ago
14
Add Continuous Integration workflow to run unit tests
#26
weiji14
closed
1 year ago
1
segregate nlp and non-nlp (cv) namespace in evalem
#25
NISH1001
closed
1 year ago
4
setup python submodules with pyproject.toml and hatch? or other package management tool
#24
rbavery
closed
1 year ago
0
setup CI/CD with github actions
#23
rbavery
closed
1 year ago
6
storing datasets for computer vision evaluation
#22
rbavery
opened
1 year ago
1
find examples of nn modules to test if they can be exported to torchscript and export burn scar model
#21
rbavery
closed
1 year ago
14
[feature request] Langchain-based quality evaluation for generated QA pairs
#20
NISH1001
opened
1 year ago
0
Deciding on features for evalem.cv
#19
rbavery
opened
1 year ago
4
integration with huggingface transformers?
#18
rbavery
closed
8 months ago
1
evalem.cv submodule location
#17
rbavery
closed
1 year ago
4
Bugfix setup.py and down/upgrade packages
#16
NISH1001
closed
1 year ago
0
[feature request] Generate comparison table for the same set of metrics but different models
#15
NISH1001
closed
1 year ago
3
[feature request] Cache predictions in the evaluation pipeline
#14
NISH1001
opened
1 year ago
2
[WIP] Experiment v0.0.3a with NASA v6 model
#13
NISH1001
closed
1 year ago
10
[alpha] Addition of more semantic metrics
#12
NISH1001
closed
1 year ago
1
[alpha] Addition of simple pipeline abstraction
#11
NISH1001
closed
1 year ago
0
[alpha] Addition of test suites for QA and text classification model wrapper
#10
NISH1001
closed
1 year ago
0
Addition of unit tests for metrics
#9
NISH1001
closed
1 year ago
0
[alpha] Improvements to ModelWrapper and better QA/Classification implementation
#8
NISH1001
closed
1 year ago
0
[feature/improvements] List of metrics to implement for NLP
#7
NISH1001
opened
1 year ago
0
[alpha] Implementation of Semantic metrics and more basic metrics
#6
NISH1001
closed
1 year ago
1
[feature/improvement] Add unit tests
#5
NISH1001
closed
1 year ago
2
[feature request] Task-based data format implementation
#4
NISH1001
opened
1 year ago
0
[improvement] Efficient way to convert predictions/references format into Jury
#3
NISH1001
closed
1 year ago
0
[alpha] Implementation of model wrapper, metrics and evaluators
#2
NISH1001
closed
1 year ago
0
Initial evaluator implementation
#1
NISH1001
closed
1 year ago
2