NASA-IMPACT evalem issues

NASA-IMPACT / evalem

An evaluation framework for your large model pipelines

issues

Update README with documentation

#38 NISH1001 closed 8 months ago
0
Add sentence similarity metric using Sentence Transformers

#37 NISH1001 opened 11 months ago
0
[WIP] Experiments with BART-based Generative QA

#36 NISH1001 opened 1 year ago
1
[result] Askathon QA comparison table for distilbert and nasa-v6

#35 NISH1001 opened 1 year ago
0
Build comparison table

#34 NISH1001 closed 1 year ago
1
Addition of onnx runtime to evalem.nlp and minor bug fixes

#33 NISH1001 closed 1 year ago
0
Add "score" key to metric result dict

#32 NISH1001 closed 1 year ago
0
Addition of onnx runtime for huggingface language models

#31 NISH1001 closed 1 year ago
1
render RTD (readthedocs) documentation for evalem

#30 NISH1001 opened 1 year ago
1
onnx runtime for Question Answering in evalem.nlp.models

#29 NISH1001 closed 1 year ago
4
add CV metrics to evalem

#28 NISH1001 opened 1 year ago
1
CV and NLP submodule refactor

#27 rbavery closed 1 year ago
14
Add Continuous Integration workflow to run unit tests

#26 weiji14 closed 1 year ago
1
segregate nlp and non-nlp (cv) namespace in evalem

#25 NISH1001 closed 1 year ago
4
setup python submodules with pyproject.toml and hatch? or other package management tool

#24 rbavery closed 1 year ago
0
setup CI/CD with github actions

#23 rbavery closed 1 year ago
6
storing datasets for computer vision evaluation

#22 rbavery opened 1 year ago
1
find examples of nn modules to test if they can be exported to torchscript and export burn scar model

#21 rbavery closed 1 year ago
14
[feature request] Langchain-based quality evaluation for generated QA pairs

#20 NISH1001 opened 1 year ago
0
Deciding on features for evalem.cv

#19 rbavery opened 1 year ago
4
integration with huggingface transformers?

#18 rbavery closed 8 months ago
1
evalem.cv submodule location

#17 rbavery closed 1 year ago
4
Bugfix setup.py and down/upgrade packages

#16 NISH1001 closed 1 year ago
0
[feature request] Generate comparison table for the same set of metrics but different models

#15 NISH1001 closed 1 year ago
3
[feature request] Cache predictions in the evaluation pipeline

#14 NISH1001 opened 1 year ago
2
[WIP] Experiment v0.0.3a with NASA v6 model

#13 NISH1001 closed 1 year ago
10
[alpha] Addition of more semantic metrics

#12 NISH1001 closed 1 year ago
1
[alpha] Addition of simple pipeline abstraction

#11 NISH1001 closed 1 year ago
0
[alpha] Addition of test suites for QA and text classification model wrapper

#10 NISH1001 closed 1 year ago
0
Addition of unit tests for metrics

#9 NISH1001 closed 1 year ago
0
[alpha] Improvements to ModelWrapper and better QA/Classification implementation

#8 NISH1001 closed 1 year ago
0
[feature/improvements] List of metrics to implement for NLP

#7 NISH1001 opened 1 year ago
0
[alpha] Implementation of Semantic metrics and more basic metrics

#6 NISH1001 closed 1 year ago
1
[feature/improvement] Add unit tests

#5 NISH1001 closed 1 year ago
2
[feature request] Task-based data format implementation

#4 NISH1001 opened 1 year ago
0
[improvement] Efficient way to convert predictions/references format into Jury

#3 NISH1001 closed 1 year ago
0
[alpha] Implementation of model wrapper, metrics and evaluators

#2 NISH1001 closed 1 year ago
0
Initial evaluator implementation

#1 NISH1001 closed 1 year ago
2