neulab/ExplainaBoard
Interpretable Evaluation for AI Systems
MIT License
359 stars
36 forks
issues
Introduce dict[str, Performance]
#511
odashi
closed
1 year ago
2
t-test confidence level is wrong
#510
odashi
closed
1 year ago
1
Is the current applicable condition of t-test correct?
#509
tetsuok
closed
1 year ago
22
Bump mypy version to 0.981
#508
tetsuok
closed
1 year ago
2
refactor get_basic_words
#507
odashi
closed
1 year ago
0
Rename GrammaticalErrorCorrection to GrammaticalErrorCorrectionProcessor
#506
tetsuok
closed
1 year ago
0
Refactor preprocessor.py
#505
odashi
closed
1 year ago
2
Rename TextClassificationProcessor to TabularClassificationProcessor
#504
tetsuok
closed
1 year ago
0
Implement MetricValues
#503
odashi
closed
1 year ago
1
Remove prop_samples from calc_confidence_interval
#502
tetsuok
closed
1 year ago
1
Implement common registry
#501
odashi
closed
1 year ago
1
Custom Analysis Todo List
#500
qjiang002
opened
1 year ago
1
Strict shape checking of `aggregate_stats` and `calc_metric_from_aggregate`
#499
odashi
closed
1 year ago
6
Separate batched/non-batched operations in Metric
#498
odashi
opened
1 year ago
0
aggregate_stats and calc_metric_from_aggregate don't work in some cases.
#497
odashi
closed
1 year ago
0
Increase the minimum sample size of bootstrapping
#496
odashi
closed
1 year ago
0
Test of analysis on open-domain QA
#495
neubig
opened
1 year ago
1
Clarified length mismatch message
#494
neubig
closed
1 year ago
0
including a list of samples in combo analysis result
#493
PaulCCCCCCH
closed
1 year ago
3
Bump version to 0.11.2
#492
OscarWang114
closed
1 year ago
4
list[(name, Thing)] to dict[name, Thing], and remove MetricConfig and directly store configs in Metric
#491
odashi
opened
1 year ago
9
Use setup.cfg to store every setup config
#490
odashi
closed
1 year ago
2
Add third_party directory and guideline.
#489
odashi
closed
1 year ago
0
Remove external_stats from MetricConfig
#488
tetsuok
closed
1 year ago
0
Use same versions of actions as ci.yml
#487
tetsuok
closed
1 year ago
0
Change the version of pypa/gh-action-pypi-publish from master to release/v1
#486
tetsuok
closed
1 year ago
0
Publish package to PyPI on push tags
#485
tetsuok
opened
1 year ago
2
Add typing information around Metric
#484
odashi
closed
1 year ago
0
enable Chinese tokenizer to work
#483
pfliu-nlp
closed
1 year ago
6
`value` should be set to `None` under empty buckets
#482
OscarWang114
opened
1 year ago
2
add more concrete documents for dataset formats when performing analyses
#481
pfliu-nlp
closed
1 year ago
2
We need documents to introduce how to fix different types of CI Errors
#480
pfliu-nlp
opened
1 year ago
0
We need a doc on how to perform unit/integration tests
#479
pfliu-nlp
opened
1 year ago
0
test_generate_system_analysis in integration_tests.summarization_test.SummarizationTest is too slow
#478
tetsuok
closed
1 year ago
12
Decompose MetricResult
#477
odashi
closed
1 year ago
4
Bucket analysis: Handle corner case where samples list is empty
#476
OscarWang114
closed
1 year ago
1
Disambiguate "conf"
#475
odashi
closed
1 year ago
1
Remove unused EXTERNAL_METRICS
#474
tetsuok
closed
1 year ago
1
Two places where tokenization should be standardized
#473
neubig
opened
1 year ago
4
Change to `~=` requirements
#472
neubig
opened
1 year ago
1
DataLab logging is too verbose in integration tests
#471
neubig
closed
1 year ago
3
Move external_stats from MetricConfig to ExternalEvalConfig
#470
tetsuok
closed
1 year ago
2
refactor AnalysisResult.print to generate_report.
#469
odashi
closed
1 year ago
0
Refactor dtype of Value feature
#468
odashi
closed
1 year ago
0
Avoid serializing functions
#467
odashi
opened
1 year ago
3
Concrete data type
#466
odashi
opened
1 year ago
2
Document how custom dataset should be formatted
#465
pfliu-nlp
opened
1 year ago
0
Document how to add customized features through the JSON config file
#464
pfliu-nlp
opened
1 year ago
0
Data hosting for APE task is unstable
#463
neubig
closed
1 year ago
3
Refactor FeatureType
#462
odashi
closed
1 year ago
2