mlcommons / tiny

MLPerf™ Tiny is an ML benchmark suite for extremely low-power systems such as microcontrollers
https://mlcommons.org/en/groups/inference-tiny/
Apache License 2.0
344 stars 81 forks source link

Balanced CIFAR10 Dataset #116

Closed jmduarte closed 2 years ago

jmduarte commented 2 years ago
Model Datset Accuracy
Pretrained ResNet Full test set 87.2%
Pretrained ResNet Original subset 85.5%
Pretrained ResNet Balanced subset 87.0%
Pretrained ResNet TFLite quantized Full test set 87.0%
Pretrained ResNet TFLite quantized Original subset 86.5%
Pretrained ResNet TFLite quantized Balanced subset 87.0%

@cskiraly @maltanar @jeremy-syn

github-actions[bot] commented 2 years ago

MLCommons CLA bot All contributors have signed the MLCommons CLA ✍️ ✅

maltanar commented 2 years ago

Using PERF_SAMPLE=True and QUANT_MODEL=True in tflite_test.py (which were the default values) I was able to reproduce the 87.0% accuracy with the new balanced subset.

Very minor/unrelated-to-PR comment: README specifies # Prepare Python venv (Python 3.6+ and pip>20 required) but scipy==1.6.0 in requirements.txt in fact requires Python 3.7+ as specified here so I had to install that for the reproduction:

https://scipy.github.io/devdocs/dev/toolchain.html#numpy

maltanar commented 2 years ago

I've also signed the CLA now.

cskiraly commented 2 years ago

@jmduarte can you also sign the CLA (or let me know if you have already signed) Thanks

jmduarte commented 2 years ago

@jmduarte can you also sign the CLA (or let me know if you have already signed) Thanks

@cskiraly I have signed the CLA, I think the job is failing from @maltanar. Maybe he needs to link his GitHub to the CLA appropriately?

Created a committerMap for users in PR: (signed: jmduarte-4932543, ) , (notSigned:  maltanar-1880902, )
cskiraly commented 2 years ago

@jmduarte @maltanar , what do you think, should we also update the performance threshold in https://github.com/mlcommons/tiny/blob/master/benchmark/README.md and in https://github.com/mlcommons/tiny/blob/master/benchmark/MLPerfTiny_Rules.adoc ?