Open antiagainst opened 2 years ago
I found that the max error was high like 6.36. According to the comment, it's expected to have high max error because of quantized Softmax issues.
The error is bumped to 8.559662 after the integration. @rsuderman are there changes on frontend side might change the input IR which leads to the numeric issue recently? Also, is the error acceptable?
cc @bjacob @mariecwhite for visibility, since you've participated in the discussion of error values in https://github.com/iree-org/iree/pull/9337.
The artifacts can be downloaded from https://storage.googleapis.com/iree-shared-files/nod-perf/hanchung/issue_9796.zip
To repro:
$ iree-compile --iree-mlir-to-vm-bytecode-module --iree-hal-target-backends=dylib-llvm-aot -iree-input-type=tosa ~/mobilebert_quant_tosa.mlir -o /tmp/a.vmfb
$ iree-run-module --module_file=/tmp/a.vmfb --device=local-sync --entry_function=main --function_input=@/tmp/mobilebert_quant/download/input0.npy --function_input=@/tmp/mobilebert_quant/download/input1.npy --function_input=@/tmp/mobilebert_quant/download/input2.npy
Things that would help decide as compromises between certainty and engineering effort - ordered from quickest to most certainty:
@bjacob @hanhanW Can we add a priority to this? P1?
Good idea, set P1 : we should either know the cause of this regression or make a conscious decision not to invest in that :-) Given @hanhanW 's clean reproduction steps above, it should be feasible to at least get the bisection.
The error goes down after some integrates... new error range:
I0720 01:12:07.200919 140737350333696 test_util.py:94] Max error (0): 6.603168
I0720 01:12:07.201174 140737350333696 test_util.py:94] Max error (1): 8.559662
Benoit has good point, is this something we are concerned about and digging in?
@antiagainst @hanhanW Is this still active?
I think it's still active, but maybe we can set it to P2.
The following tests fail after integration #9790:
More relavent logs:
The test is disabled to push forward integration.