precice / systemtests

Testing preCICE / solver combinations using Docker
GNU General Public License v3.0
3 stars 4 forks source link

Comparison between results of different solver/adapter versions #19

Closed shkodm closed 5 years ago

shkodm commented 5 years ago

During fixing test cases for Ubuntu 18.04 and bringing it to travis as well, I have noticed that some of the results differ. (See #17 for documentation ). Which lead me to think about more general question:

MakisH commented 5 years ago

If I understand correctly, the reference results in this case were produced with a different OpenFOAM version, compiled and ran on a different machine. Therefore, I expect them to be slightly different and, after we evaluate the importance of the difference, update the reference results we will use from now on.

Overall, we should always check for the same version of everything (the oldest one we support, I would say).

The "non-significant fluctuations in the floating point numbers" is a bit tricky. If we repeat the same simulation in the same environment, we should receive identical results in our cases. A contributor should know that her changes affected the behavior (even by one digit) and she should evaluate the significance and investigate the reasons.

Apart from this, I like the idea of having a measure of the maximum difference we introduce (could be an error, could be an improvement).

Could you maybe point to some example data for your script?

shkodm commented 5 years ago

Yea, you can test it with for instance:

git clone https://github.com/shkodm/systemtests 
git clone https://github.com/shkodm/precice_st_output 

And then to output the relative differences above 1 percent:

sh systemtests/compare_results.sh precice_st_output/Ubuntu1804/Output_of-of/ systemtests/Test_of-of.Ubuntu1604/referenceOutput/ 1 
MakisH commented 5 years ago

We tried it together today, here are some comments:

MakisH commented 5 years ago

When opening a PR for this, please also use a low threshold (e.g. 1e-6).