open-mpi / ompi

Open MPI main development repository
https://www.open-mpi.org

Reproducible Linear Algebra through posit-enabled MPI reductions #6494

Open Ravenwater opened 5 years ago

Ravenwater commented 5 years ago

New feature request

Details of the problem

IEEE floating-point arithmetic does not obey the associative and distributive laws. As a result, the rounding error of a concurrent computation depends on the order in which the operations happen to be applied.
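The ordering effect can be seen without MPI at all. The sketch below is illustrative only: `chunked_sum` is a hypothetical helper that mimics a reduction over `nranks` processes by forming per-rank partial sums first, and the data values are chosen to make the effect obvious. It uses `functools.reduce` rather than the built-in `sum`, because recent Python versions apply compensated summation to floats in `sum`.

```python
from functools import reduce
import operator

# Associativity fails for IEEE 754 doubles: grouping changes the rounding.
a, b, c = 0.1, 0.2, 0.3
left = (a + b) + c
right = a + (b + c)

def naive_sum(values):
    """Plain left-to-right double-precision accumulation."""
    return reduce(operator.add, values, 0.0)

def chunked_sum(values, nranks):
    """Hypothetical stand-in for a reduction: each 'rank' sums its own
    contiguous chunk, then the partial sums are combined."""
    size = max(1, (len(values) + nranks - 1) // nranks)
    partials = [naive_sum(values[i:i + size]) for i in range(0, len(values), size)]
    return naive_sum(partials)

# The exact sum of these four values is 2.0.
data = [1e16, 1.0, -1e16, 1.0]
one_rank = chunked_sum(data, 1)
two_ranks = chunked_sum(data, 2)

print(left == right)        # grouping matters
print(one_rank, two_ranks)  # the answer depends on the rank count
```

With the same input, the one-rank and two-rank runs disagree with each other and with the exact result, which is the kind of configuration dependence this request is about.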

Posits are a tapered floating-point system that restores associativity and distributivity without rounding noise. We have built a high-performance reproducible linear algebra environment using posit arithmetic, but for DMM we need to augment the MPI reductions to use posit arithmetic.

We are looking for a research collaboration to add a posit-enabled set of reduction operations to Open MPI, creating a fully scalable, high-performance, reproducible linear algebra solution for arbitrary concurrency environments.

The benefit of this effort would be that research collaboration between teams with very different cluster configurations becomes trivial, because computational results are independent of the concurrency dynamics of the cluster.
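To make the goal concrete, here is a minimal sketch of a reduction whose result does not depend on how the data is split across ranks. It is not posit arithmetic and not an Open MPI API: it assumes the reproducibility comes from accumulating exactly and rounding only once at the end, and it uses Python's `fractions.Fraction` as a stand-in for such an exact accumulator. The names `exact_partial` and `reproducible_sum` are hypothetical.

```python
from fractions import Fraction
from functools import reduce
import operator

def exact_partial(values):
    """One rank's contribution, accumulated exactly (no rounding).
    Fraction(v) converts a double to its exact rational value."""
    return reduce(operator.add, (Fraction(v) for v in values), Fraction(0))

def reproducible_sum(values, nranks):
    """Hypothetical reduction: exact per-rank partials, exact combine,
    and a single rounding back to double at the very end."""
    size = max(1, (len(values) + nranks - 1) // nranks)
    partials = [exact_partial(values[i:i + size]) for i in range(0, len(values), size)]
    total = reduce(operator.add, partials, Fraction(0))
    return float(total)  # the only rounding step

data = [1e16, 1.0, -1e16, 1.0]
results = [reproducible_sum(data, n) for n in (1, 2, 4)]
print(results)  # identical for every rank count
```

Because exact addition is associative and commutative, any partitioning or combine order yields the same value before the final rounding. A real implementation would need a fixed-size accumulator type that can be sent between ranks and a user-defined reduction operation over it.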

bosilca commented 5 years ago

I would be curious to take a stab at this. Feel free to ping me privately.

Ravenwater commented 5 years ago

@bosilca I have emailed you and called your office number. Would you be so kind as to ping me?