Parallel ledger hashing

PR #15980 introduces a neat trick that makes ledger hashes for a mask (i.e. a block) to be computed in a single function call, hashing is executed layer by layer.

We could go one step further and come up with a batch version of hash function.

As of the moment, we rely on block_cipher exposed from the rust poseidon implementation. Sponge construction as a whole is implemented in snarky. What could be done is the following:

Start using Rust's implementation for poseidon hash (including the sponge construction) // maybe padding/block splitting could be left in Ocaml for convenience
Come up with a new function in Rust interface to compute a hash batch (instead of individual hash)
Use rayon's par_iter in implementation of hash function to utilize multicore capabilities

Motivation

At this stage (after closing #14752, to be precise), stage ledger diff application's cost is dominated by computing various hashes:

Hash of an account (bottom layer of ledger)
Merge hash (non-bottom layers of ledger)
Receipt chain hash
Account's actions state hash

Overwhelming majority of cost (>80%) for processing a max-account-update block comes from merge/account hashes. As measured on server, it's around 1.2s for a max-account-update block, and it uses a single thread. When executed on a 12-core server, it has potential to be reduced to ~300ms (napkin math™).

MinaProtocol / mina

Parallel ledger hashing #16053