uber / RemoteShuffleService

Remote shuffle service for Apache Spark to store shuffle data on remote servers.

Optimize memory usage and reduce GC by eliminating memory copies 3x #74

Open ambud opened 2 years ago

ambud commented 2 years ago

The current shuffle service implementation creates multiple copies of the shuffle write data, which increases GC pressure and, under heavy load, reduces performance due to GC throughput limitations. These copies can be eliminated by simply passing a reference to the original Netty ByteBuf through to the storage layer, removing the need for any Buf-to-array conversion / rewrapping.
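A minimal sketch of the idea, assuming a Netty pipeline and a hypothetical `StorageLayer` interface (neither the handler class nor the interface name comes from this repository):

```java
import io.netty.buffer.ByteBuf;
import io.netty.channel.ChannelHandlerContext;
import io.netty.channel.ChannelInboundHandlerAdapter;

/**
 * Hypothetical passthrough handler: the received ByteBuf is handed to the
 * storage layer as-is instead of being copied into a byte[] first.
 */
public class PassthroughShuffleHandler extends ChannelInboundHandlerAdapter {

    /** Hypothetical storage abstraction that accepts raw ByteBufs. */
    public interface StorageLayer {
        void append(ByteBuf data); // takes ownership; must release() when done
    }

    private final StorageLayer storage;

    public PassthroughShuffleHandler(StorageLayer storage) {
        this.storage = storage;
    }

    @Override
    public void channelRead(ChannelHandlerContext ctx, Object msg) {
        // Ownership of the buffer transfers to the storage layer; no
        // ByteBuf-to-array conversion or re-wrapping happens here.
        storage.append((ByteBuf) msg);
    }
}
```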

This PR is an initial draft of these changes, validated with the unit tests as well as basic long-running tests.

Original flow: network buf copied to custom buf -> shuffleDataWrapper to byte array (copy) -> wrapped to buf -> write to file as output stream (copy to byte array)

New flow: network buf to composite buf (no copy) -> handler passthru -> materialize to ByteBuffer -> write to file (via channel using direct buf)

This therefore eliminates the three in-memory copies of the data.
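As a rough illustration of the new flow (not the actual code in this PR), the sketch below accumulates network buffers in a Netty CompositeByteBuf and writes them out through a FileChannel gathering write; the class and method names are made up for the example:

```java
import io.netty.buffer.ByteBuf;
import io.netty.buffer.CompositeByteBuf;
import io.netty.buffer.Unpooled;

import java.io.IOException;
import java.nio.ByteBuffer;
import java.nio.channels.FileChannel;
import java.nio.file.Path;
import java.nio.file.StandardOpenOption;

/** Hypothetical writer illustrating the composite-buffer -> channel write path. */
public final class ZeroCopyShuffleWriter {

    // Incoming network buffers are added as components; their bytes are not copied.
    private final CompositeByteBuf composite = Unpooled.compositeBuffer();

    /** Takes ownership of the ByteBuf and appends it without copying. */
    public void append(ByteBuf networkBuf) {
        composite.addComponent(true, networkBuf);
    }

    /** Materializes the data as NIO ByteBuffers and writes via a gathering channel write. */
    public void flushTo(Path file) throws IOException {
        try (FileChannel channel = FileChannel.open(file,
                StandardOpenOption.CREATE, StandardOpenOption.WRITE, StandardOpenOption.APPEND)) {
            ByteBuffer[] buffers = composite.nioBuffers();
            long remaining = composite.readableBytes();
            long written = 0;
            while (written < remaining) {
                written += channel.write(buffers); // no intermediate byte[] allocation
            }
        } finally {
            composite.release(); // also releases the component buffers
        }
    }
}
```

The key point of the sketch is that the same memory Netty received the shuffle data into backs the ByteBuffers handed to the FileChannel, so no byte-array copy is introduced between the network and the file.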

CLAassistant commented 2 years ago

CLA assistant check
Thank you for your submission! We really appreciate it. Like many open source projects, we ask that you sign our Contributor License Agreement before we can accept your contribution.


Ambud Sharma does not appear to be a GitHub user. You need a GitHub account to be able to sign the CLA. If you already have a GitHub account, please add the email address used for this commit to your account.
You have signed the CLA already but the status is still pending? Let us recheck it.