microsoft / Microsoft.IO.RecyclableMemoryStream

A library to provide pooling for .NET MemoryStream objects to improve application performance.
MIT License
1.95k stars 203 forks source link

Consider using ArrayPool<byte> as underlying storage mechanism #314

Open benmwatson opened 9 months ago

benmwatson commented 9 months ago

Pros? Cons?

Would it simplify implementation?

sgorozco commented 9 months ago

Hi Ben! I really like your current implementation A LOT. It is very readable, tailor-made for the use-case and very easy to reason about; with nice event hooks to observe the operations, and very useful debugging utilities.

With ArrayPool being an abstract class, things are a bit blurrier. The Shared implementation looks complex, with multi-core optimizations that might improve performance, but then, you would be relying on a singleton class that is heavily used internally by the runtime libraries (at least in the .Net Standard 2.0 libraries) and that might dwarf the optimizations. The non-shared implementation looks much simpler but is restricted to work internally with buffers that are sized in powers-of-two and last time I checked, protects concurrent access via a SpinLock; It may offer better performance, but personally I like more being able to directly inspect the code as in your current implementation.

Currently I am using the library to implement extremely efficient channel multiplexing. I am concurrently serializing hundreds of large DTOs to Recyclable streams and the data slicing to enable the multiplexing is completely handled by your library, with zero-copy, zero-allocation. According to the profiler, now the "bottleneck" is curiously, on the Math.DivRem() operations that are done to locate the buffer to place a byte! I am really thankful.

I'm curious about your own thoughts? What are the reasons why you consider such a change?

benmwatson commented 9 months ago

It's been asked a couple of times, so I created this as a place for discussion.

grbell-ms commented 8 months ago

Adding to what @sgorozco said:

RMS would need to use its own array pool implementation, which wouldn't simplify anything. However, it may be worth it to open up an extension point to let users control allocation. If we used ArrayPool<byte> as the interface, RMS should throw if either of the default implementations are used, and document how the custom pool should behave.

AnthonyLloyd commented 6 months ago

DOTNET_SYSTEM_BUFFERS_SHAREDARRAYPOOL_MAXARRAYSPERPARTITION can be used to set the number of arrays cached.

I think there should be a way to configure RMS to use the ArrayPool<byte>.Shared also.