Closed turboFei closed 2 weeks ago
What changes were proposed in this pull request?
Fix the negative disk buffer size issue.
Before this change, when PartitionDataWriter wrote with memory file storage, the following sequence could occur. While isMemoryShuffleFile is true, the memory file storage counter is incremented. If evict is then needed, the buffer is flushed and isMemoryShuffleFile is set to false.
As a result, the disk buffer size eventually became negative, while the memory file storage size always stayed positive.
This PR updates the counter after evict has finished.
Why are the changes needed?
With no active applications running in the Celeborn cluster, the Celeborn worker log still reported abnormal values:
24/11/09 23:30:50,474 INFO [worker-memory-manager-reporter] MemoryManager: Direct memory usage: 276.0 MiB/40.0 GiB, disk buffer size: -748726.0 B, sort memory size: 0.0 B, read buffer size: 0.0 B, memory file storage size : 731.2 KiB
Both the disk buffer size (-748726.0 B) and the memory file storage size (731.2 KiB) are expected to be 0.
Does this PR introduce any user-facing change?
No.
How was this patch tested?
UT and Integration testing.
cc @FMX @RexXiong
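The counter ordering described in this PR can be illustrated with a small, self-contained Java model. This is only a sketch under stated assumptions, not Celeborn's actual code: the class BufferAccountingSketch, its fields, and the simulate helper are hypothetical, and the model assumes that buffered bytes are released from whichever counter matches isMemoryShuffleFile at flush time.

```java
import java.util.concurrent.atomic.AtomicLong;

// Hypothetical model of the two worker counters; not Celeborn's real classes.
final class BufferAccountingSketch {
    final AtomicLong diskBuffer = new AtomicLong();
    final AtomicLong memoryFileStorage = new AtomicLong();

    private final boolean countAfterEvict; // true models the order proposed in this PR
    private boolean isMemoryShuffleFile = true;
    private long buffered = 0;

    BufferAccountingSketch(boolean countAfterEvict) {
        this.countAfterEvict = countAfterEvict;
    }

    // The counter that matches the current storage type.
    private AtomicLong currentCounter() {
        return isMemoryShuffleFile ? memoryFileStorage : diskBuffer;
    }

    void write(long bytes, boolean evictNeeded) {
        if (!countAfterEvict) {
            // Old order: count the bytes before evict can flip the flag.
            currentCounter().addAndGet(bytes);
        }
        if (isMemoryShuffleFile && evictNeeded) {
            flush();                     // release what was buffered in memory
            isMemoryShuffleFile = false; // later data goes to disk
        }
        if (countAfterEvict) {
            // New order: count the bytes once the storage type is settled.
            currentCounter().addAndGet(bytes);
        }
        buffered += bytes;
    }

    // Release the buffered bytes from the counter matching the current flag.
    void flush() {
        currentCounter().addAndGet(-buffered);
        buffered = 0;
    }

    // Runs one write without evict, one write that triggers evict, then a final flush.
    // Returns {diskBuffer, memoryFileStorage}.
    static long[] simulate(boolean countAfterEvict) {
        BufferAccountingSketch w = new BufferAccountingSketch(countAfterEvict);
        w.write(100, false);
        w.write(50, true);
        w.flush();
        return new long[] {w.diskBuffer.get(), w.memoryFileStorage.get()};
    }

    public static void main(String[] args) {
        long[] before = simulate(false);
        long[] after = simulate(true);
        // prints "old order: diskBuffer=-50, memoryFileStorage=50"
        System.out.println("old order: diskBuffer=" + before[0] + ", memoryFileStorage=" + before[1]);
        // prints "new order: diskBuffer=0, memoryFileStorage=0"
        System.out.println("new order: diskBuffer=" + after[0] + ", memoryFileStorage=" + after[1]);
    }
}
```

In this model the old order leaves the disk buffer counter negative and the memory file storage counter positive by the size of the write that triggered evict, which matches the shape of the values in the worker log (748726 B is roughly 731.2 KiB). Counting after evict brings both counters back to 0.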