If there are too many inserts, the distribute tasks will run out of heap and fail.
The data merging uses lazy iterator and does not load all data into memory.
Problem is in serialization. The entire data set is serialized into on-heap byte[], and that causes OOEM.
Solution is to serialize data into FileChannel and latter transfer it into log file
If there are too many inserts, the distribute tasks will run out of heap and fail.
The data merging uses lazy iterator and does not load all data into memory. Problem is in serialization. The entire data set is serialized into on-heap
byte[]
, and that causes OOEM.Solution is to serialize data into FileChannel and latter transfer it into log file