Alluxio / alluxio

Alluxio, data orchestration for analytics and machine learning in the cloud
https://www.alluxio.io
Apache License 2.0

Local cache with large page size causes out of heap memory #16669

Open LuQQiu opened 1 year ago

LuQQiu commented 1 year ago

Alluxio Version: Dora branch - 2.10.0-SNAPSHOT

Describe the bug

A local cache hit read is zero copy. A read through the FUSE SDK with local cache copies the data at least twice:

byte[] page = LocalCacheFileInStream.readExternalPage(position, readType);
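To illustrate why returning the page as a `byte[]` matters, here is a minimal sketch, not Alluxio's actual code. The class and method names (`PageCopySketch`, `copyWholePage`, `copyChunked`) are hypothetical. It contrasts materializing a whole page on the heap before handing it on with streaming the same page through a small reusable buffer, so heap use no longer scales with page size.

```java
import java.io.IOException;
import java.io.InputStream;
import java.io.OutputStream;

/** Hypothetical sketch; these names are not part of Alluxio. */
public class PageCopySketch {

  /**
   * Reads a whole page into one heap array, then writes it out.
   * Heap demand is at least pageSize bytes, and the data is copied twice
   * (source to array, array to destination).
   */
  public static long copyWholePage(InputStream in, OutputStream out, int pageSize)
      throws IOException {
    byte[] page = new byte[pageSize]; // allocation grows with the page size
    int filled = 0;
    while (filled < pageSize) {
      int n = in.read(page, filled, pageSize - filled);
      if (n < 0) {
        break; // source ended before a full page was read
      }
      filled += n;
    }
    out.write(page, 0, filled);
    return filled;
  }

  /**
   * Streams the same page through a fixed-size buffer.
   * Heap demand is chunkSize bytes regardless of the page size.
   */
  public static long copyChunked(InputStream in, OutputStream out, long pageSize, int chunkSize)
      throws IOException {
    byte[] chunk = new byte[chunkSize];
    long total = 0;
    while (total < pageSize) {
      int toRead = (int) Math.min(chunkSize, pageSize - total);
      int n = in.read(chunk, 0, toRead);
      if (n < 0) {
        break; // source ended early
      }
      out.write(chunk, 0, n);
      total += n;
    }
    return total;
  }
}
```

Both methods move the same bytes; only the peak heap allocation differs. Whether a chunked or zero-copy path fits the FUSE read path is a design question for the maintainers.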

To Reproduce

1. Change the page size to 512MB.
2. Launch the Fuse SDK with local cache.
3. Run an FIO big file read: fio --name=sequentialread --rw=read --bs=4m --numjobs=1 --filesize=1g --direct=1 --group_reporting --nrfiles 1

The read fails because the JVM runs out of heap memory.
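A rough back-of-the-envelope check makes the failure plausible. The sketch below is an estimate under stated assumptions, not a measurement: it assumes exactly two heap copies per page read and one read in flight, and the class name `HeapEstimate` is hypothetical.

```java
/** Hypothetical helper for estimating transient heap demand of page reads. */
public class HeapEstimate {

  /** Bytes of heap held at once: page size x copies per read x concurrent reads. */
  public static long transientHeapBytes(long pageSizeBytes, int copiesPerRead, int concurrentReads) {
    return Math.multiplyExact(
        Math.multiplyExact(pageSizeBytes, (long) copiesPerRead), (long) concurrentReads);
  }

  /** True if the estimated demand fits within the given maximum heap size. */
  public static boolean fitsInHeap(long requiredBytes, long maxHeapBytes) {
    return requiredBytes <= maxHeapBytes;
  }

  public static void main(String[] args) {
    long pageSize = 512L * 1024 * 1024; // 512MB page, as in the reproduction steps
    long required = transientHeapBytes(pageSize, 2, 1); // assumed: 2 copies, 1 read in flight
    long maxHeap = Runtime.getRuntime().maxMemory(); // depends on -Xmx of the running JVM
    System.out.printf("required=%d bytes, max heap=%d bytes, fits=%b%n",
        required, maxHeap, fitsInHeap(required, maxHeap));
  }
}
```

Under these assumptions a single 512MB page read needs about 1GB of transient heap, before counting anything else the process holds, so a JVM with a heap of 1GB or less would be expected to fail.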


HelloHorizon commented 1 year ago

@LuQQiu Should we close this issue?

github-actions[bot] commented 1 year ago

This issue has been automatically marked as stale because it has not had recent activity. It will be closed in two weeks if no further activity occurs. Thank you for your contributions.

apc999 commented 1 year ago

@LuQQiu Is this still an issue in 30x?