Index error while running run_allocate file for UK Union dataset

File: /data/DUCATI_SIGMOD/run_allocate.py

I'm encountering an IndexError when running the code with the UK Union dataset for batch sizes of 8192 and 4096. The total budget passed as a parameter is 15-20 GB or higher. Despite 49.14 GB of GPU memory being available, the code fails intermittently.

P.S. I ran into a similar issue with the Twitter dataset, but in the adjacency cache allocation.

We observed that this issue only appeared when the allocated adj cache size was larger than the total adj size. Therefore, by increasing the fake_dim parameter, we effectively reduced the adj budget (so that the total adj never fits within the adj cache). However, the issue still occurs for the adj cache as well. For UK Union, the issue is in the nfeat cache allocation, where the total nfeat size is far larger than both the total budget and the allocated nfeat cache.

Twitter: total adj size: 11.251GB, total nfeat size: 59.274GB
UK Union: total adj size: 42.031GB, total nfeat size: 64.717GB
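
To make the suspected failure mode concrete, here is a minimal sketch of the condition described above. It assumes the cache is filled by taking the first k entries of a priority-ordered list, which may not be how run_allocate.py actually works. The function name `clamp_cache_entries` and the toy sizes are hypothetical and are not taken from the repository.

```python
def clamp_cache_entries(budget_bytes, bytes_per_entry, total_entries):
    """Return how many entries can be cached.

    The result is bounded by both the byte budget and the number of
    entries that actually exist (hypothetical helper, for illustration).
    """
    if bytes_per_entry <= 0:
        raise ValueError("bytes_per_entry must be positive")
    fits_in_budget = int(budget_bytes // bytes_per_entry)
    return max(0, min(fits_in_budget, total_entries))


# Toy example: 4 entries of 8 bytes each, but a budget large enough for 10.
priority_order = [3, 0, 2, 1]       # hypothetical priority order of entry ids
bytes_per_entry = 8
budget_bytes = 10 * bytes_per_entry

# Without clamping, indexing at the budget-derived position fails:
unclamped = budget_bytes // bytes_per_entry
try:
    priority_order[unclamped - 1]
    raised = False
except IndexError:
    raised = True

# With clamping, the slice never reaches past the end of the list.
n_cached = clamp_cache_entries(budget_bytes, bytes_per_entry, len(priority_order))
cached_ids = priority_order[:n_cached]
```

If the allocation code derives an index from the budget alone, a budget larger than the total size would produce exactly this kind of out-of-range access. That would match the Twitter adj case, but it would not by itself explain the UK Union nfeat case, where the total size exceeds the budget.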

I would really appreciate it if someone could shed some light on this. Thanks.

initzhang / DUCATI_SIGMOD
