Open cjnolet opened 2 years ago
This issue has been labeled inactive-30d
due to no recent activity in the past 30 days. Please close this issue if no further response or action is needed. Otherwise, please respond with a comment indicating any updates or changes to the original issue and/or confirm this issue still needs to be addressed. This issue will be labeled inactive-90d
if there is no activity in the next 60 days.
This issue has been labeled inactive-90d
due to no recent activity in the past 90 days. Please close this issue if no further response or action is needed. Otherwise, please respond with a comment indicating any updates or changes to the original issue and/or confirm this issue still needs to be addressed.
The current way we are computing norms on sparse data in cuml is suboptimal because we are performing explicit loops over sparse vectors within threads, which causes massive divergences, especially on highly skewed degree distributions.
We should instead use the semiring spmv for this purpose, and if possible even store the vector element (it’s all 1s) only once.