Open fhieber opened 3 years ago
@fhieber Sparse features are currently not supported in numpy mode in Gluon 2.0.
I see, thanks. Are there plans to re-add this? Sparse gradient updates for embedding matrices provided noticeable improvements in training throughput in the past.
MXNet 2.0 NumPy arrays will need to follow the Python array API standard, so we will probably not add sparse support for NumPy arrays. However, I'm working on a workaround that lets users fall back to legacy mode and sparse gradients when sparse gradients are required for parameters and some operators.
Description
While migrating to the numpy namespaces in MXNet 2.0, I observed an error when trying to create a row_sparse parameter (see example below). The example shows our current pattern in MXNet 1.x (using NDArrays/symbols).
Does the new numpy interface not yet support row_sparse parameters/gradients?
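For context, here is a minimal sketch of what a row-sparse gradient means for an embedding matrix. It uses plain NumPy only and is not MXNet API. The function names `embedding_row_sparse_grad` and `sparse_sgd_update`, the toy shapes, and the SGD rule are all assumptions made for illustration. The point is that only the rows looked up in a batch receive a gradient, so the update can skip the rest of the matrix.

```python
import numpy as np


def embedding_row_sparse_grad(token_ids, grad_output):
    """Gradient of an embedding lookup as (row indices, per-row values).

    Hypothetical helper for illustration only: rows that were not looked up
    have zero gradient and are simply not stored.
    """
    token_ids = np.asarray(token_ids).ravel()
    rows, inverse = np.unique(token_ids, return_inverse=True)
    inverse = np.asarray(inverse).ravel()
    values = np.zeros((rows.size, grad_output.shape[1]), dtype=grad_output.dtype)
    # Accumulate gradients for tokens that occur more than once in the batch.
    np.add.at(values, inverse, grad_output)
    return rows, values


def sparse_sgd_update(weight, rows, values, lr):
    """Plain SGD step that touches only the rows present in the gradient."""
    weight[rows] -= lr * values


vocab_size, dim = 10000, 4
weight = np.ones((vocab_size, dim))
token_ids = np.array([3, 7, 3])      # token 3 occurs twice in the batch
grad_output = np.ones((3, dim))      # upstream gradient, one row per token

rows, values = embedding_row_sparse_grad(token_ids, grad_output)
sparse_sgd_update(weight, rows, values, lr=0.1)
```

In this toy case the update writes to 2 of the 10000 rows, whereas a dense gradient would need the full `(vocab_size, dim)` array.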
Error Message
To Reproduce
Environment