Hi, I have some questions about ASP module:
The document and related paper about N:M sparsity says that the matrices are compressed and the metedata are 2-bit .
But I after using the ASP.prune_trained_model(model, optimizer, I saved the weights about my model.
I found that the matrices did not be compressed and the metadata are bool-type.
I wonder where are the compressed matrices? and where is the 2-bit type metata?
Or I need another step to compress my matrices?
Hi, I have some questions about ASP module: The document and related paper about N:M sparsity says that the matrices are compressed and the metedata are 2-bit . But I after using the ASP.prune_trained_model(model, optimizer, I saved the weights about my model. I found that the matrices did not be compressed and the metadata are bool-type.
I wonder where are the compressed matrices? and where is the 2-bit type metata? Or I need another step to compress my matrices?
Thanks a lot.