Tencent / PocketFlow

An Automatic Model Compression (AutoMC) framework for developing smaller and faster AI applications.
https://pocketflow.github.io

Are you supporting sparse matrix operations or graph surgery? #261

Closed sjhan91 closed 5 years ago

sjhan91 commented 5 years ago

Hi, I'm working on pruning at the level of neurons and channels.

Most TensorFlow code on GitHub implements pruning by setting unimportant weights or channels to zero. However, simply zeroing them out neither reduces the model size nor accelerates inference.
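
To illustrate the point (a minimal NumPy sketch, not code from this repo): magnitude pruning by masking leaves the weight tensor dense, so its memory footprint and FLOP count are unchanged.

```python
import numpy as np

# Toy weight matrix; in a real model this would be a dense/conv kernel.
rng = np.random.default_rng(0)
w = rng.normal(size=(4, 4)).astype(np.float32)

# Magnitude pruning: zero out the 75% of weights with the smallest |w|.
threshold = np.quantile(np.abs(w), 0.75)
mask = (np.abs(w) >= threshold).astype(np.float32)
w_pruned = w * mask

# The pruned tensor is still a dense float32 array of the same shape,
# so it occupies exactly the same memory as the original weights.
print(w.nbytes, w_pruned.nbytes)       # identical byte counts
print(int((w_pruned == 0).sum()))      # number of zeroed entries
```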

In your project, did you solve this problem by supporting sparse matrix operations for weight sparsification, or TensorFlow graph surgery for channel pruning?

I skimmed your code, but I could only find masked weights or `channel[:, :, idx, :] = 0` syntax.
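
For comparison, here is a hedged sketch (plain NumPy, hypothetical kernel shapes) of the difference between the masking syntax quoted above and actual graph surgery, which physically removes a channel and shrinks the tensor:

```python
import numpy as np

# Toy 3x3 conv kernel with 8 input and 16 output channels
# (TensorFlow layout: [height, width, in_channels, out_channels]).
kernel = np.random.default_rng(1).normal(size=(3, 3, 8, 16)).astype(np.float32)

# Masking (what the question quotes): zero a channel; shape is unchanged.
masked = kernel.copy()
masked[:, :, 3, :] = 0.0
print(masked.shape)    # (3, 3, 8, 16) -- same size, same FLOPs

# Graph surgery: physically delete the channel, shrinking the tensor.
# The layer producing this kernel's input must be shrunk to match.
surgered = np.delete(kernel, 3, axis=2)
print(surgered.shape)  # (3, 3, 7, 16) -- fewer weights, fewer FLOPs
```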

Thanks.

jiaxiang-wu commented 5 years ago

It is difficult to achieve actual speed-up with sparse matrix multiplication unless the ratio of non-zero entries is sufficiently small. Since TensorFlow Lite does not support sparse matrix multiplication, we do not provide the corresponding model-conversion script (from a .ckpt model to a .pb model). Currently, we only support training with a sparsity constraint, which does not bring actual speed-up during inference.
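
A small SciPy sketch of why moderate sparsity does not pay off (assuming CSR as the sparse format; this is an illustration, not PocketFlow code): the index arrays of a sparse format carry their own cost, so at 50% sparsity CSR can take *more* memory than the dense matrix, and sparse matvec adds indexing overhead per non-zero entry.

```python
import numpy as np
from scipy import sparse

rng = np.random.default_rng(0)
dense = rng.normal(size=(256, 256)).astype(np.float32)

# Magnitude-prune 50% of the entries (a moderate sparsity level).
dense[np.abs(dense) < np.median(np.abs(dense))] = 0.0

# Convert the masked dense matrix to CSR.
csr = sparse.csr_matrix(dense)

# CSR stores values + column indices + row pointers, so at 50% sparsity
# it needs slightly MORE memory than the dense array it came from.
csr_bytes = csr.data.nbytes + csr.indices.nbytes + csr.indptr.nbytes
print(dense.nbytes, csr_bytes)

# Both representations compute the same product; the sparse one only
# starts winning once the non-zero ratio is very small.
x = rng.normal(size=(256,)).astype(np.float32)
y_dense = dense @ x
y_sparse = csr @ x
```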