It would be awesome if the software architecture allowed for plugins that transform the network at, or right before, the intermediate layer.
Goal:
Enable, for example, quantization and compression of neural networks, which is especially interesting for resource-constrained target systems.
There are framework-specific methods (e.g. TensorFlow can output integer-quantized weights), but in the future there might be framework-agnostic algorithms that do this generically.
The effort here should hopefully be low, as the API would basically just offer “setters” on the intermediate weights and layers and default to an identity operation (i.e. no change).
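To make the idea a bit more concrete, here is a rough sketch of what such a plugin hook could look like. Everything in it is an assumption made for illustration: the names `IntermediateNetwork`, `TransformPlugin`, `UniformQuantizer` and `apply_plugins` are hypothetical and not part of any existing API, weights are plain Python lists rather than real tensors, and the quantizer is a deliberately naive rounding scheme, not a production algorithm.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class IntermediateNetwork:
    """Hypothetical intermediate representation: layer names plus per-layer weights."""
    layers: List[str] = field(default_factory=list)
    weights: Dict[str, List[float]] = field(default_factory=dict)

    def set_weights(self, layer: str, values: List[float]) -> None:
        # "Setter" on the intermediate weights, as suggested in the text above.
        if layer not in self.layers:
            raise KeyError(f"unknown layer: {layer}")
        self.weights[layer] = list(values)


class TransformPlugin:
    """Base plugin (hypothetical). The default transform is the identity, i.e. no change."""

    def transform(self, net: IntermediateNetwork) -> IntermediateNetwork:
        return net


class UniformQuantizer(TransformPlugin):
    """Illustrative plugin: snaps every weight to the nearest multiple of `step`."""

    def __init__(self, step: float) -> None:
        if step <= 0:
            raise ValueError("step must be positive")
        self.step = step

    def transform(self, net: IntermediateNetwork) -> IntermediateNetwork:
        for layer in net.layers:
            if layer not in net.weights:
                continue  # layers without weights are left untouched
            quantized = [round(w / self.step) * self.step for w in net.weights[layer]]
            net.set_weights(layer, quantized)
        return net


def apply_plugins(net: IntermediateNetwork, plugins: List[TransformPlugin]) -> IntermediateNetwork:
    # Run each plugin in order, at or right before the intermediate layer.
    for plugin in plugins:
        net = plugin.transform(net)
    return net
```

With this shape, a pipeline that registers no plugins (or only the base `TransformPlugin`) leaves the network unchanged, while something like `apply_plugins(net, [UniformQuantizer(step=0.5)])` would rewrite the weights through the setter. A framework-specific quantizer could be wrapped behind the same `transform` method.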