larq / docs

Docs website for Larq, Larq Zoo, and Larq Compute Engine
https://docs.larq.dev
Apache License 2.0

Add section about performance claims of quantized/binarized NNs #12

Closed: lgeiger closed this issue 4 years ago

lgeiger commented 5 years ago

In https://github.com/openjournals/joss-reviews/issues/1746 @sbrugman raised a very valid point that is currently missing in our guides and documentation:

The documentation and paper make no claims on the comparison of performance between Quantized Neural Networks and their high-precision weight and bias counterparts. At the time of writing, there is clearly a trade-off (although the authors are working hard to close the gap). I suggest being transparent in this difference and possibly opening the option of educating users on when a few percents of accuracy on a specific dataset are essential and when not. This might be especially relevant for people working in the industry.

I think it would be good to add a small paragraph to our FAQ or the guides that briefly explains the efficiency story and the challenges of quantized networks, as well as what theoretical speedups someone can expect. In general, this always depends on the choice of networks and the hardware used to run the models, but I agree that we should be transparent about this.

@jamescook106 @koenhelwegen Do we have some resources we could use to elaborate on our claims on the home page?

lgeiger commented 4 years ago

#149 added real-world latency benchmarks for our models in Larq Zoo, so I think this can be closed.