[ ] Attach the benchmark numbers. (Good for beginners)
[ ] Add a tutorial about how to convert GluonNLP models to TVM and do inference. (Good for beginners)
[ ] Investigate the numerical issues triggered when converting ALBERT to TVM. Currently, the test still fails intermittently even with tolerances as loose as atol=1E-1, rtol=1E-3.
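
To make the tolerance numbers in the last item concrete, here is a minimal sketch of the element-wise criterion `|actual - desired| <= atol + rtol * |desired|`, which is the rule `numpy.testing.assert_allclose` applies. It assumes the ALBERT test uses that criterion, which this issue does not state. The helper name `max_violation` and the sample values are hypothetical and are not taken from the GluonNLP test suite.

```python
def max_violation(actual, desired, atol=1e-1, rtol=1e-3):
    """Return the largest amount by which |actual - desired| exceeds
    atol + rtol * |desired|. A non-positive result means every element
    is within tolerance."""
    if len(actual) != len(desired):
        raise ValueError("actual and desired must have the same length")
    return max(
        abs(a - d) - (atol + rtol * abs(d))
        for a, d in zip(actual, desired)
    )


# Hypothetical outputs: reference framework result vs. TVM result.
reference = [0.50, -2.00, 10.00]
converted = [0.55, -2.05, 10.20]

# A positive value means at least one element is out of tolerance.
print(max_violation(converted, reference))
```

With these made-up values only the third element is out of tolerance: its absolute error of 0.2 exceeds the allowed 0.1 + 0.001 * 10 = 0.11. Reporting the worst violation instead of a pass/fail flag may make it easier to see how far off the intermittent failures are.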
Description
TVM support has been added in https://github.com/dmlc/gluon-nlp/pull/1390. We also updated our benchmarking utility to support profiling the inference speed of TVM: https://github.com/dmlc/gluon-nlp/tree/master/scripts/benchmarks. The documentation can still be improved.
@dmlc/gluon-nlp-committers