vishalaksh closed this issue 4 years ago
I am trying to use this library for real-time prediction, but a single prediction takes more than 1 minute, even on a GPU. Here is a colab link.
Here is a screenshot showing the execution time:
What could be causing this latency? Does it come from the library or from BERT itself?
Thanks
That is strange; it didn't use to be this slow. I wonder if this has to do with the framework choice? Also, in that link, I see:
Since I can't reproduce this, I am closing the issue.
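For anyone trying to narrow down a delay like this, one option is to time the one-time setup separately from each prediction call. This is only a rough sketch: it assumes the slowness might come from loading the model, or from the first call, rather than from every prediction. The thread does not confirm that. `load_fn` and `predict_fn` are hypothetical stand-ins, not this library's API; swap in whatever actually loads the model and runs one prediction.

```python
import time


def time_phases(load_fn, predict_fn, inputs):
    """Time the one-time setup separately from each prediction call."""
    start = time.perf_counter()
    model = load_fn()  # hypothetical: whatever builds or loads the model
    load_seconds = time.perf_counter() - start

    call_seconds = []
    for item in inputs:
        start = time.perf_counter()
        predict_fn(model, item)  # hypothetical: one prediction
        call_seconds.append(time.perf_counter() - start)

    return {"load_seconds": load_seconds, "call_seconds": call_seconds}


if __name__ == "__main__":
    # Stand-ins so the sketch runs on its own; replace with the real calls.
    def fake_load():
        time.sleep(0.2)  # pretend loading is slow
        return object()

    def fake_predict(model, text):
        time.sleep(0.01)  # pretend a single prediction is fast
        return "label"

    print(time_phases(fake_load, fake_predict, ["a", "b", "c"]))
```

If `load_seconds` or the first entry of `call_seconds` dominates, the cost is probably setup or warm-up, and keeping the model loaded between requests may help. If every call is slow, the prediction path itself is the place to look.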