deploy on serverless architecture

google-research / bert

TensorFlow code and pre-trained models for BERT

https://arxiv.org/abs/1810.04805

Apache License 2.0

38.23k stars 9.62k forks source link

deploy on serverless architecture #1079

Open flieks opened 4 years ago

flieks commented 4 years ago

Someone experience with deploying this to serverless ? AWS lambda has size restrictions to max of 512MB temp storage which is a problem.

Other options ? Because BERT can be GPU resource heavy if running fulltime

aegorfk commented 4 years ago

Hi. Also researching this. Found out that you can rather decrease the model requirements (aka use distilbert instead), or provision this yourself, try using cortex framework to ease the pain.