google-research / bert

TensorFlow code and pre-trained models for BERT
https://arxiv.org/abs/1810.04805
Apache License 2.0
38.23k stars 9.62k forks source link

deploy on serverless architecture #1079

Open flieks opened 4 years ago

flieks commented 4 years ago

Someone experience with deploying this to serverless ? AWS lambda has size restrictions to max of 512MB temp storage which is a problem.

image

Other options ? Because BERT can be GPU resource heavy if running fulltime

aegorfk commented 4 years ago

Hi. Also researching this. Found out that you can rather decrease the model requirements (aka use distilbert instead), or provision this yourself, try using cortex framework to ease the pain.