Closed by mattmoor 5 years ago
+1 -- this is a very common use case we're seeing.
How would you define 'hyperscale' in a measurable unit? :-)
/kind doc
I'd define hyperscale as >100k qps (requests/second). Other definitions may vary. Measured in CPU-seconds/second, I'd expect hyperscale to be >10k CPU-seconds/second.
Is this worth keeping open?
I believe there were talks on this at KubeCon. Maybe @mchmarny would want to put together a sample for the knative/docs repo, but that should be tracked there.
Using Knative, I can load my trained TensorFlow model and statelessly serve it at hyperscale.
cc @mchmarny