Closed by aksh-at 10 months ago
Send larger batches to each Modal function instead of relying on `allow_concurrent_inputs`. It's a bit annoying to do, but it lets us embed 10% of Wikipedia in 5 minutes using 30-50 GPUs.
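For anyone skimming, here is a rough sketch of the batching idea, not the actual diff. It groups inputs into fixed-size chunks on the caller side so each function call receives a whole batch. `make_batches`, `embed_batch`, and `embed_all` are hypothetical names, and `embed_batch` is a CPU stand-in that returns a fake one-dimensional vector per text; in the real app it would be the GPU-backed Modal function, and the loop would presumably become a fan-out such as `embed_batch.map(batches)`.

```python
from typing import Iterable, Iterator, List


def make_batches(items: Iterable[str], batch_size: int) -> Iterator[List[str]]:
    """Yield successive lists of at most `batch_size` items."""
    if batch_size <= 0:
        raise ValueError("batch_size must be positive")
    batch: List[str] = []
    for item in items:
        batch.append(item)
        if len(batch) == batch_size:
            yield batch
            batch = []
    if batch:
        # Emit the final, possibly smaller, batch.
        yield batch


def embed_batch(texts: List[str]) -> List[List[float]]:
    """Hypothetical stand-in for the remote embedding function.

    Returns a fake 1-dimensional "embedding" (the text length) per input.
    """
    return [[float(len(t))] for t in texts]


def embed_all(texts: Iterable[str], batch_size: int) -> List[List[float]]:
    """Embed every text, one call per batch, preserving input order."""
    results: List[List[float]] = []
    for batch in make_batches(texts, batch_size):
        # One call handles the whole batch instead of one call per text.
        results.extend(embed_batch(batch))
    return results
```

The batch size is the knob to tune here: larger batches mean fewer calls and better GPU utilization per call, at the cost of more memory per call.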