ccccjunkang closed this issue 4 weeks ago
Could you please disclose the actual number of samples, number of GPUs used, and number of model parameters in the online system?
Hi. The online system uses a model with the same parameter count as described in the paper, trained on hundreds of millions of samples.