distilBERT with GPU, MAX_LEN = 100:
LENGTH OF TOTAL DOCUMENT 3003
-- Encoder: 3003 sentences in 35s
LENGTH OF TOTAL DOCUMENT 3003
-- Encoder: 3003 sentences in 35s
LENGTH OF TOTAL DOCUMENT 3003
-- Encoder: 3003 sentences in 35s
LENGTH OF TOTAL DOCUMENT 3003
-- Encoder: 3003 sentences in 35s
LENGTH OF TOTAL DOCUMENT 3003
-- Encoder: 3003 sentences in 34s
distilBERT with CPU, MAX_LEN = 100: Time to embed 5 documents of 3003 sentences each
Total time: around 7 min * 5 = 35 min
In this setting, we compute one sentence at a time.
distilBERT with GPU, MAX_LEN = 100: Time to embed 5 documents of 3003 sentences each
Total time: around 35 s * 5 = 175 s = 2 min 55 s
In this setting, we encode the sentences in batches of size 32.
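As a rough illustration of the batching described above, the sketch below splits a 3003-sentence document into batches of 32. It does not call distilBERT. The helper name iter_batches and the placeholder sentences are assumptions made for this example, not the code that produced the timings.

```python
from typing import Iterator, List, Sequence

BATCH_SIZE = 32  # batch size stated for the GPU setting


def iter_batches(sentences: Sequence[str], batch_size: int = BATCH_SIZE) -> Iterator[List[str]]:
    """Yield consecutive slices of at most batch_size sentences."""
    if batch_size <= 0:
        raise ValueError("batch_size must be positive")
    for start in range(0, len(sentences), batch_size):
        yield list(sentences[start:start + batch_size])


# Placeholder document with the same sentence count as in the log (hypothetical content).
document = [f"sentence {i}" for i in range(3003)]

batches = list(iter_batches(document))
# Each batch would be passed to the encoder in a single forward pass.
print(len(batches), len(batches[-1]))  # prints: 94 27
```

With a batch size of 32 the encoder would be called 94 times per document (93 full batches plus one of 27 sentences), compared with 3003 calls when sentences are processed one at a time.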
distilBERT with CPU, MAX_LEN = 100:
100% 3003/3003 [06:54<00:00, 7.30it/s]
100% 3003/3003 [07:15<00:00, 7.25it/s]
100% 3003/3003 [07:06<00:00, 7.75it/s]
100% 3003/3003 [06:35<00:00, 7.91it/s]
100% 3003/3003 [06:31<00:00, 8.07it/s]
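The rounded totals quoted in these notes can be checked against the per-document timings in the logs. The following is a minimal sketch: the elapsed times are copied by hand from the CPU progress bars and the GPU "Encoder" lines, and the helper to_seconds is a name introduced only for this example.

```python
def to_seconds(mm_ss: str) -> int:
    """Convert an 'MM:SS' elapsed time, as printed in the progress bars, to seconds."""
    minutes, seconds = mm_ss.split(":")
    return int(minutes) * 60 + int(seconds)


# Elapsed time per document, copied from the CPU progress bars.
cpu_elapsed = ["06:54", "07:15", "07:06", "06:35", "06:31"]
# Seconds per document, copied from the GPU "Encoder" log lines.
gpu_seconds = [35, 35, 35, 35, 34]

cpu_total = sum(to_seconds(t) for t in cpu_elapsed)
gpu_total = sum(gpu_seconds)
speedup = cpu_total / gpu_total

print(cpu_total, gpu_total, round(speedup, 1))  # prints: 2061 174 11.8
```

The exact sums are 2061 s (34 min 21 s) on CPU and 174 s (2 min 54 s) on GPU, consistent with the rounded figures of about 35 min and 2 min 55 s. The resulting ratio of roughly 12x reflects both the change of device and the change from single-sentence to batched encoding, so it should not be attributed to the GPU alone.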