Memory issue in running existing model with XLM roberta

kumar-shridhar / Online-Toxicity-Detection

APOLLO-1: Online Toxicity Detection

MIT License

2 stars 4 forks source link

Closed n-akram closed 4 years ago

n-akram commented 4 years ago

The current implementation runs out of memory (with one GPU system, of memory 12 GB). The problem exists for both Local GPU and google colab.

n-akram commented 4 years ago

What I have tried so far:

GPU and TPU always result in memory overflow error. Running on local machine results in ETA of 300+ hours.

n-akram commented 4 years ago

Potential solutions:

n-akram commented 4 years ago

Changed the model to xlm-base, and it is working in both google Colab and local GPU. Local GPU: 3 hours, Colab 9 hours ETA.

However, colab seems to use around 8.14 GB. Whereas local machine is using complete assigned GPU.

n-akram commented 4 years ago

Another implementation seems to use parallelization.

Next task is to try that today on TPU with xlm roberta base.