microsoft / rat-sql

A relation-aware semantic parsing model from English to SQL
https://arxiv.org/abs/1911.04942
MIT License
406 stars 117 forks source link

Error during training (killed in step0) #31

Closed mellahysf closed 3 years ago

mellahysf commented 3 years ago

Hi there,

I'm trying to run your model with BERT, but during training, the process is killed just at Step0:

root@7e709b6634de:/app# python run.py train experiments/spider-bert-run.jsonnet To use data.metrics please install scikit-learn. See https://scikit-learn.org/st able/index.html [2020-10-21T08:05:13] Logging to logdir/bert_run/bs=6,lr=7.4e-04,bert_lr=3.0e-06 ,end_lr=0e0,att=1 [2020-10-21T08:06:48] Step 0 stats, train: loss = 161.07755279541016 [2020-10-21T08:07:34] Step 0 stats, val: loss = 195.53395080566406 [2020-10-21T08:08:48] Step 0: loss=186.9427 Killed

@alexpolozov @berlino @DevanshChoubey @MuriloSchaefer

mellahysf commented 3 years ago

Problem with RAM, I increase it more and it works now.

PedroEstevesPT commented 3 years ago

Hi,

I am facing the same problem. Can you describe how you increased the ram? Thanks

MuriloSchaefer commented 3 years ago

Moved to cloud computing :)

mellahysf commented 3 years ago

@Muradean
if you have sufficient RAM in your computer, increase using more RAM in docker: in the moment of running the container with the flag: -mXg (X is the number of G of RAM to use, perfectly use 10) or in docker>setting>Resources

PedroEstevesPT commented 3 years ago

I confirm the issue was lack of RAM