Expected behavior
Was hoping that the model would actually train
Actual output
[00:00:38] Pre-processing files (1416 Mo) ████████████████████████████████████████ 100%
[00:00:00] Tokenize words ████████████████████████████████████████ 601011 / 601011
[00:00:00] Count pairs ████████████████████████████████████████ 601011 / 601011
[00:00:04] Compute merges ████████████████████████████████████████ 51324 / 51324
INFO:simpletransformers.language_modeling.language_modeling_model: Training of None tokenizer complete. Saved to
outputs/.
INFO:simpletransformers.language_modeling.language_modeling_model: Training language model from scratch
Desktop (please complete the following information):
Ubuntu 20.04
transformers==4.6.1
simpletransformers==0.61.6
Additional context
The training and test files are 1.3GB and 130MB respectively.
Debugging efforts
Stepping through the code leads to this:
--Call--
Exception ignored in: <function WeakValueDictionary.__init__.<locals>.remove at 0x7f771552dee0>
Traceback (most recent call last):
File "/home/reverie-pc/.pyenv/versions/3.8.0/lib/python3.8/weakref.py", line 103, in remove
File "/home/reverie-pc/.pyenv/versions/3.8.0/lib/python3.8/bdb.py", line 90, in trace_dispatch
File "/home/reverie-pc/.pyenv/versions/3.8.0/lib/python3.8/bdb.py", line 134, in dispatch_call
File "/home/reverie-pc/.pyenv/versions/3.8.0/lib/python3.8/pdb.py", line 250, in user_call
File "/home/reverie-pc/.pyenv/versions/3.8.0/lib/python3.8/pdb.py", line 354, in interaction
File "/home/reverie-pc/.pyenv/versions/3.8.0/lib/python3.8/pdb.py", line 1458, in print_stack_entry
File "/home/reverie-pc/.pyenv/versions/3.8.0/lib/python3.8/bdb.py", line 543, in format_stack_entry
ImportError: sys.meta_path is None, Python is likely shutting down
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.
Describe the bug The model starts to train but exits abruptly. If this is a memory issue, there is no indication.
To Reproduce Steps to reproduce the behavior:
Expected behavior Was hoping that the model would actually train
Actual output
Desktop (please complete the following information):
Additional context The training and test files are 1.3GB and 130MB respectively.
Debugging efforts Stepping through the code leads to this: