Open MrEdwards007 opened 2 weeks ago
I let the process run and found that it stopped, though I'm not entirely sure about completion after (I believe) was 100,000 iterations six hours later.
Some indication that I wasn't in an endless loop and what percentage of completion is highly desirable, even if this is machine independent time to completion.
Yet, this was followed by the notification that I am in need of assistance in resolving.
Average tokens per second: 977.64 /home/homer/anaconda3/envs/ai_scientist/lib/python3.11/site-packages/numpy/core/fromnumeric.py:3504: RuntimeWarning: Mean of empty slice. return _methods._mean(a, axis=axis, dtype=dtype, /home/homer/anaconda3/envs/ai_scientist/lib/python3.11/site-packages/numpy/core/_methods.py:129: RuntimeWarning: invalid value encountered in scalar divide ret = ret.dtype.type(ret / rcount)
This should be solved by: https://github.com/SakanaAI/AI-Scientist/issues/89#issuecomment-2323129188
the nanogpt part would iterate till 1,00,000 times for i guess 2-3 times as per me but the nanogpt_lite one would be faster
& it would be faster in the H100 gpu
Good day, I was performing the installation process as shown in the model card instructions located at https://github.com/SakanaAI/AI-Scientist
I executed "cd templates/nanoGPT && python experiment.py --out_dir run_0 && python plot.py" After two hours of repeatedly cycling through building the model, I had no idea if this was normal, should it have finished after a few minutes or was it nearly finished and I just didn't know better.
OS : Ubuntu 22.04.4 LTS Driver Version : 535.183.01 CUDA Version : 12.2 Attached GPUs : 1 Product Name : NVIDIA RTX A4500 Laptop GPU Product Brand : NVIDIA RTX
I terminated the process at the keyboard after two hours of cycling. I've added a screenshot after restarting this same process, which has been running for about 40 minutes.
Questions