Noob Question : Need help running the code. It seems to be running for forever.

Hello good people!

I didn't know where else to post, so I am posting here.

Background : First of all, I am out of my elements here. I am just learning about RL. I got a job on it. It's more code oriented task but I need some concepts as well. I decided to throw myself in the water to break my stagnation. And I am struggling a bit, but that was the idea. I would like to understand the concepts eventually by myself but for the job I need to press on right now. I hope you can help me here.

Issue : When I run it with default arguments it just keep running. I think by default it is set to run 5 million episodes(T-max = 50e6). I want to run one successful run before I start playing with it so I have an idea on what the result is supposed to look like. Should I just change the T-max variable? There are about 20 more arguments and I am not sure if it affects other or not. For example, I think the target-update and learn-start are related to this. And since my concepts are not so clear, I could use some help here.

I hope I was clear, if not please ask me here.

Kaixhin / Rainbow

Noob Question : Need help running the code. It seems to be running for forever. #61