aiidateam / aiida-tutorials

AiiDA tutorials web site
http://aiida-tutorials.readthedocs.org

Prepare AiiDAlab deployment on Azure with AiiDA 2.x and test it #419

Open giovannipizzi opened 2 years ago

giovannipizzi commented 2 years ago

@csadorf @unkcpz @superstar54 @chrisjsewell @mbercx

csadorf commented 2 years ago

I have prepared a test deployment at https://aiida-tutorial-2022-test.aiidalab.xyz based on the revised docker stack with Python 3.9.4 and AiiDA 2.0.0.

mbercx commented 2 years ago

With @csadorf's help, I've adapted the docker stack to our needs for the tutorial (installing QE, aiida-quantumespresso, ...) and updated the versions to the latest releases (see email for link). This should be close to the final deployment we want to use for the tutorial. For the testing on Azure, I've made the following assignments based on the presenters of the corresponding sessions in the schedule:

If you're running Quantum ESPRESSO for your sections, you still need to set up the pw.x code:

verdi code setup --label pw --computer localhost --remote-abs-path /opt/conda/bin/pw.x --input-plugin quantumespresso.pw --non-interactive

And install the SSSP efficiency pseudopotential family:

aiida-pseudo install sssp

Additional Notes:

unkcpz commented 2 years ago

One possible issue: the first time I started the AiiDAlab server, I hit a timeout and got a "server failed to start" error. I had to refresh the page to relaunch the server. An option to increase the timeout might be helpful. @csadorf @mbercx

mbercx commented 2 years ago

Thanks @unkcpz for letting us know! I'll adapt the timeout in the terraform setup.

@csadorf just to be sure, the value to be adapted is the startTimeout here, right?

singleuser:
[...]
  memory:
    limit: 4G
    guarantee: 2G
  cpu:
    limit: 3.0
    guarantee: 1.0
  startTimeout: 300

csadorf commented 2 years ago

> Thanks @unkcpz for letting us know! I'll adapt the timeout in the terraform setup.
>
> @csadorf just to be sure, the value to be adapted is the startTimeout here, right?

Yes, I believe that should be sufficient. I'm just a bit surprised that this has not been an issue during your tests.

mbercx commented 2 years ago

> I'm just a bit surprised that this has not been an issue during your tests.

Yeah, strange that the issue only pops up now. The timeout is currently at 120s, maybe @unkcpz was just unlucky. 😅

I'll update the deployment later tonight, after I'm done with testing the basic workflows section.
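
For reference, a minimal sketch of the change discussed here, assuming the deployment takes Zero to JupyterHub-style Helm values like the snippet quoted earlier in this thread. The value 600 is an illustrative assumption, not the value that was actually deployed:

```yaml
# Hypothetical values fragment: only the timeout is changed.
singleuser:
  # Seconds JupyterHub waits for a user's server to start before
  # reporting a failure (the thread mentions it is currently 120).
  startTimeout: 600  # assumed value for illustration
```

A longer timeout mainly helps the first launch on a fresh node, which can be slow if the user image still has to be pulled.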

ramirezfranciscof commented 2 years ago

> * [ ] Writing workflows - [Basics](https://aiida-tutorials.readthedocs.io/en/tutorial-2022-intro/sections/writing_workflows/index.html): I'll take care of these (Up to and including Debugging)
>
> * [ ] Writing workflows - [Advanced](https://aiida-tutorials.readthedocs.io/en/tutorial-2022-intro/sections/writing_workflows/index.html): @ramirezfranciscof can you test from the Real-world example?

Donzo #439. Actually, I didn't notice the distinction you made there between "up to Debugging" and "after Debugging", so I just went through the whole workflow section, sorry! I hope I didn't overstep on what you were checking, @mbercx.