geoschem / geos-chem-cloud

Run GEOS-Chem easily on AWS cloud
http://cloud.geos-chem.org
MIT License
39 stars 9 forks source link

Ideas for user survey and testing (for the paper) #14

Closed JiaweiZhuang closed 5 years ago

JiaweiZhuang commented 6 years ago

Daniel suggested a survey & testing among users to see how quickly they can get started with the AWS cloud. I think this is a great idea and will add a significant value to the paper.

Here's my preliminary plan:

Test specification

Users/volunteers will be asked to go through one or two pre-defined workflows, and report the time spent on each stage.

Test 1 (required). The absolutely minimum beginner demo, following the Quick start guide.

  1. Sign up an AWS account (or just giving them a temporary account? See "Issues to discuss" below)
  2. Launch a standard, on-demand EC2 instance from tutorial AMI.
  3. Execute the pre-configured GEOS-Chem executable to run a 20-min proof-of-concept simulation
  4. Launch a Jupyter notebook on EC2; read and plot the output data using xarray
  5. Terminate the server.

The entire process should only take 10~20 minutes if everything goes smoothly. Factors that could slow down the process include:

Test 2 (optional). A more complete, customizable workflow, resembling a realistic project. This requires the user to read the all beginner tutorials for additional stuff like S3 and spot.

Those steps will be documented more explicitly, to replace the current research workflow section. The entire process should take 1\~2 hours, if the users have already read most of beginner tutorials like AWS-CLI, S3, and spot. Would it be necessary to also record the time spent on those individual chapters? They should be finished in 1\~2 hours in total, depending on how focused a user is.

Information to gather

Besides the time spent on each stage of the workflow, additional information includes:

How do users report results

We could in principle use Google Forms, but I want to users to directly reply on GitHub issue tracker. This makes the "raw data" transparently visible and ensures information credibility. All participants wound need to sign up their own GitHub accounts.

Remaining issues to discuss

  1. Where to obtain AWS accounts? I would like to let users sign up their own accounts using their own credit cards. But not everyone might be willing to do so. We can also send out temporary user account (IAM users derived from our own root account) for testing. I should be able to get some credits for this.

  2. Collaborate with existing classes & conferences? Students from EPS200/EPS236 seem good candidates. I also really like to collaborate with CS205 because that class teaches a lot about scientific computing on AWS, but it is not offered in the coming semester. A great chance to gather information is the workshop at ICC9, but it is two semesters ahead. We can get some initial results by inviting users online, and gradually add more data if there's a chance to perform offline user testing.

  3. How many samples should we get? 20 doesn't seem very compelling? Maybe 50?

CC @yantosca @lizziel @msulprizio @sdeastham @ayshaw for any comments. Let's make all our discussions transparent on GitHub. This is the approach adopted by Pangeo (https://github.com/pangeo-data/pangeo/issues) and I think this transparency is quite valuable for the community.

djxjacob commented 6 years ago

I think this would be great to send this survey to the geos-chem email list, telling them that we invite them to use the cloud facility, that it should be very easy to use and if it's not we want to hear about it. You can also invite others, e.g. Harvard classes, with no prior GC experience. Expect self-selected responses though. From the standpoint of the BAMS paper, I think what we should do is just ask the students and postdocs in the group with prior experience of GEOS-Chem. Four reasons: (1) no credit card issue, (2) homogeneity of background, (3) we can ask them all to participate, (4) that's a large enough sample. Yes, they will be proficient in GEOS-Chem, but with no experience in running on the cloud and that's what what we want to test.

JiaweiZhuang commented 6 years ago

@djxjacob Thanks very much!

I think this would be great to send this survey to the geos-chem email list, telling them that we invite them to use the cloud facility, that it should be very easy to use and if it's not we want to hear about it.

Sure, I will let you know when it is ready.

I think what we should do is just ask the students and postdocs in the group with prior experience of GEOS-Chem.

I agree that this is the most straightforward way, although people in our group are getting excellent technical support from GCST and probably won't find the cloud more convenient. But that should be good enough for timing purpose.

JiaweiZhuang commented 5 years ago

The user survey is now published with the paper: https://journals.ametsoc.org/doi/abs/10.1175/BAMS-D-18-0243.1

djxjacob commented 5 years ago

Excellent, thanks! Daniel

From: Jiawei Zhuang notifications@github.com Sent: Saturday, September 7, 2019 2:25 PM To: geoschem/geos-chem-cloud geos-chem-cloud@noreply.github.com Cc: Jacob, Daniel J. djacob@fas.harvard.edu; Mention mention@noreply.github.com Subject: Re: [geoschem/geos-chem-cloud] Ideas for user survey and testing (for the paper) (#14)

The user survey is now published with the paper: https://journals.ametsoc.org/doi/abs/10.1175/BAMS-D-18-0243.1https://urldefense.proofpoint.com/v2/url?u=https-3A__journals.ametsoc.org_doi_abs_10.1175_BAMS-2DD-2D18-2D0243.1&d=DwMCaQ&c=WO-RGvefibhHBZq3fL85hQ&r=rhdjeCmvErplnjGEj5OoH_CDkPdqB-BDUxqwWC1AzIs&m=_j4fPRwF1uP1-y5FumBtLJ7zVc8cbLD3sC9uQBTN-_s&s=SLVv53PQkv_WKyUh-pWCHEgdrcJJbzsd2lsHrq1izqU&e=

— You are receiving this because you were mentioned. Reply to this email directly, view it on GitHubhttps://urldefense.proofpoint.com/v2/url?u=https-3A__github.com_geoschem_geos-2Dchem-2Dcloud_issues_14-3Femail-5Fsource-3Dnotifications-26email-5Ftoken-3DAKHHSVL32C3OUAUQU566ZLDQIPWYVA5CNFSM4FSWQFVKYY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOD6E7DAI-23issuecomment-2D529133953&d=DwMCaQ&c=WO-RGvefibhHBZq3fL85hQ&r=rhdjeCmvErplnjGEj5OoH_CDkPdqB-BDUxqwWC1AzIs&m=_j4fPRwF1uP1-y5FumBtLJ7zVc8cbLD3sC9uQBTN-_s&s=lPpeLle--YdON4xJkaG788ptBPVlPCXcUa9LJCaXZko&e=, or mute the threadhttps://urldefense.proofpoint.com/v2/url?u=https-3A__github.com_notifications_unsubscribe-2Dauth_AKHHSVIZCI52KOAY5TQ3UKLQIPWYVANCNFSM4FSWQFVA&d=DwMCaQ&c=WO-RGvefibhHBZq3fL85hQ&r=rhdjeCmvErplnjGEj5OoH_CDkPdqB-BDUxqwWC1AzIs&m=_j4fPRwF1uP1-y5FumBtLJ7zVc8cbLD3sC9uQBTN-_s&s=gjo4Wh5mTS4ZPJQMByd2Ugc5FxV8VNutX_mBOKOhZ3k&e=.