NASA-Openscapes / earthdata-cloud-cookbook

A tutorial book of workflows for research using NASA EarthData in the Cloud created by the NASA-Openscapes team
https://nasa-openscapes.github.io/earthdata-cloud-cookbook
Other
87 stars 32 forks source link

Create content for the Get Started page #194

Closed jessnicwelch closed 1 year ago

jessnicwelch commented 1 year ago

Discussed in https://github.com/NASA-Openscapes/earthdata-cloud-cookbook/discussions/193

Originally posted by **jessnicwelch** April 5, 2023 Jess's comments and draft MD. ``` title: “Get Started” subtitle: “Get Started Working with Data in the Cloud” date: last-modified author: “NASA Openscapes Team” citation_url: https://nasa-openscapes.github.io/earthdata-cloud-cookbook/get-started slug: index --- **Proposed Sources of Information** * Understanding the Cloud * Selected content from “Cheatsheets, Guides, & Slides” page * API primer * Authentication * Earthdata Login * earthaccess EDL programmatic login * .netrc creation? (what contexts will this be necessary?) * How to Code * Selected content from “Our Cookbook” page * Python installation/environment setup * R installation/environment setup ``` OR some of the “background” info listed above can go into a new section and Get Started is pertinent only to the technical aspects of code/cloud environments. Either way, I think this would replace portions of the “How to...” pages.
jessnicwelch commented 1 year ago

See https://github.com/NASA-Openscapes/earthdata-cloud-cookbook/pull/198

Materials to add to other pages

About the Cloud

Maybe you’ve heard that NASA Earthdata is “moving to the cloud” but you want to know why. You can read the details of the Earthdata Cloud Evolution, but here we summarize the benefits of the cloud and additional resources on its use and history. In short, the cloud will make the download of data unnecessary, allow for data processing and manipulation without new software purchases or installation, and, ultimately, reduce the amount of time it takes to get the data needed to do science.

Amazon Web Services

NASA’s Office of the Chief Information Officer chose Amazon Web Services (AWS) as the source of general-purpose cloud services (but some areas within NASA are working with Google Earth Engine (GEE) to make NASA data accessible in the GEE cloud-based analysis platform). The following resources provide a background on AWS, but much of the information is relevant to folks who want to develop in the cloud rather than simply access data. Remember, all NASA’s science information (including the algorithms, metadata, and documentation associated with science mission data) must be freely available to the public. This means that anyone, anywhere in the world, can access NASA Earth science data without restriction. However, advanced cloud operations could require a user to set-up their own cloud account through AWS or another cloud provider.

Cloud Optimized Data Formats

Traditional file formats can easily be migrated to the cloud, but serving or processing the data from the cloud is inefficient and often requires that the data be downloaded and then translated to another format and stored in memory. Cloud optimized formats are being developed to better serve analysis-in-place workflows that make the cloud so beneficial to science users.

Environments

Development on AWS

Python

R

Additional Coding Resources

Python

R

jules32 commented 1 year ago

In #199, I:

jules32 commented 1 year ago

I think we could probably close this issue, what do you think @jessnicwelch ?