filecoin-project / filecoin-plus-large-datasets

Hub for client applications for DataCap at a large scale

[DataCap Application] Data Conservation House - NOAA Geostationary Operational Environmental Satellites (GOES) 16, 17 & 18 (2/4) #1489

Closed dch-club closed 1 year ago

dch-club commented 1 year ago

Data Owner Name

The National Oceanic and Atmospheric Administration

Data Owner Country/Region

United States

Data Owner Industry

Environment

Website

https://github.com/awslabs/open-data-docs/tree/main/docs/noaa/noaa-goes16

Social Media

https://www.dch.club/ ; https://twitter.com/dch_club

Total amount of DataCap being requested

5PiB

Weekly allocation of DataCap requested

500TiB

On-chain address for first allocation

f1bijrzvlxouvqdvtep5t5uodqppbj524vzzdy53i

Custom multisig

Identifier

No response

Share a brief history of your project and organization

Data Conservation House aims to preserve humanity's most important information by encouraging data communities that are not Filecoin-native to onboard more public data onto decentralized storage solutions.
Our goal is to make it technically easy for data communities around the world to onboard public data onto the Filecoin network through our platform. In the long run, we hope to help our users automate the LDN application and data preparation processes.
Data Conservation House does not intend to profit from LDN applications. We will donate any proceeds to data communities or use them to fund activities that onboard more useful data onto Filecoin.

Is this project associated with other projects/ecosystem stakeholders?

Yes

If answered yes, what are the other projects/ecosystem stakeholders

BigData Exchange. Data Conservation House will list all verified deals on BigData Exchange so that all deal making processes are transparent.

Describe the data being stored onto Filecoin

The National Oceanic and Atmospheric Administration (NOAA) operates a constellation of Geostationary Operational Environmental Satellites (GOES) to provide continuous weather imagery and monitoring of meteorological and space environment data for the protection of life and property across the United States. GOES satellites provide critical atmospheric, oceanic, climatic and space weather products supporting weather forecasting and warnings, climatologic analysis and prediction, ecosystems management, safe and efficient public and private transportation, and other national priorities.

The satellites provide advanced imaging with increased spatial resolution, 16 spectral channels, and up to 1 minute scan frequency for more accurate forecasts and timely warnings.

This dataset contains three S3 buckets:
$ aws s3 ls --no-sign-request s3://noaa-goes16/ --recursive --human-readable --summarize
952 TiB
$ aws s3 ls --no-sign-request s3://noaa-goes17/ --recursive --human-readable --summarize
769 TiB
$ aws s3 ls --no-sign-request s3://noaa-goes18/ --recursive --human-readable --summarize
63 TiB

Considering 10 replicas, we are applying for 20PiB in total across 4 applications.
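
The sizing above can be checked with a little arithmetic. The sketch below is only an illustration: it takes the three bucket sizes and the replica count from this application, assumes binary units (1 PiB = 1024 TiB), and assumes each of the 4 applications requests the same 5 PiB as this one. The variable names are made up for the example.

```python
# Bucket sizes in TiB, as reported by the listings in this application.
bucket_sizes_tib = {
    "noaa-goes16": 952,
    "noaa-goes17": 769,
    "noaa-goes18": 63,
}

REPLICAS = 10                      # replica count stated in the application
TIB_PER_PIB = 1024                 # binary units
APPLICATIONS = 4                   # "across 4 applications"
REQUEST_PER_APPLICATION_PIB = 5    # assumed equal to this application's request

# Size of one full copy of the three buckets.
single_copy_tib = sum(bucket_sizes_tib.values())

# Size of all replicas, converted to PiB.
replicated_pib = single_copy_tib * REPLICAS / TIB_PER_PIB

# Total DataCap requested across all applications.
total_requested_pib = APPLICATIONS * REQUEST_PER_APPLICATION_PIB

print(f"single copy:   {single_copy_tib} TiB")
print(f"{REPLICAS} replicas:   {replicated_pib:.2f} PiB")
print(f"requested:     {total_requested_pib} PiB")
print(f"headroom:      {total_requested_pib - replicated_pib:.2f} PiB")
```

Under these assumptions, one copy is 1784 TiB, ten replicas come to about 17.42 PiB, and the 20 PiB request leaves roughly 2.58 PiB of headroom. The application does not say how that remainder is expected to be used.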

Where was the data currently stored in this dataset sourced from

AWS Cloud

If you answered "Other" in the previous question, enter the details here

No response

How do you plan to prepare the dataset

singularity

If you answered "other/custom tool" in the previous question, enter the details here

No response

Please share a sample of the data

https://registry.opendata.aws/noaa-goes/

Confirm that this is a public dataset that can be retrieved by anyone on the Network

If you chose not to confirm, what was the reason

No response

What is the expected retrieval frequency for this data

Sporadic

For how long do you plan to keep this dataset stored on Filecoin

Permanently

In which geographies do you plan on making storage deals

Greater China, Asia other than Greater China, Africa, North America, South America, Europe, Australia (continent), Antarctica

How will you be distributing your data to storage providers

HTTP or FTP server

How do you plan to choose storage providers

Big data exchange

If you answered "Others" in the previous question, what is the tool or platform you plan to use

No response

If you already have a list of storage providers to work with, fill out their names and provider IDs below

No response

How do you plan to make deals to your storage providers

Boost client, Lotus client

If you answered "Others/custom tool" in the previous question, enter the details here

No response

Can you confirm that you will follow the Fil+ guideline

Yes

large-datacap-requests[bot] commented 1 year ago

Thanks for your request!

Heads up, you’re requesting more than the typical weekly onboarding rate of DataCap!

large-datacap-requests[bot] commented 1 year ago

Thanks for your request! Everything looks good. :ok_hand:

A Governance Team member will review the information provided and get back to you soon.

raghavrmadya commented 1 year ago

DataCap Request Trigger

Total DataCap requested

5PiB

Expected weekly DataCap usage rate

500TiB

Client address

f1bijrzvlxouvqdvtep5t5uodqppbj524vzzdy53i

large-datacap-requests[bot] commented 1 year ago

DataCap Allocation requested

Multisig Notary address

f02049625

Client address

f1bijrzvlxouvqdvtep5t5uodqppbj524vzzdy53i

DataCap allocation requested

250TiB

Id

344f5a03-2257-47cb-9b4f-f474b7fd4daa
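
The thread does not explain how the 250TiB figure was derived. One possible reading, sketched below, is that the first allocation is the lesser of 5% of the total request and 50% of the weekly rate. That rule is an assumption here, not something stated in this issue, and the function name is hypothetical.

```python
TIB_PER_PIB = 1024  # binary units


def first_allocation_tib(total_pib, weekly_tib):
    """Assumed rule: lesser of 5% of the total request and 50% of the weekly rate.

    This rule is an assumption made for illustration; it is not stated
    anywhere in this issue thread.
    """
    five_percent_of_total = total_pib * TIB_PER_PIB * 5 / 100
    half_of_weekly = weekly_tib / 2
    return min(five_percent_of_total, half_of_weekly)


# Values from this application: 5PiB total, 500TiB weekly.
print(first_allocation_tib(5, 500))
```

With the figures from this application, 5% of 5PiB is 256 TiB and half of the weekly rate is 250 TiB, so the assumed rule yields 250 TiB, which matches the allocation requested by the bot.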

filplus-checker-app[bot] commented 1 year ago

DataCap and CID Checker Report[^1]

There is no previous allocation for this issue.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

kernelogic commented 1 year ago

The owner of https://github.com/filecoin-project/filecoin-plus-large-datasets/issues/391 has informed me that this application is also theirs. Willing to support.

github-actions[bot] commented 1 year ago

This application has not seen any responses in the last 10 days. This issue will be marked with the Stale label and will be closed in 4 days. Comment if you want to keep this application open.

github-actions[bot] commented 1 year ago

This application has not seen any responses in the last 14 days, so for now it is being closed. Please feel free to contact the Fil+ Gov team to re-open the application if it is still being processed. Thank you!