filecoin-project / filecoin-plus-large-datasets

Hub for client applications for DataCap at a large scale

[DataCap Application] Hubble Space Telescope Public Data #2313

Open alliswella126 opened 5 months ago

alliswella126 commented 5 months ago

Data Owner Name

Space Telescope Science Institute

What is your role related to the dataset

Data Preparer

Data Owner Country/Region

United States

Data Owner Industry

Environment

Website

https://astroquery.readthedocs.io/en/latest/mast/mast.html#module-astroquery.mast

Social Media

archive@stsci.edu

Total amount of DataCap being requested

8PiB

Expected size of single dataset (one copy)

860TiB

Number of replicas to store

5

Weekly allocation of DataCap requested

1PiB
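As a quick sanity check on the figures above, the sketch below converts the stated single-copy size and replica count into PiB and sets it beside the requested total and weekly rate. It assumes binary units (1 PiB = 1024 TiB); the variable names are illustrative and not part of the application form.

```python
# Figures copied from the application form above.
TIB_PER_PIB = 1024  # assumption: binary units

single_copy_tib = 860
replicas = 5
requested_pib = 8
weekly_pib = 1

# Raw footprint of all replicas, before any padding or packing overhead.
replicated_tib = single_copy_tib * replicas
replicated_pib = replicated_tib / TIB_PER_PIB

# Weeks needed to consume the full request at the stated weekly rate.
weeks_at_weekly_rate = requested_pib / weekly_pib

print(f"{replicated_tib} TiB = {replicated_pib:.2f} PiB across {replicas} replicas")
print(f"{requested_pib} PiB at {weekly_pib} PiB/week = {weeks_at_weekly_rate:.0f} weeks")
```

Five replicas come to roughly 4.2 PiB, which is less than the 8 PiB requested; the form does not explain the difference.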

On-chain address for first allocation

f1rxxy3xsnf6g5yq5maqj5puimxwyrpra33xvatca

Data Type of Application

Public, Open Dataset (Research/Non-Profit)

Custom multisig

Identifier

No response

Share a brief history of your project and organization

The Mikulski Archive for Space Telescopes (MAST) is a NASA-funded project created to collect and archive a variety of scientific data to support the astronomical community. The data housed in MAST includes science and engineering data, with a primary focus on data sets in the optical, ultraviolet, and near-infrared parts of the spectrum, from over 20 space-based missions. MAST offers single mission-based queries as well as cross-mission queries. Astroquery’s astroquery.mast module is one tool used to query and access the data in this Archive.

astroquery.mast offers 3 main services: MastClass, CatalogsClass, and Cutouts. MastClass allows direct programmatic access to the MAST Portal. Along with ObservationsClass, it is used to query MAST observational data. The Catalogs class is used to query MAST catalog data. The available catalogs include the Pan-STARRS and Hubble Source catalogs along with a few others listed under the Catalog Queries section of the astroquery.mast documentation. Lastly, Cutouts, a newer addition to astroquery.mast, provides access to full-frame image cutouts of Transiting Exoplanet Survey Satellite (TESS), MAST Hubble Advanced Product (HAP), and deep-field images, through TesscutClass, HapcutClass, and ZcutClass respectively.
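For illustration, a minimal sketch of querying HST observations through `astroquery.mast` might look like the following. `Observations.query_criteria` is the module's criteria-based query; the helper names `hst_query_criteria` and `find_hst_observations` and the target name are hypothetical choices made for this example, and the query itself needs network access and an installed `astroquery`.

```python
def hst_query_criteria(target_name: str) -> dict:
    """Build keyword criteria for an HST observation query (hypothetical helper)."""
    return {"obs_collection": "HST", "target_name": target_name}


def find_hst_observations(target_name: str):
    """Query MAST for HST observations of a named target (requires network)."""
    # Imported lazily so the criteria helper works without astroquery installed.
    from astroquery.mast import Observations

    # query_criteria returns an astropy Table with one row per observation.
    return Observations.query_criteria(**hst_query_criteria(target_name))
```

Calling `find_hst_observations("M51")` would return a table of matching observations, which could then be passed to `Observations.get_product_list` to list the downloadable files.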

Is this project associated with other projects/ecosystem stakeholders?

No

If answered yes, what are the other projects/ecosystem stakeholders

No response

Describe the data being stored onto Filecoin

The Mikulski Archive for Space Telescopes (MAST) is a NASA-funded project created to collect and archive a variety of scientific data to support the astronomical community. The data housed in MAST includes science and engineering data, with a primary focus on data sets in the optical, ultraviolet, and near-infrared parts of the spectrum, from over 20 space-based missions.

Where was the data currently stored in this dataset sourced from

AWS Cloud

If you answered "Other" in the previous question, enter the details here

No response

If you are a data preparer. What is your location (Country/Region)

United States

If you are a data preparer, how will the data be prepared? Please include tooling used and technical details?

boost and lotus

If you are not preparing the data, who will prepare the data? (Provide name and business)

No response

Has this dataset been stored on the Filecoin network before? If so, please explain and make the case why you would like to store this dataset again to the network. Provide details on preparation and/or SP distribution.

This dataset has been stored on the network before, but only a few times. As important public data, it deserves to be stored again.

Please share a sample of the data

https://registry.opendata.aws/hst/

Confirm that this is a public dataset that can be retrieved by anyone on the Network

If you chose not to confirm, what was the reason

No response

What is the expected retrieval frequency for this data

Sporadic

For how long do you plan to keep this dataset stored on Filecoin

1.5 to 2 years

In which geographies do you plan on making storage deals

Greater China, Asia other than Greater China, North America, South America

How will you be distributing your data to storage providers

HTTP or FTP server, Shipping hard drives

How do you plan to choose storage providers

Slack, Partners

If you answered "Others" in the previous question, what is the tool or platform you plan to use

No response

If you already have a list of storage providers to work with, fill out their names and provider IDs below

f02815448
f02944782
f02888899
f02948257
f01877259
f02812307
f02815451

How do you plan to make deals to your storage providers

Boost client, Lotus client

If you answered "Others/custom tool" in the previous question, enter the details here

No response

Can you confirm that you will follow the Fil+ guideline

Yes

large-datacap-requests[bot] commented 5 months ago

Thanks for your request! Everything looks good. :ok_hand:

A Governance Team member will review the information provided and contact you back pretty soon.

alliswella126 commented 5 months ago

Dear team, having reviewed previous application records, it looks like this form is required, so I have completed it in advance as requested. Thanks!

alliswella126 commented 5 months ago

Update as requested:

f02815448: bluesky
f02944782: Greaterheat
f02888899: SXX
f02948257: Alice Xu
f01877259: Mr Li
f02812307: Alice Xu
f02815451: Tom Liu

Sunnyiscoming commented 5 months ago

SP List provided: [{"providerID": "f02815448", "City": "XYZ", "Country": "USA", "SPOrg": "bluesky"}, {"providerID": "f02888899", "City": "XYZ", "Country": "China", "SPOrg": "SXX"}, {"providerID": "f02944782", "City": "XYZ", "Country": "USA", "SPOrg": "Greaterheat"}, {"providerID": "f02948257", "City": "XYZ", "Country": "SG", "SPOrg": "Alice"}, {"providerID": "f02815451", "City": "XYZ", "Country": "JAPAN", "SPOrg": "Tom"}, {"providerID": "f01877259", "City": "HUNAN", "Country": "CN", "SPOrg": "Mr Li"}, {"providerID": "f02812307", "City": "XYZ", "Country": "SG", "SPOrg": "Alice"}]
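A list of this shape can be loaded and summarized once it is valid JSON (a colon between each key and value, no trailing comma). The sketch below is one way to do that; the field names come from the comment above, while `providers_by_country` is a hypothetical helper and `XYZ` is the placeholder city used in the comment, not a real location.

```python
import json
from collections import Counter

# Valid-JSON rendering of the SP list posted in the comment above.
SP_LIST_JSON = """
[
  {"providerID": "f02815448", "City": "XYZ", "Country": "USA", "SPOrg": "bluesky"},
  {"providerID": "f02888899", "City": "XYZ", "Country": "China", "SPOrg": "SXX"},
  {"providerID": "f02944782", "City": "XYZ", "Country": "USA", "SPOrg": "Greaterheat"},
  {"providerID": "f02948257", "City": "XYZ", "Country": "SG", "SPOrg": "Alice"},
  {"providerID": "f02815451", "City": "XYZ", "Country": "JAPAN", "SPOrg": "Tom"},
  {"providerID": "f01877259", "City": "HUNAN", "Country": "CN", "SPOrg": "Mr Li"},
  {"providerID": "f02812307", "City": "XYZ", "Country": "SG", "SPOrg": "Alice"}
]
"""


def providers_by_country(raw: str) -> Counter:
    """Count providers per country string, exactly as written in the list."""
    providers = json.loads(raw)
    return Counter(p["Country"] for p in providers)


counts = providers_by_country(SP_LIST_JSON)
for country, n in counts.most_common():
    print(country, n)
```

Because the country strings are counted as written, `China` and `CN` appear as separate entries; normalizing them would need a mapping the thread does not provide.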

Sunnyiscoming commented 5 months ago

Datacap Request Trigger

Total DataCap requested

8 PiB

Expected weekly DataCap usage rate

1 PiB

Client address

f1rxxy3xsnf6g5yq5maqj5puimxwyrpra33xvatca

large-datacap-requests[bot] commented 5 months ago

DataCap Allocation requested

Multisig Notary address

f02049625

Client address

f1rxxy3xsnf6g5yq5maqj5puimxwyrpra33xvatca

DataCap allocation requested

409.59TiB

Id

38c9fe88-b7d0-4690-a3a6-ebcba3b33450

kernelogic commented 5 months ago

The client's SP list seems well prepared and well distributed.

kernelogic commented 5 months ago

Request Proposed

Your Datacap Allocation Request has been proposed by the Notary

Message sent to Filecoin Network

bafy2bzacebvqhi2pu7uhh53etdxinrpmkbytc2ae27timvr666j63dqa7qvhe

Address

f1rxxy3xsnf6g5yq5maqj5puimxwyrpra33xvatca

Datacap Allocated

409.59TiB

Signer Address

f1yjhnsoga2ccnepb7t3p3ov5fzom3syhsuinxexa

Id

38c9fe88-b7d0-4690-a3a6-ebcba3b33450

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacebvqhi2pu7uhh53etdxinrpmkbytc2ae27timvr666j63dqa7qvhe

Bitrise0111 commented 5 months ago

Their report looks very healthy, and we'd like to sign for this round.

Bitrise0111 commented 5 months ago

Request Approved

Your Datacap Allocation Request has been approved by the Notary

Message sent to Filecoin Network

bafy2bzaceasojvw6rjq3b52x3zjhp26twpyxxeyml2ndk2pww7e6k6nhk2vec

Address

f1rxxy3xsnf6g5yq5maqj5puimxwyrpra33xvatca

Datacap Allocated

409.59TiB

Signer Address

f1nknj7ayq4o43czrtdoauggtwl43fbqatmqis3yy

Id

38c9fe88-b7d0-4690-a3a6-ebcba3b33450

You can check the status of the message here: https://filfox.info/en/message/bafy2bzaceasojvw6rjq3b52x3zjhp26twpyxxeyml2ndk2pww7e6k6nhk2vec

alliswella126 commented 5 months ago

Ongoing.

github-actions[bot] commented 5 months ago

This application has not seen any responses in the last 10 days. This issue will be marked with Stale label and will be closed in 4 days. Comment if you want to keep this application open.

-- Commented by Stale Bot.

filplus-checker-app[bot] commented 5 months ago

DataCap and CID Checker Report[^1]

No active deals found for this client.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

large-datacap-requests[bot] commented 4 months ago

DataCap Allocation requested

Request number 2

Multisig Notary address

f02049625

Client address

f1rxxy3xsnf6g5yq5maqj5puimxwyrpra33xvatca

DataCap allocation requested

512TiB

Id

f6f1c753-407f-45aa-9c25-147804d697a3

Chuangshi1 commented 4 months ago

checker:manualTrigger

filplus-checker-app[bot] commented 4 months ago

DataCap and CID Checker Report Summary[^1]

Storage Provider Distribution

✔️ Storage provider distribution looks healthy.

Deal Data Replication

✔️ Data replication looks healthy.

Deal Data Shared with other Clients[^3]

✔️ No CID sharing has been observed.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

[^2]: Deals from those addresses are combined into this report as they are specified with checker:manualTrigger

[^3]: To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

Full report

Click here to view the CID Checker report. Click here to view the Retrieval Dashboard.
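The trigger syntax described in the footnotes above (`checker:manualTrigger`, optionally followed by other addresses) could be recognized with a small parser like the one below. This is only a sketch of the comment format as documented in the footnotes; `parse_manual_trigger` is a hypothetical name, and the bot's actual parsing rules are not shown in this thread.

```python
TRIGGER = "checker:manualTrigger"


def parse_manual_trigger(comment: str):
    """Return the extra addresses in a trigger comment, or None if it is not one."""
    tokens = comment.split()
    # Assumption: the trigger keyword must be the first token of the comment.
    if not tokens or tokens[0] != TRIGGER:
        return None
    return tokens[1:]


print(parse_manual_trigger("checker:manualTrigger"))
print(parse_manual_trigger(
    "checker:manualTrigger f1rxxy3xsnf6g5yq5maqj5puimxwyrpra33xvatca"
))
```

A bare trigger yields an empty address list, which corresponds to a report on the application's own client address only.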

Chuangshi1 commented 4 months ago

Request Proposed

Your Datacap Allocation Request has been proposed by the Notary

Message sent to Filecoin Network

bafy2bzacec5fvjkyfy7gadmphxrtbo5peanfcrnc672g6ig32em4limadqpcu

Address

f1rxxy3xsnf6g5yq5maqj5puimxwyrpra33xvatca

Datacap Allocated

512.00TiB

Signer Address

f1mdk7s2vntzm6hu35yuo6vjubtrpfnb2awhgvrri

Id

f6f1c753-407f-45aa-9c25-147804d697a3

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacec5fvjkyfy7gadmphxrtbo5peanfcrnc672g6ig32em4limadqpcu

ipfscn commented 4 months ago

Request Approved

Your Datacap Allocation Request has been approved by the Notary

Message sent to Filecoin Network

bafy2bzacecnzfdephx6ihfwsjwpn63e2i7dhsfge4unsm6tixloxtokq4xgiw

Address

f1rxxy3xsnf6g5yq5maqj5puimxwyrpra33xvatca

Datacap Allocated

512.00TiB

Signer Address

f1j4n74chme7whbz3yls4a7ixqewb6dijypqg2a3a

Id

f6f1c753-407f-45aa-9c25-147804d697a3

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacecnzfdephx6ihfwsjwpn63e2i7dhsfge4unsm6tixloxtokq4xgiw

large-datacap-requests[bot] commented 4 months ago

DataCap Allocation requested

Request number 2

Multisig Notary address

f02049625

Client address

f1rxxy3xsnf6g5yq5maqj5puimxwyrpra33xvatca

DataCap allocation requested

512TiB

Id

cb9c1871-5574-4af7-a40d-43c15e85f026

woshidama323 commented 4 months ago

checker:manualTrigger

filplus-checker-app[bot] commented 4 months ago

DataCap and CID Checker Report Summary[^1]

Storage Provider Distribution

✔️ Storage provider distribution looks healthy.

Deal Data Replication

⚠️ 100.00% of deals are for data replicated across less than 4 storage providers.

Deal Data Shared with other Clients[^3]

✔️ No CID sharing has been observed.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

[^2]: Deals from those addresses are combined into this report as they are specified with checker:manualTrigger

[^3]: To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

Full report

Click here to view the CID Checker report. Click here to view the Retrieval Dashboard.

woshidama323 commented 4 months ago

As this is still the first stage of distribution, I hope you can find a fourth miner to fix this issue in the next round.

Will support for now.

woshidama323 commented 4 months ago

Request Proposed

Your Datacap Allocation Request has been proposed by the Notary

Message sent to Filecoin Network

bafy2bzacedunydm66tsdvzfjv4szaplhnhy4sj6v5fsltlvbaapael6vvfj5g

Address

f1rxxy3xsnf6g5yq5maqj5puimxwyrpra33xvatca

Datacap Allocated

512.00TiB

Signer Address

f12tk3adljauwnd3hjbigpfxb7b7gdlj63p6afwtq

Id

cb9c1871-5574-4af7-a40d-43c15e85f026

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacedunydm66tsdvzfjv4szaplhnhy4sj6v5fsltlvbaapael6vvfj5g

DirectionTechnology commented 4 months ago

Request Approved

Your Datacap Allocation Request has been approved by the Notary

Message sent to Filecoin Network

bafy2bzacectcdcvdj7q3f7wlzvorh4xgdldurkhug2oj7g2hkknoohj4m7s32

Address

f1rxxy3xsnf6g5yq5maqj5puimxwyrpra33xvatca

Datacap Allocated

512.00TiB

Signer Address

f1inkdoatsbfumdvpctxbgcatscewr3rus5pxmsgi

Id

cb9c1871-5574-4af7-a40d-43c15e85f026

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacectcdcvdj7q3f7wlzvorh4xgdldurkhug2oj7g2hkknoohj4m7s32

alliswella126 commented 3 months ago

checker:manualTrigger

filplus-checker-app[bot] commented 3 months ago

DataCap and CID Checker Report Summary[^1]

Storage Provider Distribution

✔️ Storage provider distribution looks healthy.

Deal Data Replication

✔️ Data replication looks healthy.

Deal Data Shared with other Clients[^3]

✔️ No CID sharing has been observed.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

[^2]: Deals from those addresses are combined into this report as they are specified with checker:manualTrigger

[^3]: To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

Full report

Click here to view the CID Checker report. Click here to view the Retrieval Dashboard.

large-datacap-requests[bot] commented 3 months ago

DataCap Allocation requested

Request number 4

Multisig Notary address

f02049625

Client address

f1rxxy3xsnf6g5yq5maqj5puimxwyrpra33xvatca

DataCap allocation requested

2PiB

Id

932ffcdd-7d95-40d5-8dbd-1b7bf05dd5e7
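The allocation rounds in this thread can be tallied against the 8 PiB total with a few lines of arithmetic. The sketch assumes binary units (1 PiB = 1024 TiB) and counts only the three rounds that show both a proposal and an approval above; the variable names are illustrative.

```python
TIB_PER_PIB = 1024  # assumption: binary units

total_requested_tib = 8 * TIB_PER_PIB

# Rounds that were both proposed and approved earlier in the thread.
approved_tib = [409.59, 512.0, 512.0]

# Request number 4, requested by the bot but not yet signed at this point.
pending_tib = 2 * TIB_PER_PIB

approved_total = sum(approved_tib)
remaining_after_pending = total_requested_tib - approved_total - pending_tib

print(f"approved so far: {approved_total:.2f} TiB")
print(f"left after the pending round: {remaining_after_pending:.2f} TiB")
```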

filplus-checker-app[bot] commented 3 months ago

DataCap and CID Checker Report Summary[^1]

Storage Provider Distribution

⚠️ 1 storage providers sealed more than 30% of total datacap - f02970880: 57.17%

⚠️ 1 storage providers have unknown IP location - f02970880

Deal Data Replication

⚠️ 52.85% of deals are for data replicated across less than 4 storage providers.

Deal Data Shared with other Clients[^3]

✔️ No CID sharing has been observed.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

[^2]: Deals from those addresses are combined into this report as they are specified with checker:manualTrigger

[^3]: To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

Full report

Click here to view the CID Checker report. Click here to view the Retrieval Dashboard.
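The two warnings in the report above refer to a provider's share of sealed DataCap (flagged above 30%) and to data replicated across fewer than 4 storage providers. The sketch below shows one way such metrics could be computed. It is not the checker's implementation: the deal records and provider IDs are made up, and weighting the replication figure by deal size is an assumption.

```python
from collections import defaultdict

# Hypothetical deals: (provider ID, piece CID, size in GiB). Not on-chain data.
deals = [
    ("f0A", "piece1", 32), ("f0B", "piece1", 32),
    ("f0C", "piece1", 32), ("f0D", "piece1", 32),
    ("f0A", "piece2", 32), ("f0B", "piece2", 32),
    ("f0A", "piece3", 32),
    ("f0A", "piece4", 32),
]


def provider_shares(deals):
    """Fraction of total deal size sealed by each provider."""
    totals = defaultdict(float)
    for provider, _piece, size in deals:
        totals[provider] += size
    grand_total = sum(totals.values())
    return {p: s / grand_total for p, s in totals.items()}


def low_replication_share(deals, min_providers=4):
    """Fraction of deal size whose piece is held by fewer than min_providers."""
    holders = defaultdict(set)
    for provider, piece, _size in deals:
        holders[piece].add(provider)
    total = sum(size for _p, _c, size in deals)
    low = sum(size for _p, piece, size in deals
              if len(holders[piece]) < min_providers)
    return low / total


shares = provider_shares(deals)
over_limit = [p for p, s in shares.items() if s > 0.30]
print("providers over 30%:", over_limit)
print(f"low-replication share: {low_replication_share(deals):.2%}")
```

In this made-up data, one provider holds half of the total and half of the deal volume sits on pieces with fewer than four holders, so both warnings would fire.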

AlanGreaterheat commented 3 months ago

checker:manualTrigger

filplus-checker-app[bot] commented 3 months ago

DataCap and CID Checker Report Summary[^1]

Storage Provider Distribution

⚠️ 1 storage providers sealed more than 30% of total datacap - f02970880: 57.17%

Deal Data Replication

⚠️ 52.85% of deals are for data replicated across less than 4 storage providers.

Deal Data Shared with other Clients[^3]

✔️ No CID sharing has been observed.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

[^2]: Deals from those addresses are combined into this report as they are specified with checker:manualTrigger

[^3]: To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

Full report

Click here to view the CID Checker report. Click here to view the Retrieval Dashboard.