filecoin-project / filecoin-plus-large-datasets

Hub for client applications for DataCap at a large scale
109 stars 62 forks source link

[DataCap Application] Hubble Space Telescope Public Data #1573

Closed StoweRudge closed 1 year ago

StoweRudge commented 1 year ago

Data Owner Name

Space Telescope Science Institute

Data Owner Country/Region

United States

Data Owner Industry

Environment

Website

https://astroquery.readthedocs.io/en/latest/mast/mast.html#module-astroquery.mast

Social Media

archive@stsci.edu

Total amount of DataCap being requested

5PiB

Weekly allocation of DataCap requested

400TiB

On-chain address for first allocation

f1tbgm7baejvc5hmzbbeqnuhe7yq6s4e3dwrlacii

Custom multisig

Identifier

No response

Share a brief history of your project and organization

I am a participant who follows the filecoin community. I want to get involved in filecoin deeply by storing useful public datasets first.

The Space Telescope Science Institute (STScI) is operated by the Association of Universities for Research in Astronomy (AURA) with the goal of helping humanity explore the universe with advanced space telescopes and ever-growing data archives.

Is this project associated with other projects/ecosystem stakeholders?

Yes

If answered yes, what are the other projects/ecosystem stakeholders

No response

Describe the data being stored onto Filecoin

The Hubble Space Telescope (HST) is one of the most productive scientific instruments ever created. This dataset contains calibrated and raw data for all of the currently active instruments on HST: ACS, COS, STIS and WFC3.

Where was the data currently stored in this dataset sourced from

AWS Cloud

If you answered "Other" in the previous question, enter the details here

No response

How do you plan to prepare the dataset

lotus

If you answered "other/custom tool" in the previous question, enter the details here

No response

Please share a sample of the data

https://registry.opendata.aws/hst/

Confirm that this is a public dataset that can be retrieved by anyone on the Network

If you chose not to confirm, what was the reason

No response

What is the expected retrieval frequency for this data

Sporadic

For how long do you plan to keep this dataset stored on Filecoin

2 to 3 years

In which geographies do you plan on making storage deals

Greater China, Asia other than Greater China, North America, South America, Europe, Australia (continent)

How will you be distributing your data to storage providers

Cloud storage (i.e. S3), HTTP or FTP server, IPFS, Shipping hard drives

How do you plan to choose storage providers

Slack, Big data exchange

If you answered "Others" in the previous question, what is the tool or platform you plan to use

No response

If you already have a list of storage providers to work with, fill out their names and provider IDs below

No response

How do you plan to make deals to your storage providers

Lotus client, Singularity

If you answered "Others/custom tool" in the previous question, enter the details here

No response

Can you confirm that you will follow the Fil+ guideline

Yes

cryptowhizzard commented 1 year ago

Dear applicant,

Thank you for applying for datacap. As Filecoin FIL+ notary i am screening your application and conducting due diligence.

Looking at your application i have some questions: As you are brand new on Github and have no history of past applications it seems to me that applying for 5PB of datacap is a lot. One needs comprehensive knowledge of Filecoin, packing of data, distribution of data and all it's requirements coming with it. Are you brand new in the Filecoin space or have you applied for datacap in the past on different Github account names?

Can you show us visible proof of the size of your data and the storage systems you have there?

As last question i would like you to fill out this form to provide us with the necessary information to make a educated decision on your LDN request if we would like to support it.

Thanks!

large-datacap-requests[bot] commented 1 year ago

Thanks for your request!

Heads up, you’re requesting more than the typical weekly onboarding rate of DataCap!
large-datacap-requests[bot] commented 1 year ago

Thanks for your request! Everything looks good. :ok_hand:

A Governance Team member will review the information provided and contact you back pretty soon.

herrehesse commented 1 year ago

@StoweRudge Awesome application, looking forward to your KYC form!

Sunnyiscoming commented 1 year ago

Can you provide more detailed information about Sps you will cooperate with?

caoyoungyoung commented 1 year ago

you answered 'yes', so have to list all the stakeholders associated with 22

StoweRudge commented 1 year ago

@caoyoungyoung That was a mistake, and I've fixed it. Thanks!

simonkim0515 commented 1 year ago

Datacap Request Trigger

Total DataCap requested

5PiB

Expected weekly DataCap usage rate

400TiB

Client address

f1tbgm7baejvc5hmzbbeqnuhe7yq6s4e3dwrlacii

large-datacap-requests[bot] commented 1 year ago

DataCap Allocation requested

Multisig Notary address

f02049625

Client address

f1tbgm7baejvc5hmzbbeqnuhe7yq6s4e3dwrlacii

DataCap allocation requested

200TiB

Id

bd8ed51d-e4a1-4bff-9cb9-4794f16280a7

Casey-PG commented 1 year ago

willing to support in the 1st round.

Casey-PG commented 1 year ago

Request Proposed

Your Datacap Allocation Request has been proposed by the Notary

Message sent to Filecoin Network

bafy2bzacecxzlqsralxu75os7bbvjxsdbfbxeqzv2k7enedqnthef5t3c22ha

Address

f1tbgm7baejvc5hmzbbeqnuhe7yq6s4e3dwrlacii

Datacap Allocated

200.00TiB

Signer Address

f1d4yb3wags3mtddzesxoo63jv7dmlec3bq4yteni

Id

bd8ed51d-e4a1-4bff-9cb9-4794f16280a7

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacecxzlqsralxu75os7bbvjxsdbfbxeqzv2k7enedqnthef5t3c22ha

large-datacap-requests[bot] commented 1 year ago

We have found some problems in the information provided in the Approved Comment. We could not find Id** field in the information provided

Please, take a look at the comment and edit the body of the comment providing all the required information.
large-datacap-requests[bot] commented 1 year ago

We have found some problems in the information provided in the Approved Comment. We could not find Id** field in the information provided

Please, take a look at the comment and edit the body of the comment providing all the required information.
large-datacap-requests[bot] commented 1 year ago

Looks like the bot was not able to retrieve the transaction on the lotus node. Please contact governance team. The message cid: bafy2bzacecxzlqsralxu75os7bbvjxsdbfbxeqzv2k7enedqnthef5t3c22ha

Please, contact the governance team.
large-datacap-requests[bot] commented 1 year ago

Looks like the bot was not able to retrieve the transaction on the lotus node. Please contact governance team. The message cid: bafy2bzacecxzlqsralxu75os7bbvjxsdbfbxeqzv2k7enedqnthef5t3c22ha

Please, contact the governance team.
filplus-checker-app[bot] commented 1 year ago

DataCap and CID Checker Report[^1]

No active deals found for this client.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

Suyanj commented 1 year ago

Request Proposed

Your Datacap Allocation Request has been proposed by the Notary

Message sent to Filecoin Network

bafy2bzaceapw5w67tqlhxhqogvkdxjgppwad2ndqkhvz2zzhatztudfqyfku2

Address

f1tbgm7baejvc5hmzbbeqnuhe7yq6s4e3dwrlacii

Datacap Allocated

200.00TiB

Signer Address

f1ihv7gz3vn3xqvikpt4rwryecgisl7745lodx3yi

Id

You can check the status of the message here: https://filfox.info/en/message/bafy2bzaceapw5w67tqlhxhqogvkdxjgppwad2ndqkhvz2zzhatztudfqyfku2

AthSmith commented 1 year ago

Request Approved

Your Datacap Allocation Request has been approved by the Notary

Message sent to Filecoin Network

bafy2bzaceagbvprrie5ywaodxpwahqsomej7xfbrbbpo6jacx7mm7inn6fj3s

Address

f1tbgm7baejvc5hmzbbeqnuhe7yq6s4e3dwrlacii

Datacap Allocated

200.00TiB

Signer Address

f1vxbqrf7rfum3n6m5u6eb4re6xj7amvsaqnzu64y

Id

You can check the status of the message here: https://filfox.info/en/message/bafy2bzaceagbvprrie5ywaodxpwahqsomej7xfbrbbpo6jacx7mm7inn6fj3s

large-datacap-requests[bot] commented 1 year ago

DataCap Allocation requested

Request number 3

Multisig Notary address

f02049625

Client address

f1tbgm7baejvc5hmzbbeqnuhe7yq6s4e3dwrlacii

DataCap allocation requested

800TiB

Id

38faf2a6-33ac-4168-b3c0-b6a01fa6ccb8

large-datacap-requests[bot] commented 1 year ago

Stats & Info for DataCap Allocation

Multisig Notary address

f02049625

Client address

f1tbgm7baejvc5hmzbbeqnuhe7yq6s4e3dwrlacii

Rule to calculate the allocation request amount

200% of weekly dc amount requested

DataCap allocation requested

800TiB

Total DataCap granted for client so far

181898.9YiB

Datacap to be granted to reach the total amount requested by the client (5PiB)

-2.19B

Stats

Number of deals Number of storage providers Previous DC Allocated Top provider Remaining DC
6427 4 200TiB 29.78 73.28TiB
BobbyChoii commented 1 year ago

Request Proposed

Your Datacap Allocation Request has been proposed by the Notary

Message sent to Filecoin Network

bafy2bzacedetvl4ynecgba276y2lzfue7dksqoxnltrjbcbvkh42jkq5uv7dy

Address

f1tbgm7baejvc5hmzbbeqnuhe7yq6s4e3dwrlacii

Datacap Allocated

800.00TiB

Signer Address

f1irqs2gmctiv3jcdfwuch7oxvf4ixh3k4b2wc24i

Id

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacedetvl4ynecgba276y2lzfue7dksqoxnltrjbcbvkh42jkq5uv7dy

filplus-checker-app[bot] commented 1 year ago

DataCap and CID Checker Report Summary[^1]

Storage Provider Distribution

✔️ Storage provider distribution looks healthy.

Deal Data Replication

✔️ Data replication looks healthy.

Deal Data Shared with other Clients[^3]

✔️ No CID sharing has been observed.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

[^2]: Deals from those addresses are combined into this report as they are specified with checker:manualTrigger

[^3]: To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

Full report

Click here to view the full report.

large-datacap-requests[bot] commented 1 year ago

DataCap Allocation requested

Request number 4

Multisig Notary address

f02049625

Client address

f1tbgm7baejvc5hmzbbeqnuhe7yq6s4e3dwrlacii

DataCap allocation requested

1.56PiB

Id

cf7b4506-bfc2-436f-bee8-1170c0cdf941

large-datacap-requests[bot] commented 1 year ago

Stats & Info for DataCap Allocation

Multisig Notary address

f02049625

Client address

f1tbgm7baejvc5hmzbbeqnuhe7yq6s4e3dwrlacii

Rule to calculate the allocation request amount

400% of weekly dc amount requested

DataCap allocation requested

1.56PiB

Total DataCap granted for client so far

727595761418342760448.0YiB

Datacap to be granted to reach the total amount requested by the client (5PiB)

727595761418342760448.0YiB

Stats

Number of deals Number of storage providers Previous DC Allocated Top provider Remaining DC
19736 7 800TiB 28 201.75TiB
Bennyyangpu commented 1 year ago

Request Proposed

Your Datacap Allocation Request has been proposed by the Notary

Message sent to Filecoin Network

bafy2bzacedfgvy5sph5qhvmvdwxsmy6w5atxtzlyrrzzdh7t2rhpg2nx2ydnk

Address

f1tbgm7baejvc5hmzbbeqnuhe7yq6s4e3dwrlacii

Datacap Allocated

1.56PiB

Signer Address

f174fg3bqbln3zjnkxtyf6s54txqkr7yqkj6cig7y

Id

cf7b4506-bfc2-436f-bee8-1170c0cdf941

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacedfgvy5sph5qhvmvdwxsmy6w5atxtzlyrrzzdh7t2rhpg2nx2ydnk

AthSmith commented 1 year ago

Request Approved

Your Datacap Allocation Request has been approved by the Notary

Message sent to Filecoin Network

bafy2bzacedccluqqqa3duc3fxxr3pqh6ygw7bczsi43dmrv5e34kxukznb6ay

Address

f1tbgm7baejvc5hmzbbeqnuhe7yq6s4e3dwrlacii

Datacap Allocated

1.56PiB

Signer Address

f1vxbqrf7rfum3n6m5u6eb4re6xj7amvsaqnzu64y

Id

cf7b4506-bfc2-436f-bee8-1170c0cdf941

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacedccluqqqa3duc3fxxr3pqh6ygw7bczsi43dmrv5e34kxukznb6ay

AthSmith commented 1 year ago

Public data with clear allocation plan, we are willing to help onboard more valuable dataset.

large-datacap-requests[bot] commented 1 year ago

DataCap Allocation requested

Request number 5

Multisig Notary address

f02049625

Client address

f1tbgm7baejvc5hmzbbeqnuhe7yq6s4e3dwrlacii

DataCap allocation requested

1.56PiB

Id

13e0a7c6-08cf-4b02-a1d1-2cdcfda91f2d

large-datacap-requests[bot] commented 1 year ago

Stats & Info for DataCap Allocation

Multisig Notary address

f02049625

Client address

f1tbgm7baejvc5hmzbbeqnuhe7yq6s4e3dwrlacii

Rule to calculate the allocation request amount

400% of weekly dc amount requested

DataCap allocation requested

1.56PiB

Total DataCap granted for client so far

1.4528632164001478e+36YiB

Datacap to be granted to reach the total amount requested by the client (5PiB)

1.4528632164001478e+36YiB

Stats

Number of deals Number of storage providers Previous DC Allocated Top provider Remaining DC
56681 11 1.56PiB 31.44 396.62TiB
spaceT9 commented 1 year ago

checker:manualTrigger

filplus-checker-app[bot] commented 1 year ago

DataCap and CID Checker Report Summary[^1]

Retrieval Statistics

⚠️ All retrieval success ratios are below 1%.

Storage Provider Distribution

⚠️ 1 storage providers sealed more than 30% of total datacap - f0420161: 30.00%

Deal Data Replication

⚠️ 68.61% of deals are for data replicated across less than 4 storage providers.

Deal Data Shared with other Clients[^3]

✔️ No CID sharing has been observed.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

[^2]: Deals from those addresses are combined into this report as they are specified with checker:manualTrigger

[^3]: To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

Full report

Click here to view the CID Checker report. Click here to view the Retrieval report.

spaceT9 commented 1 year ago

checker:manualTrigger

filplus-checker-app[bot] commented 1 year ago

DataCap and CID Checker Report Summary[^1]

Retrieval Statistics

⚠️ All retrieval success ratios are below 1%.

Storage Provider Distribution

✔️ Storage provider distribution looks healthy.

Deal Data Replication

⚠️ 55.41% of deals are for data replicated across less than 4 storage providers.

Deal Data Shared with other Clients[^3]

✔️ No CID sharing has been observed.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

[^2]: Deals from those addresses are combined into this report as they are specified with checker:manualTrigger

[^3]: To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

Full report

Click here to view the CID Checker report. Click here to view the Retrieval Dashboard. Click here to view the Retrieval report.

spaceT9 commented 1 year ago

Increase retrieval rate before next tranche please

github-actions[bot] commented 1 year ago

This application has not seen any responses in the last 10 days. This issue will be marked with Stale label and will be closed in 4 days. Comment if you want to keep this application open.

StoweRudge commented 1 year ago

checker:manualTrigger

filplus-checker-app[bot] commented 1 year ago

DataCap and CID Checker Report Summary[^1]

Retrieval Statistics

Storage Provider Distribution

✔️ Storage provider distribution looks healthy.

Deal Data Replication

⚠️ 55.41% of deals are for data replicated across less than 4 storage providers.

Deal Data Shared with other Clients[^3]

✔️ No CID sharing has been observed.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

[^2]: Deals from those addresses are combined into this report as they are specified with checker:manualTrigger

[^3]: To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

Full report

Click here to view the CID Checker report. Click here to view the Retrieval Dashboard. Click here to view the Retrieval report.

StoweRudge commented 1 year ago

@spaceT9 ok! We are paying time on updating.

TakiChain commented 1 year ago

checker:manualTrigger

filplus-checker-app[bot] commented 1 year ago

DataCap and CID Checker Report Summary[^1]

Retrieval Statistics

Storage Provider Distribution

✔️ Storage provider distribution looks healthy.

Deal Data Replication

⚠️ 55.41% of deals are for data replicated across less than 4 storage providers.

Deal Data Shared with other Clients[^3]

✔️ No CID sharing has been observed.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

[^2]: Deals from those addresses are combined into this report as they are specified with checker:manualTrigger

[^3]: To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

Full report

Click here to view the CID Checker report. Click here to view the Retrieval Dashboard. Click here to view the Retrieval report.

TakiChain commented 1 year ago

Glad to see the improvement in your reporting, keep up the good work.

TakiChain commented 1 year ago

Request Proposed

Your Datacap Allocation Request has been proposed by the Notary

Message sent to Filecoin Network

bafy2bzaceacvd4wpn7cjoyuqhcb67yh4bghrh6ccfafyy3nr3b75mcynlfck4

Address

f1tbgm7baejvc5hmzbbeqnuhe7yq6s4e3dwrlacii

Datacap Allocated

1.56PiB

Signer Address

f15impf3j2zcaex4lhyxndxswuuhv24vzstuqtxsi

Id

13e0a7c6-08cf-4b02-a1d1-2cdcfda91f2d

You can check the status of the message here: https://filfox.info/en/message/bafy2bzaceacvd4wpn7cjoyuqhcb67yh4bghrh6ccfafyy3nr3b75mcynlfck4

MEIYAN666 commented 1 year ago

Request Approved

Your Datacap Allocation Request has been approved by the Notary

Message sent to Filecoin Network

bafy2bzacedqtio4eosty42fhm5dfwtfzaz73rwypx5jurvgbq7fgcgmmnuqss

Address

f1tbgm7baejvc5hmzbbeqnuhe7yq6s4e3dwrlacii

Datacap Allocated

1.56PiB

Signer Address

f1bwugfihrmn3iyunzyxst5nttql3dge4khwmurtq

Id

13e0a7c6-08cf-4b02-a1d1-2cdcfda91f2d

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacedqtio4eosty42fhm5dfwtfzaz73rwypx5jurvgbq7fgcgmmnuqss

ghost commented 1 year ago

Hello @StoweRudge Per the https://github.com/filecoin-project/notary-governance/issues/922 for Open, Public Dataset applicants, please complete the following Fil+ registration form to identify yourself as the applicant and also please add the contact information of the SP entities you are working with to store copies of the data.

This information will be reviewed by Fil+ Governance team to confirm validity and then the application will open for notary review. Let us know if you have any questions.

github-actions[bot] commented 1 year ago

This application has not seen any responses in the last 10 days. This issue will be marked with Stale label and will be closed in 4 days. Comment if you want to keep this application open.

-- Commented by Stale Bot.

large-datacap-requests[bot] commented 1 year ago

DataCap Allocation requested

Request number 6

Multisig Notary address

f02049625

Client address

f1tbgm7baejvc5hmzbbeqnuhe7yq6s4e3dwrlacii

DataCap allocation requested

725.11TiB

Id

313e2594-72dc-4015-8879-bf3ebe06b801

large-datacap-requests[bot] commented 1 year ago

Stats & Info for DataCap Allocation

Multisig Notary address

f02049625

Client address

f1tbgm7baejvc5hmzbbeqnuhe7yq6s4e3dwrlacii

Rule to calculate the allocation request amount

400% of weekly dc amount requested

DataCap allocation requested

725.11TiB

Total DataCap granted for client so far

1.452863216400147e+52YiB

Datacap to be granted to reach the total amount requested by the client (5PiB)

1.452863216400147e+52YiB

Stats

Number of deals Number of storage providers Previous DC Allocated Top provider Remaining DC
103167 18 1.56PiB 18.58 394.34TiB
Bennyyangpu commented 1 year ago

checker:manualTrigger

filplus-checker-app[bot] commented 1 year ago

DataCap and CID Checker Report Summary[^1]

Retrieval Statistics

Storage Provider Distribution

✔️ Storage provider distribution looks healthy.

Deal Data Replication

⚠️ 54.78% of deals are for data replicated across less than 4 storage providers.

Deal Data Shared with other Clients[^3]

✔️ No CID sharing has been observed.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

[^2]: Deals from those addresses are combined into this report as they are specified with checker:manualTrigger

[^3]: To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

Full report

Click here to view the CID Checker report. Click here to view the Retrieval Dashboard. Click here to view the Retrieval report.

Bennyyangpu commented 1 year ago

We're willing to help support the application. Please follow the plan you have shared. @StoweRudge
"Deal Data Replication" is not good as we expect.