filecoin-project / filecoin-plus-large-datasets

Hub for client applications for DataCap at a large scale

[DataCap Application] NASA/USGS #2163

Open TaylorOshan opened 1 year ago

TaylorOshan commented 1 year ago

Data Owner Name

NASA/USGS

What is your role related to the dataset

Data Preparer

Data Owner Country/Region

United States

Data Owner Industry

Environment

Website

https://www.usgs.gov/landsat-missions/landsat-9

Social Media

[at]USGSLandsat (twitter)

Total amount of DataCap being requested

1500 TiB

Expected size of single dataset (one copy)

275 TiB

Number of replicas to store

5

Weekly allocation of DataCap requested

250 TiB
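As a rough check on the figures above, here is a minimal sketch (my own arithmetic, not part of the application form) that multiplies the single-copy size by the replica count and estimates how many weekly allocations the full request would span. The function names are illustrative only.

```python
import math


def total_for_replicas(single_copy_tib, replicas):
    """DataCap needed to store every replica of one dataset copy."""
    return single_copy_tib * replicas


def weeks_to_allocate(total_tib, weekly_tib):
    """Whole weeks needed to hand out total_tib at weekly_tib per week."""
    return math.ceil(total_tib / weekly_tib)


if __name__ == "__main__":
    needed = total_for_replicas(275, 5)   # 275 TiB per copy, 5 replicas
    print("needed for replicas:", needed, "TiB")
    print("headroom in request:", 1500 - needed, "TiB")
    print("weeks at 250 TiB/week:", weeks_to_allocate(1500, 250))
```

Under these numbers the five replicas need 1375 TiB, so the 1500 TiB request leaves 125 TiB of headroom.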

On-chain address for first allocation

f1uwzfw6hghqf6js4773p62onzvqnupcqdxbkhhvq

Data Type of Application

Public, Open Dataset (Research/Non-Profit)

Custom multisig

Identifier

n/a

Share a brief history of your project and organization

The EASIER Data initiative kicked off late this summer and is a two-year project in collaboration with the Filecoin Foundation for the Decentralized Web to build pipelines for storing and extracting geospatial data on Filecoin and IPFS. These pipelines will be prototyped and demonstrated using one year of Landsat 9 satellite data, which is estimated at about 275 TB per replica. We originally opened a request, but there was an issue and it was suggested that we open a new one.

Is this project associated with other projects/ecosystem stakeholders?

No

If answered yes, what are the other projects/ecosystem stakeholders

n/a

Describe the data being stored onto Filecoin

Landsat 9 satellite remote sensing data for the year 2019

Where was the data currently stored in this dataset sourced from

Other

If you answered "Other" in the previous question, enter the details here

n/a

If you are a data preparer, what is your location (City and Country)

n/a

If you are a data preparer, how will the data be prepared? Please include tooling used and technical details?

n/a

If you are not preparing the data, who will prepare the data? (Provide name and business)

James Hoang - Piknik

Has this dataset been stored on the Filecoin network before? If so, please explain and make the case why you would like to store this dataset again to the network. Provide details on preparation and/or SP distribution.

One copy of the data is currently stored with Piknik, but the initial DataCap request has become stale and we have not been able to store the additional replicas.

Please share a sample of the data

https://www.usgs.gov/landsat-missions/landsat-9

Confirm that this is a public dataset that can be retrieved by anyone on the Network

Yes

If you chose not to confirm, what was the reason

n/a

What is the expected retrieval frequency for this data

Monthly

For how long do you plan to keep this dataset stored on Filecoin

Permanently

In which geographies do you plan on making storage deals

North America, Europe, Asia other than Greater China, Australia (continent)

How will you be distributing your data to storage providers

HTTP or FTP server

How do you plan to choose SP

Partners

If you answered "Others" in the previous question, what is the tool or platform you plan to use

n/a

If you already have a list of storage providers to work with, fill out their names and provider IDs below

n/a

How do you plan to make deals to your storage providers

Others/custom tool

If you answered "Others/custom tool" in the previous question, enter the details here

n/a

Can you confirm that you will follow the Fil+ guideline

Yes

Application created via filplus.storage

large-datacap-requests[bot] commented 1 year ago

Thanks for your request!

Heads up, you’re requesting more than the typical weekly onboarding rate of DataCap!

large-datacap-requests[bot] commented 1 year ago

Thanks for your request! Everything looks good. :ok_hand:

A Governance Team member will review the information provided and contact you back pretty soon.

Sankara-Jefferson commented 1 year ago

The Social Impact team at Filecoin Foundation works closely with this team to support and enable them to develop a decentralized cyberinfrastructure for efficiently, accessibly, and sustainably onloading, analyzing, and extracting large amounts of spatial data on the Filecoin storage network.

jamerduhgamer commented 1 year ago

Previous application at #995 had a technical issue which required opening a new application.

Sunnyiscoming commented 1 year ago

This information will be reviewed by Fil+ Governance team to confirm validity and then the application will be triggered for notary review. Let us know if you have any questions.

jamerduhgamer commented 1 year ago

Hi @Sunnyiscoming!

Sunnyiscoming commented 1 year ago

You should list the nodes of 4 or more SPs here. Have you completed the following Fil+ registration form?

zcfil commented 1 year ago

checker:manualTrigger

filplus-checker-app[bot] commented 1 year ago

DataCap and CID Checker Report[^1]

There is no previous allocation for this issue.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

zcfil commented 1 year ago

Can you provide more detailed information on the other storage providers participating in this program, such as a list of the SPs you are currently in contact with? Do the SPs you have chosen support data retrieval?

Sunnyiscoming commented 1 year ago

Any update here?

Sunnyiscoming commented 1 year ago

Any update here?

jamerduhgamer commented 1 year ago

We only have the SPs above participating in the program so far. The SPs do support data retrieval.

We are currently looking for more SPs to onboard the data.

Sunnyiscoming commented 1 year ago

4 or more storage providers should be provided.

large-datacap-requests[bot] commented 1 year ago

DataCap Allocation requested

Request number 2

Multisig Notary address

f02049625

Client address

f1uwzfw6hghqf6js4773p62onzvqnupcqdxbkhhvq

DataCap allocation requested

250TiB

Id

cfbc26f0-b122-44ed-b9c9-7e331c1fe43b

zcfil commented 1 year ago

Please list the SPs you are already collaborating with and their regions.

dannyob commented 1 year ago

Hi, Filecoin Foundation notary here.

This is a request from a known Filecoin/IPFS user, which is the EASIER project at the University of Maryland (the applicant is the project lead there). While the data is open data from NASA, the project is to work out the best way to make the data usefully accessible from Filecoin for geodata purposes.

I'm willing to support starting up the initial data allocation while UMD sort out their uploading process and attract other SPs.

Happy to walk through the background with other notaries here. To give an example of the work UMD is doing, here's their work on distributing Intelsat 9 data via the public IPFS network.

AthSmith commented 1 year ago

Almost none of the checks in the report pass; this is a poor sealing result. I can't support it.

dannyob commented 1 year ago

Request Proposed

Your Datacap Allocation Request has been proposed by the Notary

Message sent to Filecoin Network

bafy2bzacednpoz4lllj43wm4a6pilb7zmfvem6c4tuds5ak4pk7x2tjfdhbem

Address

f1uwzfw6hghqf6js4773p62onzvqnupcqdxbkhhvq

Datacap Allocated

250.00TiB

Signer Address

f1k6wwevxvp466ybil7y2scqlhtnrz5atjkkyvm4a

Id

cfbc26f0-b122-44ed-b9c9-7e331c1fe43b

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacednpoz4lllj43wm4a6pilb7zmfvem6c4tuds5ak4pk7x2tjfdhbem

cryptowhizzard commented 1 year ago

Request Approved

Your Datacap Allocation Request has been approved by the Notary

Message sent to Filecoin Network

bafy2bzacebgmc6pwc3wzgdft2ckcmkmdbr6nmiyc2mgwhahtxmt3utqknifqe

Address

f1uwzfw6hghqf6js4773p62onzvqnupcqdxbkhhvq

Datacap Allocated

250.00TiB

Signer Address

f1krmypm4uoxxf3g7okrwtrahlmpcph3y7rbqqgfa

Id

cfbc26f0-b122-44ed-b9c9-7e331c1fe43b

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacebgmc6pwc3wzgdft2ckcmkmdbr6nmiyc2mgwhahtxmt3utqknifqe

cryptowhizzard commented 1 year ago

> Hi, Filecoin Foundation notary here.
>
> This is a request from a known Filecoin/IPFS user, which is the EASIER project at the University of Maryland (the applicant is the project lead there). While the data is open data from NASA, the project is to work out the best way to make the data usefully accessible from Filecoin for geodata purposes.
>
> I'm willing to support starting up the initial data allocation while UMD sort out their uploading process and attract other SPs.
>
> Happy to walk through the background with other notaries here. To give an example of the work UMD is doing, here's their work on distributing Intelsat 9 data via the public IPFS network.

I checked their application and it looks good. Same for me: I am willing to see them start up and have useful, real data onboarded to the network.

github-actions[bot] commented 1 year ago

This application has not seen any responses in the last 10 days. This issue will be marked with Stale label and will be closed in 4 days. Comment if you want to keep this application open.

-- Commented by Stale Bot.
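The timing the Stale Bot describes (a Stale label after 10 days without a response, then closure 4 days later) can be sketched as simple date arithmetic. This is only an illustration of the stated rule, not the bot's actual implementation; the constant names and the example date are hypothetical.

```python
from datetime import date, timedelta

# Intervals taken from the Stale Bot message; names are illustrative.
STALE_AFTER_DAYS = 10
CLOSE_AFTER_STALE_DAYS = 4


def stale_schedule(last_activity):
    """Return (date the Stale label is applied, date the issue is closed)."""
    stale_on = last_activity + timedelta(days=STALE_AFTER_DAYS)
    close_on = stale_on + timedelta(days=CLOSE_AFTER_STALE_DAYS)
    return stale_on, close_on


if __name__ == "__main__":
    # Hypothetical last-activity date, chosen only for the example.
    stale_on, close_on = stale_schedule(date(2023, 11, 20))
    print("marked stale:", stale_on)
    print("closed:", close_on)
```

Any comment on the issue resets the clock, which is why the thread contains several "comment to keep open" replies.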

jamerduhgamer commented 1 year ago

Comment to keep LDN open. Sealing has been on-going.

large-datacap-requests[bot] commented 1 year ago

DataCap Allocation requested

Request number 3

Multisig Notary address

f02049625

Client address

f1uwzfw6hghqf6js4773p62onzvqnupcqdxbkhhvq

DataCap allocation requested

500TiB

Id

8198023c-c231-4512-a7a1-b2197952fe6e

github-actions[bot] commented 1 year ago

This application has not seen any responses in the last 10 days. This issue will be marked with Stale label and will be closed in 4 days. Comment if you want to keep this application open.

-- Commented by Stale Bot.

jamerduhgamer commented 1 year ago

Comment to keep the LDN open

xinaxu commented 1 year ago

Request Proposed

Your Datacap Allocation Request has been proposed by the Notary

Message sent to Filecoin Network

bafy2bzacedz2tupauvr74fne2zfhwnjubvornymiejzelkxlxjln6pg7ayvse

Address

f1uwzfw6hghqf6js4773p62onzvqnupcqdxbkhhvq

Datacap Allocated

500.00TiB

Signer Address

f1k3ysofkrrmqcot6fkx4wnezpczlltpirmrpsgui

Id

8198023c-c231-4512-a7a1-b2197952fe6e

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacedz2tupauvr74fne2zfhwnjubvornymiejzelkxlxjln6pg7ayvse

s0nik42 commented 1 year ago

good to me

s0nik42 commented 1 year ago

Request Approved

Your Datacap Allocation Request has been approved by the Notary

Message sent to Filecoin Network

bafy2bzaceck3lqvq7b6iap7hvw7rkfdher5jtrg6uza2o2edqzjvxskgncfd4

Address

f1uwzfw6hghqf6js4773p62onzvqnupcqdxbkhhvq

Datacap Allocated

500.00TiB

Signer Address

f1wxhnytjmklj2czezaqcfl7eb4nkgmaxysnegwii

Id

8198023c-c231-4512-a7a1-b2197952fe6e

You can check the status of the message here: https://filfox.info/en/message/bafy2bzaceck3lqvq7b6iap7hvw7rkfdher5jtrg6uza2o2edqzjvxskgncfd4

large-datacap-requests[bot] commented 1 year ago

DataCap Allocation requested

Request number 3

Multisig Notary address

f02049625

Client address

f1uwzfw6hghqf6js4773p62onzvqnupcqdxbkhhvq

DataCap allocation requested

500TiB

Id

0a08e49c-af7f-4daa-92f0-ea62144340e4

jamerduhgamer commented 1 year ago

Looks like there was a bug with the previous signature. We did not get the 500 TiB allocated. Running `lotus filplus check-client-datacap f1uwzfw6hghqf6js4773p62onzvqnupcqdxbkhhvq` returns `16492674417`.

Asking the notaries to try signing again.
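To make the figure quoted above easier to read, here is a small sketch that converts a raw value into binary units. It assumes the number reported for the client address is a byte count; the helper name `format_datacap` is hypothetical and not part of the Lotus CLI.

```python
def format_datacap(num_bytes):
    """Render a byte count using binary (1024-based) units."""
    units = ["B", "KiB", "MiB", "GiB", "TiB", "PiB"]
    value = float(num_bytes)
    for unit in units:
        # Stop at the first unit where the value fits, or at the largest unit.
        if value < 1024 or unit == units[-1]:
            return f"{value:.2f} {unit}"
        value /= 1024


if __name__ == "__main__":
    print(format_datacap(16492674417))   # value quoted in the comment
    print(format_datacap(500 * 2**40))   # what a 500 TiB allocation would be
```

If the byte-count assumption holds, the remaining balance is on the order of 15 GiB, far below the 500 TiB that the allocation should have added.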

xinaxu commented 1 year ago

Request Proposed

Your Datacap Allocation Request has been proposed by the Notary

Message sent to Filecoin Network

bafy2bzacedz6rtjdlpiik72yygqfsd3b7l6mdyakd574uch2mpzwjqh3cog6q

Address

f1uwzfw6hghqf6js4773p62onzvqnupcqdxbkhhvq

Datacap Allocated

500.00TiB

Signer Address

f1k3ysofkrrmqcot6fkx4wnezpczlltpirmrpsgui

Id

0a08e49c-af7f-4daa-92f0-ea62144340e4

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacedz6rtjdlpiik72yygqfsd3b7l6mdyakd574uch2mpzwjqh3cog6q

Sunnyiscoming commented 1 year ago

Hello @TaylorOshan, per https://github.com/filecoin-project/notary-governance/issues/922 for Open, Public Dataset applicants, please complete the following Fil+ registration form to identify yourself as the applicant, and please also add the contact information of the SP entities you are working with to store copies of the data.

This information will be reviewed by Fil+ Governance team to confirm validity and then the application will be allowed to move forward for additional notary review.

jamerduhgamer commented 1 year ago

checker:manualTrigger

filplus-checker-app[bot] commented 1 year ago

DataCap and CID Checker Report Summary[^1]

Storage Provider Distribution

⚠️ 1 storage providers sealed too much duplicate data - f01392893: 55.65%

Deal Data Replication

⚠️ 100.00% of deals are for data replicated across less than 4 storage providers.

Deal Data Shared with other Clients[^3]

✔️ No CID sharing has been observed.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

[^2]: Deals from those addresses are combined into this report as they are specified with checker:manualTrigger

[^3]: To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

Full report

Click here to view the CID Checker report. Click here to view the Retrieval Dashboard.
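The two warnings in the report can be illustrated with a short sketch. The checker's exact definitions are not given in this thread, so the calculation below is an assumption: a deal counts as under-replicated when its piece is held by fewer than 4 distinct providers, and a provider's duplicate share is the fraction of its sealed bytes that repeat a piece it already holds. The deal tuples and provider IDs in the usage example are made up.

```python
from collections import defaultdict

MIN_PROVIDERS = 4  # replication threshold quoted in the report


def replication_shortfall(deals):
    """Percent of deal bytes whose piece sits on fewer than MIN_PROVIDERS SPs.

    deals: iterable of (piece_cid, provider_id, size_bytes) tuples.
    """
    deals = list(deals)
    providers_by_piece = defaultdict(set)
    for piece, provider, _size in deals:
        providers_by_piece[piece].add(provider)
    total = sum(size for _, _, size in deals)
    under = sum(size for piece, _, size in deals
                if len(providers_by_piece[piece]) < MIN_PROVIDERS)
    return 100.0 * under / total if total else 0.0


def duplicate_share(deals, provider_id):
    """Percent of one provider's sealed bytes that duplicate an earlier piece."""
    seen = set()
    total = duplicate = 0
    for piece, provider, size in deals:
        if provider != provider_id:
            continue
        total += size
        if piece in seen:
            duplicate += size
        else:
            seen.add(piece)
    return 100.0 * duplicate / total if total else 0.0


if __name__ == "__main__":
    # Made-up deals: piece "a" is sealed twice by the same provider.
    sample = [("a", "f01", 32), ("a", "f01", 32), ("b", "f01", 32), ("b", "f02", 32)]
    print(replication_shortfall(sample))
    print(round(duplicate_share(sample, "f01"), 2))
```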

TaylorOshan commented 1 year ago

@jamerduhgamer could you please share with me the information regarding the storage provider so that I can fill out the FIL+ registration form?

github-actions[bot] commented 12 months ago

This application has not seen any responses in the last 10 days. This issue will be marked with Stale label and will be closed in 4 days. Comment if you want to keep this application open.

-- Commented by Stale Bot.

jamerduhgamer commented 11 months ago

Comment to keep the application open. Information has been shared with Taylor via Slack.

github-actions[bot] commented 11 months ago

This application has not seen any responses in the last 10 days. This issue will be marked with Stale label and will be closed in 4 days. Comment if you want to keep this application open.

-- Commented by Stale Bot.

TaylorOshan commented 11 months ago

Please keep open. Updating soon.

Sunnyiscoming commented 11 months ago

SP List provided: [{"providerID": "f02806693", "City": "Sydney", "Country": "AU", "SPOrg": "Andrew Sjoquist Enterprises Pty Ltd"}, {"providerID": "f01392893", "City": "Oostknollendam", "Country": "NL", "SPOrg": "Fusix Networks B.V."}, {"providerID": "f01851060", "City": "Las Vegas", "Country": "US", "SPOrg": "PIKNIK & Company Inc."}]
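A minimal sketch, assuming the SP list is available as valid JSON with the keys shown above, that counts the providers, groups them by country, and compares the count with the four-provider minimum requested earlier in the thread. The function name and the `minimum` parameter are illustrative.

```python
import json

# The three providers from the comment above, written as valid JSON.
SP_LIST_JSON = """[
  {"providerID": "f02806693", "City": "Sydney", "Country": "AU",
   "SPOrg": "Andrew Sjoquist Enterprises Pty Ltd"},
  {"providerID": "f01392893", "City": "Oostknollendam", "Country": "NL",
   "SPOrg": "Fusix Networks B.V."},
  {"providerID": "f01851060", "City": "Las Vegas", "Country": "US",
   "SPOrg": "PIKNIK & Company Inc."}
]"""


def summarize_sps(raw_json, minimum=4):
    """Parse an SP list and report whether it meets the provider minimum."""
    sps = json.loads(raw_json)
    return {
        "count": len(sps),
        "countries": sorted({sp["Country"] for sp in sps}),
        "meets_minimum": len(sps) >= minimum,
    }


if __name__ == "__main__":
    print(summarize_sps(SP_LIST_JSON))
```

With the three providers listed, the check reports that the minimum of four is not yet met.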

jamerduhgamer commented 11 months ago

Hello @Sunnyiscoming and @Kevin-FF-USA, is it possible for us to have only 4 SPs? We are struggling to find a 5th SP for this project.

github-actions[bot] commented 10 months ago

This application has not seen any responses in the last 10 days. This issue will be marked with Stale label and will be closed in 4 days. Comment if you want to keep this application open.

-- Commented by Stale Bot.

github-actions[bot] commented 10 months ago

This application has not seen any responses in the last 14 days, so for now it is being closed. Please feel free to contact the Fil+ Gov team to re-open the application if it is still being processed. Thank you!

-- Commented by Stale Bot.

jamerduhgamer commented 10 months ago

Hello @kevzak can we get this application re-opened?

jamerduhgamer commented 9 months ago

@simonkim0515 and @Kevin-FF-USA, could we get this application re-opened? Looking to start progress on this dataset again soon.

SethDocherty commented 9 months ago

checker:manualTrigger

filplus-checker-app[bot] commented 9 months ago

DataCap and CID Checker Report Summary[^1]

Storage Provider Distribution

⚠️ 1 storage providers sealed too much duplicate data - f01392893: 55.65%

Deal Data Replication

⚠️ 100.00% of deals are for data replicated across less than 4 storage providers.

Deal Data Shared with other Clients[^3]

✔️ No CID sharing has been observed.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

[^2]: Deals from those addresses are combined into this report as they are specified with checker:manualTrigger

[^3]: To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

Full report

Click here to view the CID Checker report. Click here to view the Retrieval Dashboard.

github-actions[bot] commented 8 months ago

This application has not seen any responses in the last 10 days. This issue will be marked with Stale label and will be closed in 4 days. Comment if you want to keep this application open.

-- Commented by Stale Bot.