filecoin-project / filecoin-plus-large-datasets

Hub for client applications for DataCap at a large scale
109 stars 62 forks source link

[DataCap Application] <FogMeta Lab> - <Southern California Earthquake Data> #1852

Closed hengdingy closed 1 year ago

hengdingy commented 1 year ago

Data Owner Name

FogMeta Lab

Data Owner Country/Region

China

Data Owner Industry

Web3 / Crypto

Website

https://fogmeta.com/

Social Media

Twitter: https://twitter.com/FogMeta
GitHub: https://github.com/FogMeta

Total amount of DataCap being requested

2PiB

Weekly allocation of DataCap requested

1PiB

On-chain address for first allocation

f17ukn3hi4emcgwqzerjcnscbsamy5a3gsbjquoni

Data Type of Application

Public, Open Dataset (Research/Non-Profit)

Custom multisig

Identifier

No response

Share a brief history of your project and organization

FogMeta Lab's research spans multiple levels from system technology, infrastructure, and middleware to services and solutions, and involves future systems, network technology and business, distributed systems and management, information management, and interactive and innovative services. Based on the views on and practices in the industry, FogMeta also solves the problem of business complexity through operations optimization and other technologies.

Is this project associated with other projects/ecosystem stakeholders?

No

If answered yes, what are the other projects/ecosystem stakeholders

No response

Describe the data being stored onto Filecoin

"This dataset contains ground motion velocity and acceleration seismic waveforms recorded by the Southern California Seismic Network (SCSN) and archived at the Southern California Earthquake Data Center (SCEDC). A Distributed Acousting Sensing (DAS) dataset is included."

Source: https://registry.opendata.aws/southern-california-earthquakes/

Where was the data currently stored in this dataset sourced from

AWS Cloud

If you answered "Other" in the previous question, enter the details here

No response

How do you plan to prepare the dataset

IPFS, lotus, graphsplit, others/custom tool

If you answered "other/custom tool" in the previous question, enter the details here

No response

Please share a sample of the data

s3://scedc-pds/ (107.9 TiB)

Confirm that this is a public dataset that can be retrieved by anyone on the Network

If you chose not to confirm, what was the reason

No response

What is the expected retrieval frequency for this data

Monthly

For how long do you plan to keep this dataset stored on Filecoin

2 to 3 years

In which geographies do you plan on making storage deals

Greater China, Asia other than Greater China, Africa, North America, South America, Europe, Australia (continent), Antarctica

How will you be distributing your data to storage providers

Cloud storage (i.e. S3), HTTP or FTP server, IPFS, Shipping hard drives

How do you plan to choose storage providers

Slack, Partners, Others

If you answered "Others" in the previous question, what is the tool or platform you plan to use

We'd also like to use FilSwan platform (https://filswan.com/) to choose storage providers who meet our requirements.

If you already have a list of storage providers to work with, fill out their names and provider IDs below

The storage providers we'd like to work with are presented below. Some of them are from the FilSwan platform.
f01955033
f02029115
f03624
f010088
f02301
f08399
f02401
f01955030
f0187709
f01163272
f01402814
f01390330
f01225882
f0717969
f03223
f01395673
f01072221
f0143858
f01786736
f0836160
f032824
f01443744
f01871352
f01907556
f01955028
f01947280
f01946551
f02012951
f01970630
f0240185

How do you plan to make deals to your storage providers

Boost client, Lotus client, Others/custom tool

If you answered "Others/custom tool" in the previous question, enter the details here

Swan Client tool
https://github.com/filswan/go-swan-client

Can you confirm that you will follow the Fil+ guideline

Yes

large-datacap-requests[bot] commented 1 year ago

Thanks for your request! Everything looks good. :ok_hand:

A Governance Team member will review the information provided and contact you back pretty soon.

large-datacap-requests[bot] commented 1 year ago

Thanks for your request! Everything looks good. :ok_hand:

A Governance Team member will review the information provided and contact you back pretty soon.

Sunnyiscoming commented 1 year ago

Datacap Request Trigger

Total DataCap requested

2PiB

Expected weekly DataCap usage rate

1PiB

Client address

f17ukn3hi4emcgwqzerjcnscbsamy5a3gsbjquoni

large-datacap-requests[bot] commented 1 year ago

DataCap Allocation requested

Multisig Notary address

f02049625

Client address

f17ukn3hi4emcgwqzerjcnscbsamy5a3gsbjquoni

DataCap allocation requested

102.39TiB

Id

aa21056f-6fa3-4850-9446-4d1c54c79f9f

large-datacap-requests[bot] commented 1 year ago

DataCap Allocation requested

Multisig Notary address

f02049625

Client address

f17ukn3hi4emcgwqzerjcnscbsamy5a3gsbjquoni

DataCap allocation requested

102.39TiB

Id

752969db-bdf7-4d2a-b36b-a50fa60ba64b

filplus-checker-app[bot] commented 1 year ago

DataCap and CID Checker Report[^1]

No application info found for this issue on https://filplus.d.interplanetary.one/clients.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

filplus-checker-app[bot] commented 1 year ago

DataCap and CID Checker Report[^1]

No application info found for this issue on https://filplus.d.interplanetary.one/clients.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

zcfil commented 1 year ago

Found only a small amount of data. Can you provide more data samples? Can you email filplus-app-review@fil.org using your official domain name to confirm your identity? Email name should contain Issue ID #1852.

hengdingy commented 1 year ago

Found only a small amount of data. Can you provide more data samples? Can you email filplus-app-review@fil.org using your official domain name to confirm your identity? Email name should contain Issue ID #1852.

@zcfil @Sunnyiscoming I have sent the email, please check it. all dataset is from s3://scedc-pds/ (107.9 TiB), you can read this bucket using awscli command

image
Sunnyiscoming commented 1 year ago

Received that.

kernelogic commented 1 year ago

Well known client, willing to support.

kernelogic commented 1 year ago

Request Proposed

Your Datacap Allocation Request has been proposed by the Notary

Message sent to Filecoin Network

bafy2bzacecmw2gpy6okoytwezvgbstsr2k5kpbhozft4xjwar2unx2l425xoi

Address

f17ukn3hi4emcgwqzerjcnscbsamy5a3gsbjquoni

Datacap Allocated

102.39TiB

Signer Address

f1yjhnsoga2ccnepb7t3p3ov5fzom3syhsuinxexa

Id

752969db-bdf7-4d2a-b36b-a50fa60ba64b

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacecmw2gpy6okoytwezvgbstsr2k5kpbhozft4xjwar2unx2l425xoi

zcfil commented 1 year ago

I see you only have over 100T of display data, how do you allocate this data?

hengdingy commented 1 year ago

@zcfil we will store the data 15 copies to different region SPs. dataset will be split to 16GiB piece, required 32GiB datacap, 100TiB/163215/1024=2.9PiB datacap is needed.

All data piece will be sent to SPs according to the filswan platform. swan-client is a great onboarding tool. The piece will be downloaded to SP.

github-actions[bot] commented 1 year ago

This application has not seen any responses in the last 10 days. This issue will be marked with Stale label and will be closed in 4 days. Comment if you want to keep this application open.

github-actions[bot] commented 1 year ago

This application has not seen any responses in the last 14 days, so for now it is being closed. Please feel free to contact the Fil+ Gov team to re-open the application if it is still being processed. Thank you!

hengdingy commented 1 year ago

@zcfil @simonkim0515 @Sunnyiscoming can you help us reopen this application? we are preparing data, and notary are doing the DD.