filecoin-project / filecoin-plus-large-datasets

Hub for client applications for DataCap at a large scale
109 stars 62 forks source link

[DataCap Application] EMPIAR Public dataset(4/4) #1848

Closed nicelove666 closed 1 year ago

nicelove666 commented 1 year ago

Data Owner Name

EMPIAR Public dataset(3/4)

Data Owner Country/Region

United Kingdom

Data Owner Industry

Life Science / Healthcare

Website

https://www.ebi.ac.uk/empiar

Social Media

https://www.ebi.ac.uk/empiar

Total amount of DataCap being requested

5PiB

Weekly allocation of DataCap requested

500TiB

On-chain address for first allocation

f123flsrflwx35b7spvr2xhwrekz7zvse6753dqwi

Data Type of Application

Public, Open Dataset (Research/Non-Profit)

Custom multisig

Identifier

No response

Share a brief history of your project and organization

EMPIAR, the Electron Microscopy Public Image Archive, is a public resource for raw images underpinning 3D cryo-EM maps and tomograms (themselves archived in EMDB). EMPIAR also accommodates 3D datasets obtained with volume EM techniques and soft and hard X-ray tomography. More ...
As of 2023-03-27, EMPIAR contains 1254 entries, taking up 2.73 PB of storage.

Is this project associated with other projects/ecosystem stakeholders?

Yes

If answered yes, what are the other projects/ecosystem stakeholders

No response

Describe the data being stored onto Filecoin

EMPIAR, the Electron Micronscopy Public Image Archive, is a public resource for raw image underpinning 3D cryo-EM maps and tomograms. EMPIAR also accomodates 3D datasets obtained with volume EM techniques and soft and hard X-ray tomography. The purpose of EMPIAR is to provide easy access to state-of-the-art data to facilitate methods development, validation and re-use, e.g, for Machine Learning applications. EMPIAR data is also used for training and teaching purposes and as part of community challenges.

Where was the data currently stored in this dataset sourced from

AWS Cloud

If you answered "Other" in the previous question, enter the details here

No response

How do you plan to prepare the dataset

IPFS, lotus, singularity, graphsplit

If you answered "other/custom tool" in the previous question, enter the details here

No response

Please share a sample of the data

Yes
https://www.ebi.ac.uk/empiar

Confirm that this is a public dataset that can be retrieved by anyone on the Network

If you chose not to confirm, what was the reason

No response

What is the expected retrieval frequency for this data

Yearly

For how long do you plan to keep this dataset stored on Filecoin

1.5 to 2 years

In which geographies do you plan on making storage deals

Greater China, Asia other than Greater China, North America, Europe

How will you be distributing your data to storage providers

Cloud storage (i.e. S3), HTTP or FTP server, IPFS, Shipping hard drives, Lotus built-in data transfer

How do you plan to choose storage providers

Slack, Filmine

If you answered "Others" in the previous question, what is the tool or platform you plan to use

No response

If you already have a list of storage providers to work with, fill out their names and provider IDs below

No response

How do you plan to make deals to your storage providers

Boost client, Lotus client, Bidbot, Singularity

If you answered "Others/custom tool" in the previous question, enter the details here

No response

Can you confirm that you will follow the Fil+ guideline

Yes

filplus-checker-app[bot] commented 1 year ago

DataCap and CID Checker Report Summary[^1]

Storage Provider Distribution

✔️ Storage provider distribution looks healthy.

Deal Data Replication

⚠️ 54.92% of deals are for data replicated across less than 4 storage providers.

Deal Data Shared with other Clients[^3]

✔️ No CID sharing has been observed.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

[^2]: Deals from those addresses are combined into this report as they are specified with checker:manualTrigger

[^3]: To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

Full report

Click here to view the full report.

nicelove666 commented 1 year ago

Since there is no DC in v3.1, the last signature did not arrive. I contacted the official staff. The official said that the last signatures could not let the DC to arrive, so the last signature is invalid, and it is recommended to re-sign.

So, can you please help me to sign again, thank you very much.

psh0691 commented 1 year ago

Request Proposed

Your Datacap Allocation Request has been proposed by the Notary

Message sent to Filecoin Network

bafy2bzacedm5bykrc3wljfmtqonthmxncbtj3j64ru3frzhvgkpfkxvxgfqx4

Address

f123flsrflwx35b7spvr2xhwrekz7zvse6753dqwi

Datacap Allocated

1.34PiB

Signer Address

f1qdko4jg25vo35qmyvcrw4ak4fmuu3f5rif2kc7i

Id

caa3c302-8fee-4e76-99fd-db930e2c3968

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacedm5bykrc3wljfmtqonthmxncbtj3j64ru3frzhvgkpfkxvxgfqx4

laurarenpanda commented 1 year ago

Request Proposed

Your Datacap Allocation Request has been proposed by the Notary

Message sent to Filecoin Network

bafy2bzacecs7g4i4spmde3tgrntchsalax356oa2oxasrkovvfl7mkufh67ke

Address

f123flsrflwx35b7spvr2xhwrekz7zvse6753dqwi

Datacap Allocated

1.34PiB

Signer Address

f1bp3tzp536edm7dodldceekzbsx7zcy7hdfg6uzq

Id

caa3c302-8fee-4e76-99fd-db930e2c3968

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacecs7g4i4spmde3tgrntchsalax356oa2oxasrkovvfl7mkufh67ke

stcloudlisa commented 1 year ago

Request Approved

Your Datacap Allocation Request has been approved by the Notary

Message sent to Filecoin Network

bafy2bzacebc3ebhwjiunkrbmynzv7ynhataon5tzd3u5vja6oh7bgyaf5fa36

Address

f123flsrflwx35b7spvr2xhwrekz7zvse6753dqwi

Datacap Allocated

1.34PiB

Signer Address

f1jvvltduw35u6inn5tr4nfualyd42bh3vjtylgci

Id

caa3c302-8fee-4e76-99fd-db930e2c3968

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacebc3ebhwjiunkrbmynzv7ynhataon5tzd3u5vja6oh7bgyaf5fa36

psh0691 commented 1 year ago

Request Proposed

Your Datacap Allocation Request has been proposed by the Notary

Message sent to Filecoin Network

bafy2bzaceb7prizrukur2j5kcayqfrcm7xo25nhrdk6ryfkxx3k3uclrjeqz4

Address

f123flsrflwx35b7spvr2xhwrekz7zvse6753dqwi

Datacap Allocated

1.34PiB

Signer Address

f1qdko4jg25vo35qmyvcrw4ak4fmuu3f5rif2kc7i

Id

caa3c302-8fee-4e76-99fd-db930e2c3968

You can check the status of the message here: https://filfox.info/en/message/bafy2bzaceb7prizrukur2j5kcayqfrcm7xo25nhrdk6ryfkxx3k3uclrjeqz4

Joss-Hua commented 1 year ago

checker:manualTrigger

filplus-checker-app[bot] commented 1 year ago

DataCap and CID Checker Report Summary[^1]

Retrieval Statistics

Storage Provider Distribution

✔️ Storage provider distribution looks healthy.

Deal Data Replication

⚠️ 41.36% of deals are for data replicated across less than 4 storage providers.

Deal Data Shared with other Clients[^3]

✔️ No CID sharing has been observed.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

[^2]: Deals from those addresses are combined into this report as they are specified with checker:manualTrigger

[^3]: To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

Full report

Click here to view the CID Checker report. Click here to view the Retrieval report.

github-actions[bot] commented 1 year ago

This application has not seen any responses in the last 10 days. This issue will be marked with Stale label and will be closed in 4 days. Comment if you want to keep this application open.