nicelove666 / Allocator-Pathway-IPFSTT

4 stars 2 forks source link

[DataCap Application] ENCODE #65

Closed dds9988 closed 1 month ago

dds9988 commented 2 months ago

Version

1

DataCap Applicant

Encyclopedia of DNA Elements (ENCODE)

Project ID

ENCODE01

Data Owner Name

ENCODE Data Coordinating Center

Data Owner Country/Region

United States

Data Owner Industry

Life Science / Healthcare

Website

https://www.encodeproject.org/

Social Media Handle

https://www.encodeproject.org/

Social Media Type

Slack

What is your role related to the dataset

Data Preparer

Total amount of DataCap being requested

5PiB

Expected size of single dataset (one copy)

1280TiB

Number of replicas to store

4

Weekly allocation of DataCap requested

512TiB

On-chain address for first allocation

f1qqop6wsiuqnavnvfwsvfm2bdhjmoskfdfutyaii

Data Type of Application

Public, Open Dataset (Research/Non-Profit)

Custom multisig

Identifier

No response

Share a brief history of your project and organization

I'm individual dp. The project I'd like to join in filecoin is storing the data I've downloaded before. The data is from the Encode. The ENCODE Consortium not only produces high-quality data, but also analyzes the data in an integrative fashion. The ENCODE Encyclopedia organizes the most salient analysis products into annotations and provides tools to search and visualize them. The goal of ENCODE is to build a comprehensive parts list of functional elements in the human genome, including elements that act at the protein and RNA levels, and regulatory elements that control cells and circumstances in which a gene is active.

Is this project associated with other projects/ecosystem stakeholders?

No

If answered yes, what are the other projects/ecosystem stakeholders

No response

Describe the data being stored onto Filecoin

Released and archived ENCODE data. Regulatory elements are typically investigated through DNA hypersensitivity assays, assays of DNA methylation, and immunoprecipitation (IP) of proteins that interact with DNA and RNA, i.e., modified histones, transcription factors, chromatin regulators, and RNA-binding proteins, followed by sequencing.

Where was the data currently stored in this dataset sourced from

AWS Cloud

If you answered "Other" in the previous question, enter the details here

No response

If you are a data preparer. What is your location (Country/Region)

Singapore

If you are a data preparer, how will the data be prepared? Please include tooling used and technical details?

No response

If you are not preparing the data, who will prepare the data? (Provide name and business)

No response

Has this dataset been stored on the Filecoin network before? If so, please explain and make the case why you would like to store this dataset again to the network. Provide details on preparation and/or SP distribution.

No response

Please share a sample of the data

https://registry.opendata.aws/encode-project/

https://www.encodeproject.org

Confirm that this is a public dataset that can be retrieved by anyone on the Network

If you chose not to confirm, what was the reason

No response

What is the expected retrieval frequency for this data

Yearly

For how long do you plan to keep this dataset stored on Filecoin

1.5 to 2 years

In which geographies do you plan on making storage deals

Greater China, Asia other than Greater China, North America, Europe

How will you be distributing your data to storage providers

Cloud storage (i.e. S3), IPFS, Lotus built-in data transfer

How did you find your storage providers

Slack, Partners

If you answered "Others" in the previous question, what is the tool or platform you used

No response

Please list the provider IDs and location of the storage providers you will be working with.

Data Solutions -- f02984331 --Australia 
Coffee Cloud -- f02883857-- Singapore 
CloudWings -- f02852273-- United Kingdom 
Smart Data -- f02973061-- Russia

How do you plan to make deals to your storage providers

Boost client, Lotus client, Bidbot

If you answered "Others/custom tool" in the previous question, enter the details here

No response

Can you confirm that you will follow the Fil+ guideline

Yes

datacap-bot[bot] commented 2 months ago

Application is waiting for allocator review

nicelove666 commented 2 months ago

Thank you for your application, please fill in https://github.com/nicelove666/IPFSTT-Client-Due-Diligence-Form/issues/new/choose.

Our main focus is:

You need to provide comprehensive SP information, including SP, geographical location, and company name. Why do you want to put the data on Filecoin? Do you have relevant data processing experience? Can you satisfy spark retrieval and store unseal files?

dds9988 commented 2 months ago

This is the sp we cooperate with,All SPs support spark: Data Solutions -- f02984331 --Singapore Coffee Cloud -- f02883857-- Singapore CloudWings -- f02852273-- United Kingdom Smart Data -- f02973061-- Russia

We have filled in:https://github.com/nicelove666/IPFSTT-Client-Due-Diligence-Form/issues/7

We prepared the data for this website very early and downloaded it for 2 months. Now our data is downloaded and the sp pledge is ready. We hope to get your support.

datacap-bot[bot] commented 2 months ago

Datacap Request Trigger

Total DataCap requested

5PiB

Expected weekly DataCap usage rate

512TiB

DataCap Amount - First Tranche

256TiB

Client address

f1qqop6wsiuqnavnvfwsvfm2bdhjmoskfdfutyaii

datacap-bot[bot] commented 2 months ago

DataCap Allocation requested

Multisig Notary address

Client address

f1qqop6wsiuqnavnvfwsvfm2bdhjmoskfdfutyaii

DataCap allocation requested

256TiB

Id

c0b1411e-2d13-4d14-bca0-9db48c124fd3

datacap-bot[bot] commented 2 months ago

Application is ready to sign

datacap-bot[bot] commented 2 months ago

Request Approved

Your Datacap Allocation Request has been approved by the Notary

Message sent to Filecoin Network

bafy2bzacecv7pgouaygfcs7rkpvb236qap55hlyb3xbdilbct7qwnrrlzc5h4

Address

f1qqop6wsiuqnavnvfwsvfm2bdhjmoskfdfutyaii

Datacap Allocated

256TiB

Signer Address

f16hmuu3w247dkkhsrbbcbeqbugmpjbxpkrpcdatq

Id

c0b1411e-2d13-4d14-bca0-9db48c124fd3

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacecv7pgouaygfcs7rkpvb236qap55hlyb3xbdilbct7qwnrrlzc5h4

datacap-bot[bot] commented 2 months ago

Application is Granted

datacap-bot[bot] commented 2 months ago

Client used 75% of the allocated DataCap. Consider allocating next tranche.

nicelove666 commented 2 months ago

checker:manualTrigger

datacap-bot[bot] commented 2 months ago

DataCap and CID Checker Report[^1]

No active deals found for this client.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

dds9988 commented 2 months ago

checker:manualTrigger

datacap-bot[bot] commented 2 months ago

DataCap and CID Checker Report[^1]

No active deals found for this client.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

dds9988 commented 2 months ago

Why is there no data? We have finished using it. Is there anything we need to do?

dds9988 commented 2 months ago

@nicelove666 can you continue to help us

usejsi commented 2 months ago

checker:manualTrigger

datacap-bot[bot] commented 2 months ago

DataCap and CID Checker Report Summary[^1]

Storage Provider Distribution

✔️ Storage provider distribution looks healthy.

⚠️ 100.00% of Storage Providers have retrieval success rate less than 75%.

⚠️ The average retrieval success rate is 11.28%

Deal Data Replication

✔️ Data replication looks healthy.

Deal Data Shared with other Clients[^3]

✔️ No CID sharing has been observed.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

[^2]: Deals from those addresses are combined into this report as they are specified with checker:manualTrigger

[^3]: To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

Full report

Click here to view the CID Checker report.

dds9988 commented 1 month ago

checker:manualTrigger

datacap-bot[bot] commented 1 month ago

DataCap and CID Checker Report Summary[^1]

Storage Provider Distribution

✔️ Storage provider distribution looks healthy.

⚠️ 100.00% of Storage Providers have retrieval success rate less than 75%.

⚠️ The average retrieval success rate is 12.45%

Deal Data Replication

✔️ Data replication looks healthy.

Deal Data Shared with other Clients[^3]

✔️ No CID sharing has been observed.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

[^2]: Deals from those addresses are combined into this report as they are specified with checker:manualTrigger

[^3]: To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

Full report

Click here to view the CID Checker report.

datacap-bot[bot] commented 1 month ago

Application is in Refill

datacap-bot[bot] commented 1 month ago

Request Approved

Your Datacap Allocation Request has been approved by the Notary

Message sent to Filecoin Network

bafy2bzacebiyp6xgow4ta4x6hlabhfnd2spu5etkex7z3lks2btuy7ow7cp76

Address

f1qqop6wsiuqnavnvfwsvfm2bdhjmoskfdfutyaii

Datacap Allocated

512TiB

Signer Address

f16hmuu3w247dkkhsrbbcbeqbugmpjbxpkrpcdatq

Id

23fe1824-4c31-432d-a2cc-4a3dc2164ee5

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacebiyp6xgow4ta4x6hlabhfnd2spu5etkex7z3lks2btuy7ow7cp76

datacap-bot[bot] commented 1 month ago

Application is Granted

nicelove666 commented 1 month ago

checker:manualTrigger

datacap-bot[bot] commented 1 month ago

DataCap and CID Checker Report Summary[^1]

Storage Provider Distribution

✔️ Storage provider distribution looks healthy.

⚠️ 25.00% of Storage Providers have retrieval success rate equal to zero.

⚠️ 100.00% of Storage Providers have retrieval success rate less than 75%.

⚠️ The average retrieval success rate is 9.47%

Deal Data Replication

✔️ Data replication looks healthy.

Deal Data Shared with other Clients[^3]

✔️ No CID sharing has been observed.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

[^2]: Deals from those addresses are combined into this report as they are specified with checker:manualTrigger

[^3]: To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

Full report

Click here to view the CID Checker report.

nicelove666 commented 1 month ago

The retrieval success rate is too low. I see that you only used 256T in the first round, and did not use 512T in the second round. Whether we can sign for the third round depends on your retrieval success rate after the second round. If the retrieval success rate cannot be improved, we cannot sign for you again. Sorry!

datacap-bot[bot] commented 1 month ago

Client used 75% of the allocated DataCap. Consider allocating next tranche.

nicelove666 commented 1 month ago

checker:manualTrigger

datacap-bot[bot] commented 1 month ago

DataCap and CID Checker Report Summary[^1]

Storage Provider Distribution

✔️ Storage provider distribution looks healthy.

⚠️ 100.00% of Storage Providers have retrieval success rate less than 75%.

Deal Data Replication

✔️ Data replication looks healthy.

Deal Data Shared with other Clients[^3]

✔️ No CID sharing has been observed.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

[^2]: Deals from those addresses are combined into this report as they are specified with checker:manualTrigger

[^3]: To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

Full report

Click here to view the CID Checker report.

nicelove666 commented 1 month ago

checker:manualTrigger

datacap-bot[bot] commented 1 month ago

DataCap and CID Checker Report Summary[^1]

Storage Provider Distribution

✔️ Storage provider distribution looks healthy.

⚠️ 100.00% of Storage Providers have retrieval success rate less than 75%.

Deal Data Replication

✔️ Data replication looks healthy.

Deal Data Shared with other Clients[^3]

✔️ No CID sharing has been observed.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

[^2]: Deals from those addresses are combined into this report as they are specified with checker:manualTrigger

[^3]: To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

Full report

Click here to view the CID Checker report.

nicelove666 commented 1 month ago

image

nicelove666 commented 1 month ago

Although there is an improvement, the retrieval rate is too low!

nicelove666 commented 1 month ago

checker:manualTrigger

datacap-bot[bot] commented 1 month ago

DataCap and CID Checker Report Summary[^1]

Storage Provider Distribution

✔️ Storage provider distribution looks healthy.

⚠️ 100.00% of Storage Providers have retrieval success rate less than 75%.

Deal Data Replication

✔️ Data replication looks healthy.

Deal Data Shared with other Clients[^3]

✔️ No CID sharing has been observed.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

[^2]: Deals from those addresses are combined into this report as they are specified with checker:manualTrigger

[^3]: To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

Full report

Click here to view the CID Checker report.

dds9988 commented 1 month ago

Spark is improving! @nicelove666

dds9988 commented 1 month ago

checker:manualTrigger

datacap-bot[bot] commented 1 month ago

DataCap and CID Checker Report Summary[^1]

Storage Provider Distribution

✔️ Storage provider distribution looks healthy.

⚠️ 100.00% of Storage Providers have retrieval success rate less than 75%.

Deal Data Replication

✔️ Data replication looks healthy.

Deal Data Shared with other Clients[^3]

✔️ No CID sharing has been observed.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

[^2]: Deals from those addresses are combined into this report as they are specified with checker:manualTrigger

[^3]: To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

Full report

Click here to view the CID Checker report.

dds9988 commented 1 month ago

The improvement of spark takes time. After this period of adjustment, the spark retrieval success rate is gradually improving. We hope that you can bring us 1PiB.

dds9988 commented 1 month ago

2273 went from not supporting spark to retrieval from 0 to 37%, it is 42% now!

datacap-bot[bot] commented 1 month ago

Application is in Refill

datacap-bot[bot] commented 1 month ago

Request Approved

Your Datacap Allocation Request has been approved by the Notary

Message sent to Filecoin Network

bafy2bzacecdkm3n3y4cah5brfplj4zcod5mbfobdsw6djr45fsvhik3q72xnc

Address

f1qqop6wsiuqnavnvfwsvfm2bdhjmoskfdfutyaii

Datacap Allocated

1PiB

Signer Address

f16hmuu3w247dkkhsrbbcbeqbugmpjbxpkrpcdatq

Id

dcc5132d-00fe-44c3-9572-c745c2d93157

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacecdkm3n3y4cah5brfplj4zcod5mbfobdsw6djr45fsvhik3q72xnc

datacap-bot[bot] commented 1 month ago

Application is Granted

datacap-bot[bot] commented 1 month ago

Client used 75% of the allocated DataCap. Consider allocating next tranche.

dds9988 commented 1 month ago

checker:manualTrigger

datacap-bot[bot] commented 1 month ago

DataCap and CID Checker Report Summary[^1]

Storage Provider Distribution

✔️ Storage provider distribution looks healthy.

⚠️ 100.00% of Storage Providers have retrieval success rate less than 75%.

Deal Data Replication

✔️ Data replication looks healthy.

Deal Data Shared with other Clients[^3]

✔️ No CID sharing has been observed.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

[^2]: Deals from those addresses are combined into this report as they are specified with checker:manualTrigger

[^3]: To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

Full report

Click here to view the CID Checker report.

nicelove666 commented 1 month ago

Considering that the dataset is stored multiple times, we cannot sign for you. If you need continued support, please find a new dataset. We need to understand that storing the data multiple times can not help the growth and development of the Filecoin network.

dds9988 commented 1 month ago

checker:manualTrigger

datacap-bot[bot] commented 1 month ago

DataCap and CID Checker Report Summary[^1]

Storage Provider Distribution

✔️ Storage provider distribution looks healthy.

⚠️ 100.00% of Storage Providers have retrieval success rate less than 75%.

Deal Data Replication

✔️ Data replication looks healthy.

Deal Data Shared with other Clients[^3]

✔️ No CID sharing has been observed.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

[^2]: Deals from those addresses are combined into this report as they are specified with checker:manualTrigger

[^3]: To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

Full report

Click here to view the CID Checker report.

dds9988 commented 1 month ago

Can we get more DC?

dds9988 commented 1 month ago

This data set is very large and no one has stored it all yet, so we think it can still be used @nicelove666