ipfsforcezuofu / ipfsforce-allocator

Allocator for Fileplus Program
0 stars 0 forks source link

DC application - Filedrive #15

Open stph51 opened 1 month ago

stph51 commented 1 month ago

Data Owner Name

FileDrive Labs

Data Owner Country/Region

China

Data Owner Industry

Life Science / Healthcare

Website

https://filedrive.io

Social Media Handle

@FileDrive

Social Media Type

Slack

What is your role related to the dataset

Data Preparer

Total amount of DataCap being requested

4PiB

Expected size of single dataset (one copy)

1000TiB

Number of replicas to store

4

Weekly allocation of DataCap requested

1000TiB

On-chain address for first allocation

f3uutkfpsrwph4e47znxeq3pf2yywz7l77hnr7ppoo3k3ohowx72ropumnmc3k7v54x3hq2oaik6pi2n2le5pa

Data Type of Application

Public, Open Dataset (Research/Non-Profit)

Custom multisig

Identifier

No response

Share a brief history of your project and organization

FileDrive Labs has always defined ourselves as tool developers and infrastructure builders in the Filecoin ecosystem.    From 2019, we continuously focus on technical solutions and development based on IPFS protocol and the Filecoin network and do our best to contribute to the community.
Over 80% of our team are qualified engineers, and half of them have more than 10-year development experience in multiple industries, including Communication, the Internet, and blockchain.
Since 2020, we have participated in Slingshot Competition, become one of the top teams, and stored over 5 PiB useful data from public datasets to the Filecoin network.
To contribute to the Filecoin Community, we developed an open-source data prep tool Graphsplit, FIL+ project dashboard filplus.info and storage provider discovery platform filfind,info.
Besides, we have also hold weekly online virtual events named FileDrive Meetup from March 2022, which aims to provide a platform for community members to grasp the latest trends of the Filecoin network and our work and research.

Please check the following links for more details.
- GitHub: https://github.com/filedrive-team
- Twitter: https://twitter.com/FileDrive1
- Eventbrite: https://www.eventbrite.hk/o/filedrive-labs-42456337463
- YouTube Channel: https://www.youtube.com/channel/UCxcZC1dtBUlQvZY7DX13W1w
- Medium: https://medium.com/@FileDrive1

Is this project associated with other projects/ecosystem stakeholders?

No

If answered yes, what are the other projects/ecosystem stakeholders

No response

Describe the data being stored onto Filecoin

for example, smithsonian Open Access, the Smithsonian’s mission is the "increase and diffusion of knowledge" and has been collecting since 1846. The Smithsonian, through its efforts to digitize its multidisciplinary collections, has created millions of digital assets and related metadata describing the collection objects. On February 25th, 2020, the Smithsonian released over 2.8 million CC0 interdisciplinary 2-D and 3-D images, related metadata, and additionally, research data from researches across the Smithsonian. The 2.8 million "open access" collections are a subset of the Smithsonian’s 155 million objects, 2.1 million library volumes and 156,000 cubic feet of archival collections held in 19 museums, 9 research centers, libraries, archives and the National Zoo. Digitization of collections is ongoing.

Where was the data currently stored in this dataset sourced from

AWS Cloud

If you answered "Other" in the previous question, enter the details here

No response

If you are a data preparer. What is your location (Country/Region)

China

If you are a data preparer, how will the data be prepared? Please include tooling used and technical details?

Graphsplit

If you are not preparing the data, who will prepare the data? (Provide name and business)

No response

Has this dataset been stored on the Filecoin network before? If so, please explain and make the case why you would like to store this dataset again to the network. Provide details on preparation and/or SP distribution.

No

Please share a sample of the data

https://registry.opendata.aws/smithsonian-open-access/

Confirm that this is a public dataset that can be retrieved by anyone on the Network

If you chose not to confirm, what was the reason

No response

What is the expected retrieval frequency for this data

Yearly

For how long do you plan to keep this dataset stored on Filecoin

1 to 1.5 years

In which geographies do you plan on making storage deals

Greater China, Asia other than Greater China, North America, Australia (continent)

How will you be distributing your data to storage providers

Cloud storage (i.e. S3)

How did you find your storage providers

Slack, Others

If you answered "Others" in the previous question, what is the tool or platform you used

Weixin

Please list the provider IDs and location of the storage providers you will be working with.

1, f03214937, US
2, f03151456, China 
3, f03179570, Singapore
4, f03229933, South Korea

How do you plan to make deals to your storage providers

Droplet client

If you answered "Others/custom tool" in the previous question, enter the details here

No response

Can you confirm that you will follow the Fil+ guideline

Yes

datacap-bot[bot] commented 1 month ago

Application is waiting for allocator review

ipfsforcezuofu commented 1 month ago

@stph51 Since it's your first time applying for DC with us, we can only assign 1 PiB initially to ensure compliance. It will be scheduled as 63t, 125t, 250t, and 562t. If this arrangement works for you, please email jiemezhang@gmail.com with information about your organization and SPs for KYB and KYC. Thank you for your collaboration.

stph51 commented 1 month ago

@ipfsforcezuofu Thank you for your response. Could you please verify the information I sent to you via email?

ipfsforcezuofu commented 1 month ago

@stph51 The materials for your organization and SPs have been received and successfully verified. The KYB and KYC processes are now complete. I'm going to assign the first batch 63t.

datacap-bot[bot] commented 1 month ago

Datacap Request Trigger

Total DataCap requested

3PiB

Expected weekly DataCap usage rate

250TiB

DataCap Amount - First Tranche

63TiB

Client address

f3uutkfpsrwph4e47znxeq3pf2yywz7l77hnr7ppoo3k3ohowx72ropumnmc3k7v54x3hq2oaik6pi2n2le5pa

datacap-bot[bot] commented 1 month ago

DataCap Allocation requested

Multisig Notary address

Client address

f3uutkfpsrwph4e47znxeq3pf2yywz7l77hnr7ppoo3k3ohowx72ropumnmc3k7v54x3hq2oaik6pi2n2le5pa

DataCap allocation requested

63TiB

Id

47fda476-57a5-4182-b3fe-35220200fbc3

datacap-bot[bot] commented 1 month ago

Application is ready to sign

datacap-bot[bot] commented 1 month ago

Request Approved

Your Datacap Allocation Request has been approved by the Notary

Message sent to Filecoin Network

bafy2bzacecfnaj3kyzznpe4mysdigt2dmotgtqwm4m3fdl4ozghoql7byh72y

Address

f3uutkfpsrwph4e47znxeq3pf2yywz7l77hnr7ppoo3k3ohowx72ropumnmc3k7v54x3hq2oaik6pi2n2le5pa

Datacap Allocated

63TiB

Signer Address

f1x4nh2yvv2o2wwr4f7l7ocuenz7trdv7z5oqlgni

Id

47fda476-57a5-4182-b3fe-35220200fbc3

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacecfnaj3kyzznpe4mysdigt2dmotgtqwm4m3fdl4ozghoql7byh72y

datacap-bot[bot] commented 1 month ago

Application is Granted

datacap-bot[bot] commented 1 month ago

Client used 75% of the allocated DataCap. Consider allocating next tranche.

ipfsforcezuofu commented 1 month ago

checker:manualTrigger

datacap-bot[bot] commented 1 month ago

DataCap and CID Checker Report[^1]

No active deals found for this client.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

ipfsforcezuofu commented 1 month ago

@stph51 Could you explain why there are no active deals for this client?

stph51 commented 1 month ago

@ipfsforcezuofu We're using Droplet DDO to place orders, but as I understand it, SPARK does not yet support their retrieval.

datacap-bot[bot] commented 1 month ago

Application is in Refill

datacap-bot[bot] commented 1 month ago

Request Approved

Your Datacap Allocation Request has been approved by the Notary

Message sent to Filecoin Network

bafy2bzaceahbgs5o6aflx2bwqh45ut7m4bpny5uqvo3tlgn3samcqt3x4ga4m

Address

f3uutkfpsrwph4e47znxeq3pf2yywz7l77hnr7ppoo3k3ohowx72ropumnmc3k7v54x3hq2oaik6pi2n2le5pa

Datacap Allocated

125TiB

Signer Address

f1x4nh2yvv2o2wwr4f7l7ocuenz7trdv7z5oqlgni

Id

8bfc4baa-2fb5-4d6c-8c6a-37aee355bed5

You can check the status of the message here: https://filfox.info/en/message/bafy2bzaceahbgs5o6aflx2bwqh45ut7m4bpny5uqvo3tlgn3samcqt3x4ga4m

datacap-bot[bot] commented 1 month ago

Application is Granted

datacap-bot[bot] commented 1 month ago

Client used 75% of the allocated DataCap. Consider allocating next tranche.

ipfsforcezuofu commented 1 month ago

checker:manualTrigger

datacap-bot[bot] commented 1 month ago

DataCap and CID Checker Report Summary[^1]

Storage Provider Distribution

⚠️ 1 storage providers sealed more than 90% of total datacap - f02984331: 100.00%

⚠️ All storage providers are located in the same region.

Deal Data Replication

✔️ Data replication looks healthy.

Deal Data Shared with other Clients[^3]

✔️ No CID sharing has been observed.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

[^2]: Deals from those addresses are combined into this report as they are specified with checker:manualTrigger

[^3]: To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

Full report

Click here to view the CID Checker report.

ipfsforcezuofu commented 1 month ago

@stph51 The report highlights an issue with SP distribution. Could you provide the actual distribution details?

stph51 commented 1 month ago

@ipfsforcezuofu Here is the actual distribution for the 180t DC I applied: f02984331: 90t f02883857: 90t

datacap-bot[bot] commented 1 month ago

Application is in Refill

datacap-bot[bot] commented 1 month ago

Request Approved

Your Datacap Allocation Request has been approved by the Notary

Message sent to Filecoin Network

bafy2bzaced5qqfebq7t425gwgqk37iik7zz33o3gfj7iwb4ewmerllxnncwce

Address

f3uutkfpsrwph4e47znxeq3pf2yywz7l77hnr7ppoo3k3ohowx72ropumnmc3k7v54x3hq2oaik6pi2n2le5pa

Datacap Allocated

250TiB

Signer Address

f1x4nh2yvv2o2wwr4f7l7ocuenz7trdv7z5oqlgni

Id

a39c7754-d786-4684-908c-f081e997d316

You can check the status of the message here: https://filfox.info/en/message/bafy2bzaced5qqfebq7t425gwgqk37iik7zz33o3gfj7iwb4ewmerllxnncwce

datacap-bot[bot] commented 1 month ago

Application is Granted

ipfsforcezuofu commented 1 month ago

checker:manualTrigger

datacap-bot[bot] commented 1 month ago

DataCap and CID Checker Report Summary[^1]

Storage Provider Distribution

✔️ Storage provider distribution looks healthy.

⚠️ 50.00% of Storage Providers have retrieval success rate less than 75%.

Deal Data Replication

✔️ Data replication looks healthy.

Deal Data Shared with other Clients[^3]

✔️ No CID sharing has been observed.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

[^2]: Deals from those addresses are combined into this report as they are specified with checker:manualTrigger

[^3]: To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

Full report

Click here to view the CID Checker report.

ipfsforcezuofu commented 1 month ago

@stph51 Could you explain the low retrieval rate for SP f02883857?

stph51 commented 1 month ago

@ipfsforcezuofu This SP is using Boost Non-DDO to place orders. I noticed this issue this morning as well and got response from SP an error message of Boost. They are working on analyzing the issue. Please continue assigning DC today to meet the demand for this weekend. Thank you. 图片

datacap-bot[bot] commented 1 month ago

Client used 75% of the allocated DataCap. Consider allocating next tranche.

datacap-bot[bot] commented 1 month ago

Application is in Refill

datacap-bot[bot] commented 1 month ago

Request Approved

Your Datacap Allocation Request has been approved by the Notary

Message sent to Filecoin Network

bafy2bzacecddfhgco4vhgezxgaweech7v6zghg3mrcewitnzoedulfm3serkg

Address

f3uutkfpsrwph4e47znxeq3pf2yywz7l77hnr7ppoo3k3ohowx72ropumnmc3k7v54x3hq2oaik6pi2n2le5pa

Datacap Allocated

562TiB

Signer Address

f1x4nh2yvv2o2wwr4f7l7ocuenz7trdv7z5oqlgni

Id

f7fea3f0-eca4-484e-a852-8f75bff7592e

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacecddfhgco4vhgezxgaweech7v6zghg3mrcewitnzoedulfm3serkg

datacap-bot[bot] commented 1 month ago

Application is Granted

ipfsforcezuofu commented 1 month ago

checker:manualTrigger

datacap-bot[bot] commented 1 month ago

DataCap and CID Checker Report Summary[^1]

Storage Provider Distribution

✔️ Storage provider distribution looks healthy.

⚠️ 50.00% of Storage Providers have retrieval success rate less than 75%.

Deal Data Replication

✔️ Data replication looks healthy.

Deal Data Shared with other Clients[^3]

✔️ No CID sharing has been observed.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

[^2]: Deals from those addresses are combined into this report as they are specified with checker:manualTrigger

[^3]: To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

Full report

Click here to view the CID Checker report.

datacap-bot[bot] commented 1 month ago

Client used 75% of the allocated DataCap. Consider allocating next tranche.

ipfsforcezuofu commented 1 month ago

@stph51

You claim that one copy is 250 TiB with 4 replicas, yet you requested 3 PiB. Could you explain the 2 PiB discrepancy?

stph51 commented 1 month ago

@ipfsforcezuofu

To be honest, our actual storage demand is 3 PiB. However, after reviewing your first client, we learned that the maximum DC you might assign is 1 PiB. Therefore, we lowered our target in the first batch, hoping to apply for the remaining 2 PiB at a later time.

stph51 commented 1 week ago

@ipfsforcezuofu We plan to store a 10P dataset in Filecoin and hope you can meet our requirements. To support this large data storage, the previous SPs will be replaced with the following new ones, which can immediately fulfill our needs. Could you support? Japan : f03178144 US:f03214937 China: f03151456 Singapore:f03179570 South Korea: f03229933

ipfsforcezuofu commented 1 week ago

@stph51

I can currently allocate only 3P to you. Since all your SPs are new, the first tranche will be 500T to evaluate their performance. If this arrangement works for you, please send their information for KYC. Thank you for your understanding.

stph51 commented 1 week ago

@ipfsforcezuofu

Could you please verify the information for those SPs sent to you via email? thank you.

ipfsforcezuofu commented 1 week ago

@stph51

Thank you for providing the information. The KYC for the 5 SPs has been completed. I'm posting those information for community review, and then allocated the first tranche DC 500T. f03178144:

image

f03151456:

image

f03179570:

image

f03229933:

image

f03214937:

image
datacap-bot[bot] commented 1 week ago

Application is in Refill

datacap-bot[bot] commented 1 week ago

Request Approved

Your Datacap Allocation Request has been approved by the Notary

Message sent to Filecoin Network

bafy2bzaceabmikcyuxa56ymrxav5lg3ubhh7aeq33h6d3y5lcd6mdujrdyrdk

Address

f3uutkfpsrwph4e47znxeq3pf2yywz7l77hnr7ppoo3k3ohowx72ropumnmc3k7v54x3hq2oaik6pi2n2le5pa

Datacap Allocated

500TiB

Signer Address

f1x4nh2yvv2o2wwr4f7l7ocuenz7trdv7z5oqlgni

Id

654f6d2f-a819-4447-9e6f-95f3f434515c

You can check the status of the message here: https://filfox.info/en/message/bafy2bzaceabmikcyuxa56ymrxav5lg3ubhh7aeq33h6d3y5lcd6mdujrdyrdk

datacap-bot[bot] commented 1 week ago

Application is Granted

ipfsforcezuofu commented 6 days ago

checker:manualTrigger

datacap-bot[bot] commented 6 days ago

DataCap and CID Checker Report Summary[^1]

Storage Provider Distribution

⚠️ 2 storage providers sealed more than 25% of total datacap - f02984331: 38.16%, f02883857: 38.25%

⚠️ 100.00% of Storage Providers have retrieval success rate less than 75%.

Deal Data Replication

⚠️ 100.00% of deals are for data replicated across less than 4 storage providers.

Deal Data Shared with other Clients[^3]

✔️ No CID sharing has been observed.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

[^2]: Deals from those addresses are combined into this report as they are specified with checker:manualTrigger

[^3]: To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

Full report

Click here to view the CID Checker report.

datacap-bot[bot] commented 3 days ago

Client used 75% of the allocated DataCap. Consider allocating next tranche.

ipfsforcezuofu commented 3 days ago

checker:manualTrigger

datacap-bot[bot] commented 3 days ago

DataCap and CID Checker Report Summary[^1]

Storage Provider Distribution

⚠️ 2 storage providers sealed more than 25% of total datacap - f02984331: 30.36%, f02883857: 30.44%

⚠️ 2 storage providers sealed too much duplicate data - f03179570: 21.11%, f03229933: 23.01%

⚠️ 60.00% of Storage Providers have retrieval success rate less than 75%.

Deal Data Replication

⚠️ 98.53% of deals are for data replicated across less than 4 storage providers.

Deal Data Shared with other Clients[^3]

✔️ No CID sharing has been observed.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

[^2]: Deals from those addresses are combined into this report as they are specified with checker:manualTrigger

[^3]: To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

Full report

Click here to view the CID Checker report.