filecoin-project / filecoin-plus-large-datasets

Hub for client applications for DataCap at a large scale

[DataCap Application]<Jiangsu Byte ZhiHong Technology Co., Ltd.>-<Cloud storage platform> #1442

Closed leoink closed 1 year ago

leoink commented 1 year ago

Data Owner Name

Jiangsu Byte Zhihong Technology Co., Ltd.

Data Owner Country/Region

China

Data Owner Industry

IT & Technology Services

Website

www.zjzhpool.com

Social Media

www.zjzhpool.com

Total amount of DataCap being requested

1PiB

Weekly allocation of DataCap requested

50TiB

On-chain address for first allocation

f1qysp3gb7j5bjw4jf2n5njtfmmsmc5q36j6gjn7q

Custom multisig

Identifier

No response

Share a brief history of your project and organization

Jiangsu Byte Zhihong Technology Co., Ltd., headquartered in Nanjing, Jiangsu, was established on March 1, 2021, with a registered capital of 10 million yuan. The company provides distributed storage, distributed network, and distributed computing solutions, as well as Web3.0 application development and blockchain technology services, and is committed to building a distributed business ecosystem.
We believe that Filecoin will be an integral part of achieving this vision.
We hope to continue expanding our participation in the network and community.

Is this project associated with other projects/ecosystem stakeholders?

No

If answered yes, what are the other projects/ecosystem stakeholders

No response

Describe the data being stored onto Filecoin

We have a lot of customer data to archive

Where was the data currently stored in this dataset sourced from

My Own Storage Infra

If you answered "Other" in the previous question, enter the details here

No response

How do you plan to prepare the dataset

lotus

If you answered "other/custom tool" in the previous question, enter the details here

No response

Please share a sample of the data

https://pan.baidu.com/s/1jTgF5Enqz4G0g1Enla2-pw (extraction code: zjzh)
https://pan.baidu.com/s/1i-z7TWRN65CggGqUvupDfg (extraction code: zjzh)

Confirm that this is a public dataset that can be retrieved by anyone on the Network

If you chose not to confirm, what was the reason

No response

What is the expected retrieval frequency for this data

Yearly

For how long do you plan to keep this dataset stored on Filecoin

1 to 1.5 years

In which geographies do you plan on making storage deals

Greater China

How will you be distributing your data to storage providers

Shipping hard drives

How do you plan to choose storage providers

Big data exchange

If you answered "Others" in the previous question, what is the tool or platform you plan to use

No response

If you already have a list of storage providers to work with, fill out their names and provider IDs below

No response

How do you plan to make deals to your storage providers

Boost client

If you answered "Others/custom tool" in the previous question, enter the details here

No response

Can you confirm that you will follow the Fil+ guideline

Yes

large-datacap-requests[bot] commented 1 year ago

Thanks for your request! Everything looks good. :ok_hand:

A Governance Team member will review the information provided and contact you back pretty soon.

Sunnyiscoming commented 1 year ago

The website of your organization shows that you operate an E-commerce platform. The data samples you provided don't seem to have much to do with your business. Can you explain that? Are you authorized to store users' data? Can you explain the composition of the user data and provide sufficient data samples separately? How many copies will you store? What's the relationship between you and the organization? Can you provide more detailed information about the other storage providers participating in this program, for example a list of the SPs you have contacted so far? Can the SPs you choose support data retrieval?

leoink commented 1 year ago

Thank you for your reply.

The website of your organization shows that you operate an E-commerce platform. The data samples you provided don't seem to have much to do with your business. Can you explain that?

The website belongs to another line of business of the company. The company also provides distributed storage solutions, and the data samples come from that storage business. https://pan.baidu.com/s/1Ir7hZE3GSekioR-DAfTNrw (extraction code: zjzh) This is the copyright registration certificate for some of our storage-related software.

Are you authorized to store users' data? Can you explain the composition of the user data and provide sufficient data samples separately? How many copies will you store?

Yes. https://pan.baidu.com/s/1L8lTVYt5paUQnkjhEzb8fg (extraction code: zjzh) This is data stored by users. It consists of playback backups of livestreams from live broadcast platforms (such as Twitch). We plan to store 1-2 copies, maybe more.

What's the relationship between you and the organization?

Partner.

Can you provide more detailed information about the other storage providers participating in this program, for example a list of the SPs you have contacted so far?

http://www.sunrisegroup.com.cn/software2_detail.aspx?t=100&cid=93

Can the SPs you choose support data retrieval?

Yes.

simonkim0515 commented 1 year ago

Datacap Request Trigger

Total DataCap requested

1PiB

Expected weekly DataCap usage rate

50TiB

Client address

f1qysp3gb7j5bjw4jf2n5njtfmmsmc5q36j6gjn7q

large-datacap-requests[bot] commented 1 year ago

DataCap Allocation requested

Multisig Notary address

f02049625

Client address

f1qysp3gb7j5bjw4jf2n5njtfmmsmc5q36j6gjn7q

DataCap allocation requested

25TiB

Id

7d0ad56a-03f1-4882-828f-54df7169124f

filplus-checker-app[bot] commented 1 year ago

DataCap and CID Checker Report[^1]

There is no previous allocation for this issue.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

herrehesse commented 1 year ago

Dear Applicant,

Due to the recent increase in erroneous Filecoin+ data, we feel compelled, on behalf of the entire community, to look more deeply into DataCap requests. The aim is to ensure that the overall value of the Filecoin network and the Filecoin+ program increases and that the program is not abused.

Please answer the questions below as comprehensively as possible.

Customer data

We expect that onboarding customers at the scale of an LDN would have been preceded by multiple emails and perhaps several chat conversations. A single email with an agreement does not qualify here.

Should this be solely for acquiring DataCap, it is of course out of the question. The customer must have a legitimate reason for wanting to use the Filecoin+ program, which is intended as a program to store useful and public datasets on the network.

(As an intermediate solution Filecoin offers the FIL-E program or the glif.io website for business datasets that do not meet the requirements for a Filecoin+ dataset)

Files and Processing

Hopefully you understand the community's caution about onboarding the wrong data. We understand the increased need for Filecoin+; however, we must not allow the program to be misused. Everything depends on a valuable and useful network, so let's do our best to make this happen. Together.

leoink commented 1 year ago

Could you demonstrate exactly how and to what extent customer contact occurred?

This is a public dataset, and the dataset can be onboarded without customer contact. The other questions are hence not applicable.

Why is the customer data considered Filecoin+ eligible?

Because it is a public dataset.

Would you tell us how the data set preparer takes into account the prevention of duplicates in order to prevent data cap abuse?

We will use a program we developed ourselves to distribute the data, so there will usually be no duplicate transactions.

@herrehesse

herrehesse commented 1 year ago

@leoink your answers do not align with your datacap requests. Please explain:

“We have a lot of customer data to archive”

Why is this a “public dataset”?

leoink commented 1 year ago

For example, https://pan.baidu.com/s/1L8lTVYt5paUQnkjhEzb8fg (extraction code: zjzh) This is data stored by users. It consists of playback backups of livestreams from live broadcast platforms (such as Twitch). We plan to store 1-2 copies, maybe more.

The data itself is published on the web and available for public inquiry. @herrehesse

herrehesse commented 1 year ago

@leoink Thank you for providing a sample and an explanation, appreciated. I do not fully understand why this data fits the narrative of "storing humanity's most important information". I can set up a permanent copy of YouTube or Twitch channels and store them with 5 copies on the Filecoin network with DataCap, but this is NOT what the Filecoin+ program is meant for.

Please, if you want to store livestreams, use regular deals.

@raghavrmadya @Sunnyiscoming

cryptowhizzard commented 1 year ago

Dear applicant,

Thank you for applying for DataCap. As a Filecoin FIL+ notary, I am screening your application and conducting due diligence.

Can you show us visible proof of the size of your data and the storage systems you have there?

As a last question, I would like you to fill out this form so that we have the information needed to make an educated decision on whether to support your LDN request.

Thanks!

github-actions[bot] commented 1 year ago

This application has not seen any responses in the last 10 days. This issue will be marked with Stale label and will be closed in 4 days. Comment if you want to keep this application open.

github-actions[bot] commented 1 year ago

This application has not seen any responses in the last 14 days, so for now it is being closed. Please feel free to contact the Fil+ Gov team to re-open the application if it is still being processed. Thank you!