filecoin-project / filecoin-plus-large-datasets

Hub for client applications for DataCap at a large scale

[DataCap Application] <Xingchi-Media> - [ 9 / 10 ] #1059

Closed Xingchi-Media closed 1 year ago

Xingchi-Media commented 2 years ago

Large Dataset Notary Application

To apply for DataCap to onboard your dataset to Filecoin, please fill out the following.

Core Information

Please respond to the questions below by replacing the text saying "Please answer here". Include as much detail as you can in your answer.

Project details

Share a brief history of your project and organization.

Founded in 2013, our company is a full-process content production service company for program development and production. The core creative team has more than 10 years of industry experience and has worked in depth on many projects with Korean, British, American and other program producers. Each year it provides content planning, format development, directing, screenwriting, post-production and other services for more than 30 top domestic variety shows. In 2019, business income exceeded 200 million yuan and the production staff exceeded 200 people.
Founded in 2016, Xingchi College is a training and education institution for film and television post-production professionals. Drawing on rich post-production experience, a large body of original program material, and a carefully designed curriculum, it provides students with a professional and practical film and television production education. To date, it has placed more than 800 film and television production professionals with major satellite TV channels and TV program production companies.

What is the primary source of funding for this project?

Own enterprise funds.

What other projects/ecosystem stakeholders is this project associated with?

No.

Use-case details

Describe the data being stored onto Filecoin

The dataset comes from more than 200 TV variety shows we have produced since 2013.
It consists mainly of variety show video materials that we produced under commercial authorization.

Where was the data in this dataset sourced from?

More than 200 TV variety shows produced by us since 2013, including 115 cooperation projects with TV stations and 87 cooperation projects with network platforms.
The materials include variety show post-production, stage multimedia design, logo design, variety show special effects, program title and ending design, and program packaging design.

Can you share a sample of the data? A link to a file, an image, a table, etc., are good ways to do this.

Project Dataset
Datasets of 150 variety show projects completed by us since 2013.
https://docs.google.com/spreadsheets/d/1G22bmWv5rKlx83OQIIG2pcTJEnG2EUsFi28xZ1WQvnU/edit?usp=sharing

Confirm that this is a public dataset that can be retrieved by anyone on the Network (i.e., no specific permissions or access rights are required to view the data).

No, the data in this LDN is not public. Much of it is behind-the-scenes footage and variety shows that have not yet been broadcast, which involves commercial copyright and celebrity privacy. Therefore, encryption is required.

What is the expected retrieval frequency for this data?

Once a year, mainly for disaster recovery.

For how long do you plan to keep this dataset stored on Filecoin?

We want permanent storage.

DataCap allocation plan

In which geographies (countries, regions) do you plan on making storage deals?

There is no regional restriction. 

How will you be distributing your data to storage providers? Is there an offline data transfer process?

Offline transmission
Because the dataset is very large, we only support offline transmission.

How do you plan on choosing the storage providers with whom you will be making deals? This should include a plan to ensure the data is retrievable in the future both by you and others.

We confirm that the SPs we will cooperate with are:

[image: list of cooperating SPs]

How will you be distributing deals across storage providers?

Hard drive by mail, offline transfer.

Do you have the resources/funding to start making deals as soon as you receive DataCap? What support from the community would help you onboard onto Filecoin?

Yes, we already prepared enough funding for making deals.
In addition, the Filecoin Foundation will give us some help.
large-datacap-requests[bot] commented 2 years ago

Thanks for your request!

Heads up, you’re requesting more than the typical weekly onboarding rate of DataCap!
large-datacap-requests[bot] commented 2 years ago

Thanks for your request! Everything looks good. :ok_hand:

A Governance Team member will review the information provided and contact you back pretty soon.

kevzak commented 2 years ago

@raghavrmadya FYI this private dataset is meant for E-Fil+ Pilot

Fenbushi-Filecoin commented 2 years ago

KYB info looks all good from Qcc.

kevzak commented 2 years ago

Yes, I can also confirm a positive business check from Qcc inquiry.

kevzak commented 2 years ago

As a next step, I would ask @Xingchi-Media to have five E-Fil notaries publicly document support in the comment section for their application and exception proposal info.

NDLABS-Leo commented 2 years ago

I am very happy to see the submission of the e+fil application. If necessary, please contact me on slack: ND LABS - OFFICE

Fatman13 commented 2 years ago

I would like to support the KYB process of this fil+e application if my service is still needed.

Fatman13 commented 2 years ago

Hello, @kevzak, I have verified the documentation submitted in our private Slack chat and confirm that this is a legit business operating in China.

Xingchi-Media commented 2 years ago

We have confirmed the SP investors, who come from "Zhongheyi", "Lucky Technology", "Jiuzhou Cloud", "Chengxin Cloud", "Huicun IPFS", and "Time Space Technology".

| SP ID | SP region |
| --- | --- |
| f0753213 | Xiamen |
| f0845552 | Xiamen |
| f01699999 | Jiangmen |
| f0744513 | Guangzhou |
| f066270 | Dongguan |
| f0121260 | Nanning |
| f01527777 | Nanning |
| f01777785 | Singapore |
| f01920091 | HK |

PS: Some investors may choose to create new nodes. We will update the list whenever that happens.

liyunzhi-666 commented 2 years ago

I have verified the relevant information of this client in the slack group.

NDLABS-Leo commented 2 years ago

I verified the KYB of this business in slack

kevzak commented 2 years ago

Hello - I can confirm for the Fil+ team that @Xingchi-Media has completed the Client Registration and Business Verification Check successfully. 5 Notaries reviewed the business and have shown support for this specific application (#9/10), which is private data.

Assigned notaries can proceed to review the application and the exceptions proposal

kevzak commented 2 years ago

Datacap Request Trigger

Total DataCap requested

5PiB

Expected weekly DataCap usage rate

200TiB

Client address

f1ecr5djz7v4lgae67fqrwtn6qkpz2zh2gfop5xci

large-datacap-requests[bot] commented 2 years ago

DataCap Allocation requested

Multisig Notary address

f01940930

Client address

f1ecr5djz7v4lgae67fqrwtn6qkpz2zh2gfop5xci

DataCap allocation requested

100TiB

Id

a43ba2d3-b4f5-4c76-be94-f2ffe274e528

large-datacap-requests[bot] commented 2 years ago

Hello @davidthoms - @NDLABS-OFFICE , please sign the datacap request

Neal-fil commented 2 years ago

Hi @kevzak, have you confirmed with all the SPs within the final list #1051 about their participation?

Xingchi-Media commented 2 years ago

Tomorrow we will ask the participating SPs to transfer 0.01 FIL to f1vnfbc3siofrens2inl66gjzevqrto5wlsray6gy (the DataCap address of #1051) from their Owner/Worker wallet addresses to prove their participation.

kevzak commented 2 years ago

> Hi @kevzak, have you confirmed with all the SPs within the final list #1051 about their participation?

Hi @Neal-fil, I have not confirmed participation with the SPs listed in the exceptions proposal. What mechanism do you recommend for this? How do we contact SPs by their miner IDs?

Currently, the program asks the client to list all SPs involved in the project along with their details. To my understanding, Xingchi-Media has done that. (I have seen your comments asking for an updated list. If there is one, @Xingchi-Media should post it and communicate openly.)

I'm not a notary, but my understanding of this is that if the applicants do not follow their plan as described, using the SPs listed in their plan, then notaries will discontinue Datacap allocations.

Neal-fil commented 2 years ago

@kevzak, thanks for more details.

  1. I would recommend that you confirm with each SP, based on the information provided to your team. https://github.com/filecoin-project/filecoin-plus-large-datasets/issues/1051#issuecomment-1334003183
  2. As this application involves many notaries who are also SPs, I would also suggest that you sync up with the trust and transparency team and run periodic checks afterward to prevent any violations.
Xingchi-Media commented 2 years ago

Hey, @kevzak @liyunzhi-666 @Fenbushi-Filecoin @Fatman13 @NDLABS-OFFICE @newwebgroup

The nodes held by SPs have been verified by transfer

https://filfox.info/en/address/f1vnfbc3siofrens2inl66gjzevqrto5wlsray6gy

Next steps:
1. We will send the SP correspondence list to the governance team.
2. We will prepare the data and start generating the CAR files.
3. Offline data transmission.
4. Contact the notaries to sign for us.
5. Start loading the dataset into Filecoin.

kevzak commented 2 years ago

@davidthoms - @NDLABS-OFFICE , please proceed as assigned E-Fil Notaries to review this application, thank you

kevzak commented 2 years ago

Datacap Request Trigger

Total DataCap requested

5PiB

Expected weekly DataCap usage rate

200TiB

Client address

f1ecr5djz7v4lgae67fqrwtn6qkpz2zh2gfop5xci

large-datacap-requests[bot] commented 2 years ago

DataCap Allocation requested

Multisig Notary address

f01940930

Client address

f1ecr5djz7v4lgae67fqrwtn6qkpz2zh2gfop5xci

DataCap allocation requested

100TiB

Id

16c4b666-3207-4ced-b192-5bd1d8279770

large-datacap-requests[bot] commented 2 years ago

Hello @flyworker - @jamerduhgamer , please sign the datacap request

kevzak commented 2 years ago

It's been more than 2 business days. No review or comments from @NDLABS-OFFICE @davidthoms. So we've assigned two additional Notaries to review. See all details above. Let me know if you have any questions.

kevzak commented 1 year ago

Hello - I can confirm for the Fil+ team that @Xingchi-Media has completed the Client Registration and Business Verification Check successfully. 5 Notaries reviewed the business and have shown support for this specific application (#9/10), which is private data.

Assigned notaries can proceed to review the application and the exceptions proposal: https://github.com/filecoin-project/notary-governance/issues/782

bmcnabb25 commented 1 year ago

Request Proposed

Your Datacap Allocation Request has been proposed by the Notary

Message sent to Filecoin Network

bafy2bzaceazhwhfms3dot6cje2bpk3ipstic33litgz7cep53kc4spdp2hzaq

Address

f1ecr5djz7v4lgae67fqrwtn6qkpz2zh2gfop5xci

Datacap Allocated

100.00TiB

Signer Address

f1jqk7xok5kautet2knhwlg74jvcfbrqlj47kbp2i

Id

16c4b666-3207-4ced-b192-5bd1d8279770

You can check the status of the message here: https://filfox.info/en/message/bafy2bzaceazhwhfms3dot6cje2bpk3ipstic33litgz7cep53kc4spdp2hzaq

bmcnabb25 commented 1 year ago

Client registration and KYB complete. Application and exceptions proposal checked out. Willing to support first tranche and monitor

herrehesse commented 1 year ago

Dear Applicant,

Due to the recent increase in erroneous Filecoin+ data, we feel compelled, on behalf of the entire community, to look more closely at DataCap requests. The aim is to ensure that the overall value of the Filecoin network and the Filecoin+ program increases and that the program is not abused.

Please answer the questions below as comprehensively as possible.

Customer data

We expect that for the onboarding of customers with the scale of an LDN there would have been at least multiple email and perhaps several chat conversations preceding it. A single email with an agreement does not qualify here.

If the application is solely for acquiring DataCap, it is of course out of the question. The customer must have a legitimate reason for wanting to use the Filecoin+ program, which is intended to store useful and public datasets on the network.

(As an intermediate solution Filecoin offers the FIL-E program or the glif.io website for business datasets that do not meet the requirements for a Filecoin+ dataset)

Files and Processing

Hopefully you understand the caution the overall community has for onboarding the wrong data. We understand the increased need for Filecoin+, however, we must not allow the program to be misused. Everything depends on a valuable and useful network, let's do our best to make this happen. Together.

kevzak commented 1 year ago

@herrehesse for your info, this is an E-Fil+ application.

Xingchi-Media commented 1 year ago

Dear Notaries, regarding #1059: we will cooperate with the following SPs.

[image: list of cooperating SPs]

kevzak commented 1 year ago

Thank you @Xingchi-Media for providing a list of SPs that will be storing copies of the data.

Let's move forward with the updated SLA for E-Fil+ Notaries and assign randomly: @Fenbushi-Filecoin

Please review the application details for the Client, Data, and SPs and leave a comment. Please also note that this client passed a manual KYB check, where 5 notaries reviewed business documents and confirmed the company and applicant.

If you choose not to sign, please leave a comment explaining why so I can assign another notary if needed.

Fenbushi-Filecoin commented 1 year ago

> Hey, @kevzak @liyunzhi-666 @Fenbushi-Filecoin @Fatman13 @NDLABS-OFFICE @newwebgroup
>
> The nodes held by SPs have been verified by transfer
>
> https://filfox.info/en/address/f1vnfbc3siofrens2inl66gjzevqrto5wlsray6gy
>
> Next steps:
> 1. We will send the SP correspondence list to the governance team.
> 2. We will prepare the data and start generating the CAR files.
> 3. Offline data transmission.
> 4. Contact the notaries to sign for us.
> 5. Start loading the dataset into Filecoin.

Just want to confirm whether the corresponding relationship of SPs is sent to the governance team @kevzak.

kevzak commented 1 year ago

It says it was sent to @cryptowizzard and @raghavrmadya. https://github.com/filecoin-project/filecoin-plus-large-datasets/issues/1051#issuecomment-1410152562

@Fenbushi-Filecoin if you want to review more details, you can ask @Xingchi-Media . Thank you.

Xingchi-Media commented 1 year ago

I sent the correspondence of SP to RG, KZ and the relevant notary. If you need it, I'll send it to you on Slack. @Fenbushi-Filecoin

Xingchi-Media commented 1 year ago

Sent to you via Slack, please check @Fenbushi-Filecoin

Fenbushi-Filecoin commented 1 year ago

> Sent to you via Slack, please check @Fenbushi-Filecoin

Received. Looks legit. We will expect you to store only on the listed SPs.

Fenbushi-Filecoin commented 1 year ago

Request Approved

Your Datacap Allocation Request has been approved by the Notary

Message sent to Filecoin Network

bafy2bzacebbtr2jjmm3rhshd2hvwalv2npvhvnnp4gwbmkkq7fauyyaco3mxq

Address

f1ecr5djz7v4lgae67fqrwtn6qkpz2zh2gfop5xci

Datacap Allocated

100.00TiB

Signer Address

f1yqydpmqb5en262jpottko2kd65msajax7fi4rmq

Id

16c4b666-3207-4ced-b192-5bd1d8279770

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacebbtr2jjmm3rhshd2hvwalv2npvhvnnp4gwbmkkq7fauyyaco3mxq

large-datacap-requests[bot] commented 1 year ago

DataCap Allocation requested

Request number 3

Multisig Notary address

f01940930

Client address

f1ecr5djz7v4lgae67fqrwtn6qkpz2zh2gfop5xci

DataCap allocation requested

400TiB

Id

14595b3c-8087-4e41-b586-a1d1d5f520d0

large-datacap-requests[bot] commented 1 year ago

Stats & Info for DataCap Allocation

Multisig Notary address

f01940930

Client address

f1ecr5djz7v4lgae67fqrwtn6qkpz2zh2gfop5xci

Last two approvers

Fenbushi-Filecoin & bmcnabb25

Rule to calculate the allocation request amount

200% of weekly dc amount requested

DataCap allocation requested

400TiB

Total DataCap granted for client so far

100TiB

Datacap to be granted to reach the total amount requested by the client (5PiB)

4.90PiB

Stats

| Number of deals | Number of storage providers | Previous DC Allocated | Top provider | Remaining DC |
| --- | --- | --- | --- | --- |
| null | null | 100TiB | null | 9.25TiB |
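
As a rough cross-check of the figures in this report, here is a minimal Python sketch. It assumes 1 PiB = 1024 TiB and that the "200% of weekly dc amount requested" rule is applied to the 200TiB weekly rate from the request trigger. The function names are illustrative and are not part of the bot.

```python
TIB_PER_PIB = 1024  # assumption: binary units, 1 PiB = 1024 TiB


def allocation_from_weekly_rate(weekly_rate_tib: float, percent: float) -> float:
    """Allocation size as a percentage of the expected weekly usage rate (TiB)."""
    return weekly_rate_tib * percent / 100


def remaining_to_grant_pib(total_requested_pib: float, granted_tib: float) -> float:
    """DataCap still to be granted before the total request is reached (PiB)."""
    return (total_requested_pib * TIB_PER_PIB - granted_tib) / TIB_PER_PIB


if __name__ == "__main__":
    # 200% of the 200TiB weekly rate
    print(allocation_from_weekly_rate(200, 200))        # 400.0
    # 5PiB requested in total, 100TiB granted so far
    print(f"{remaining_to_grant_pib(5, 100):.2f} PiB")  # 4.90 PiB
```

Both results agree with the 400TiB allocation request and the 4.90PiB remainder shown above.
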
filplus-checker-app[bot] commented 1 year ago

DataCap and CID Checker Report[^1]

Storage Provider Distribution

The below table shows the distribution of storage providers that have stored data for this client.

If this is the first time a provider takes a verified deal, it will be marked as new.

For most DataCap applications, the restrictions below should apply.

Since this is the 3rd allocation, the following restrictions have been relaxed:

✔️ Storage provider distribution looks healthy.

| Provider | Location | Total Deals Sealed | Percentage | Unique Data | Duplicate Deals |
| --- | --- | --- | --- | --- | --- |
| f01520487 | Xiamen, Fujian, CN (China Mobile Communications Group Co., Ltd.) | 16.25 TiB | 34.28% | 16.25 TiB | 0.00% |
| f01527777 | Nanning, Guangxi, CN (China Telecom) | 16.25 TiB | 34.28% | 16.25 TiB | 0.00% |
| f02023435 | Hong Kong, Central and Western, HK (Diyixian.com Limited) | 14.91 TiB | 31.44% | 12.91 TiB | 13.42% |
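
The percentages in this table can be approximately reproduced from the sizes. The sketch below is an assumption about how the columns relate, not the checker's actual code: it treats "Percentage" as a provider's share of all sealed deals and "Duplicate Deals" as the sealed data that is not unique, divided by the provider's sealed total. Because the inputs are already rounded to two decimals, the recomputed values can differ from the report in the last digit.

```python
# (provider, total deals sealed in TiB, unique data in TiB), copied from the table
PROVIDERS = [
    ("f01520487", 16.25, 16.25),
    ("f01527777", 16.25, 16.25),
    ("f02023435", 14.91, 12.91),
]


def share_percent(sealed_tib: float, all_sealed_tib: float) -> float:
    """Provider's share of all sealed deal data, in percent."""
    return sealed_tib / all_sealed_tib * 100


def duplicate_percent(sealed_tib: float, unique_tib: float) -> float:
    """Portion of a provider's sealed data that is duplicated, in percent."""
    return (sealed_tib - unique_tib) / sealed_tib * 100


if __name__ == "__main__":
    all_sealed = sum(sealed for _, sealed, _ in PROVIDERS)
    for provider, sealed, unique in PROVIDERS:
        print(
            provider,
            f"share={share_percent(sealed, all_sealed):.2f}%",
            f"duplicates={duplicate_percent(sealed, unique):.2f}%",
        )
```
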

Deal Data Replication

The table below shows how much unique data is replicated across how many storage providers.

Since this is the 3rd allocation, the following restrictions have been relaxed:

⚠️ 88.33% of deals are for data replicated across less than 3 storage providers.

| Unique Data Size | Total Deals Made | Number of Providers | Deal Percentage |
| --- | --- | --- | --- |
| 11.06 TiB | 13.06 TiB | 1 | 27.55% |
| 14.41 TiB | 28.81 TiB | 2 | 60.78% |
| 1.84 TiB | 5.53 TiB | 3 | 11.67% |
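
The 88.33% figure in the warning follows from the "Total Deals Made" column. A minimal sketch, assuming the deal percentage is each row's share of all deal data; the names are illustrative and not taken from the checker:

```python
# (total deals made in TiB, number of providers holding that data), from the table
REPLICATION_ROWS = [
    (13.06, 1),
    (28.81, 2),
    (5.53, 3),
]


def share_below(rows: list[tuple[float, int]], min_providers: int) -> float:
    """Percent of deal data replicated across fewer than `min_providers` providers."""
    total = sum(size for size, _ in rows)
    below = sum(size for size, providers in rows if providers < min_providers)
    return below / total * 100


if __name__ == "__main__":
    print(f"{share_below(REPLICATION_ROWS, 3):.2f}%")  # 88.33%
```

This is the sum of the first two rows (27.55% + 60.78%), which is why only the 3-provider row counts toward healthy replication.
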

Deal Data Shared with other Clients

The table below shows how much unique data is shared with other clients. Usually, different applications own different data, so their data should not resolve to the same CID.

However, this is possible if all of the clients below use the same software to prepare the exact same dataset, or if they belong to a series of LDN applications for the same dataset.

✔️ No CID sharing has been observed.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

kevzak commented 1 year ago

Let's move forward with the updated SLA for E-Fil+ Notaries (https://github.com/filecoin-project/notary-governance/discussions/807) and assign randomly: @BlockMakeronline @MatrixStorage @MRJAVAZHAO @1475notary

Remember, this is a second allocation. Please review the usage so far via the CID checker report above and the SP list provided at https://github.com/filecoin-project/filecoin-plus-large-datasets/issues/1059#issuecomment-1410306583, and leave comments or questions as needed. Please leave at least a comment by Feb 9th.

If you choose not to sign, please leave a comment explaining why so I can assign another notary if needed.