filecoin-project / filecoin-plus-large-datasets

Hub for client applications for DataCap at a large scale

[DataCap Application] <Shenzhen Weite Network Culture Co., Ltd.> - <video archive> #1436

Closed hhhhxiao closed 11 months ago

hhhhxiao commented 1 year ago

Data Owner Name

Shenzhen Weite Network Culture Co., Ltd.

Data Owner Country/Region

China

Data Owner Industry

Information, Media & Telecommunications

Website

http://www.oktrust.com.cn/

Social Media

http://www.oktrust.com.cn/

Total amount of DataCap being requested

5PiB

Weekly allocation of DataCap requested

300TiB

On-chain address for first allocation

f16xlgzogkkd5qm44uftfprkvh44gu4ruo7dmngca

Custom multisig

Identifier

No response

Share a brief history of your project and organization

Shenzhen Weite Network Culture Co., Ltd. was established in 2017. We work in e-commerce, live broadcasting, short-video shooting, corporate promotional videos, and related businesses.
We are a full-service advertising and communications agency with creative imagery at the core and all-media "one-stop" communication as the channel. Our integrated brand communication work covers creative film and television production, corporate brand building, new-media interactive marketing, and more.
Video: As a professional commercial video production and distribution agency, Weite Media always treats maximizing brand communication as the basis of its creative work. It has industry-leading directors and production team resources, and it has provided hundreds of film and television works to large and medium-sized foreign enterprises.
Brand: From communication strategy to advertisement creation and placement, Weite Media provides customers with a one-stop, full-service offering. With accurate consumer insight, strategic thinking, creative expression, and integrated communication capabilities, we create greater business value and social influence for customers in the all-media era.
New Media: Weite Media focuses on the most advanced new-media marketing model of the "Internet +" era. We integrate interactive media, advertising creativity, and innovative technology, and provide the most complete innovative marketing services in the industry with rigorous execution and a professional attitude. We connect customers, brands, products, users, and society, and create new brand experiences through communication in the interactive field.

Is this project associated with other projects/ecosystem stakeholders?

No

If answered yes, what are the other projects/ecosystem stakeholders

No response

Describe the data being stored onto Filecoin

We have many live-broadcast and e-commerce videos that need to be archived.

Where was the data currently stored in this dataset sourced from

My Own Storage Infra

If you answered "Other" in the previous question, enter the details here

No response

How do you plan to prepare the dataset

lotus

If you answered "other/custom tool" in the previous question, enter the details here

No response

Please share a sample of the data

https://drive.google.com/file/d/1XAqPnIuVMC8tisHbDmcZgVsnHltJfObl/view?usp=share_link
https://drive.google.com/file/d/15A9rGG0mUr3DTUD8TWg7nVuZErdRCGcv/view?usp=share_link
https://drive.google.com/file/d/1AW4BXp_uc0ZdhbKXYxzE_F3acesZaA_4/view?usp=share_link

Confirm that this is a public dataset that can be retrieved by anyone on the Network

If you chose not to confirm, what was the reason

No response

What is the expected retrieval frequency for this data

Yearly

For how long do you plan to keep this dataset stored on Filecoin

More than 3 years

In which geographies do you plan on making storage deals

Greater China

How will you be distributing your data to storage providers

Shipping hard drives

How do you plan to choose storage providers

Slack

If you answered "Others" in the previous question, what is the tool or platform you plan to use

No response

If you already have a list of storage providers to work with, fill out their names and provider IDs below

No response

How do you plan to make deals to your storage providers

Boost client, Lotus client

If you answered "Others/custom tool" in the previous question, enter the details here

No response

Can you confirm that you will follow the Fil+ guideline

Yes

large-datacap-requests[bot] commented 1 year ago

Thanks for your request!

Heads up, you’re requesting more than the typical weekly onboarding rate of DataCap!

large-datacap-requests[bot] commented 1 year ago

Thanks for your request! Everything looks good. :ok_hand:

A Governance Team member will review the information provided and contact you back pretty soon.

Sunnyiscoming commented 1 year ago

The data samples you provided are from https://space.bilibili.com/515848288?spm_id_from=333.337.0.0. What is the relationship between the bilibili channel and the organization? Can you provide relevant materials? The website is recorded under 成都西维数码科技有限公司. Can you explain that? Can you explain your data composition and provide sufficient data samples separately? How many copies will you store? What is the relationship between you and the organization? Can you provide more detailed information about the storage providers participating in this program, for example a list of the SPs you have contacted so far? Can the SPs you choose support data retrieval?

hhhhxiao commented 1 year ago

Hi @Sunnyiscoming, thank you for your questions. Our answers are below.

The data samples you provided are from https://space.bilibili.com/515848288?spm_id_from=333.337.0.0. What is the relationship between the bilibili channel and the organization?

We have signed many anchors and other talent, and we upload recordings of their live broadcasts to media platforms such as bilibili and TikTok to increase their exposure.

Can you provide relevant materials? The website is recorded under 成都西维数码科技有限公司. Can you explain that?

成都西维数码科技有限公司 is not a domain name filing company but a domain name registration agency, similar to Alibaba Cloud and GoDaddy.

Can you explain your data composition and provide sufficient data samples separately? How many copies will you store?

The data consists mainly of live broadcast recordings and videos shot by the company's anchors and talent. We have uploaded an additional data sample: https://drive.google.com/file/d/1suQ0so60JweJ9QrqZyxM5Fc6jJDHVCXd/view?usp=share_link We plan to store 3-5 copies, possibly more.

What is the relationship between you and the organization?

We are a partner; we help these companies, which hold large amounts of data, store it on chain for the long term.

Can you provide more detailed information about the storage providers participating in this program, for example a list of the SPs you have contacted so far?

We have not contacted any SPs yet. We plan to find suitable SPs through Slack.

Can the SPs you choose support data retrieval?

Yes.

Sunnyiscoming commented 1 year ago

Could you send an email to filplus-app-review@fil.org from your official domain to confirm your identity? The email title should include the issue ID #1436.

hhhhxiao commented 1 year ago

Could you send an email to filplus-app-review@fil.org from your official domain to confirm your identity? The email title should include the issue ID #1436.

Hi @Sunnyiscoming, we have sent an email to filplus-app-review@fil.org as requested. (Attached image: WeChat screenshot 20221221152408)

simonkim0515 commented 1 year ago

Datacap Request Trigger

Total DataCap requested

5PiB

Expected weekly DataCap usage rate

300TiB

Client address

f16xlgzogkkd5qm44uftfprkvh44gu4ruo7dmngca

large-datacap-requests[bot] commented 1 year ago

DataCap Allocation requested

Multisig Notary address

f01858410

Client address

f16xlgzogkkd5qm44uftfprkvh44gu4ruo7dmngca

DataCap allocation requested

150TiB

Id

bedc95da-280c-4249-8b08-41d47191ac7e

filplus-checker-app[bot] commented 1 year ago

DataCap and CID Checker Report[^1]

There is no previous allocation for this issue.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

cryptowhizzard commented 1 year ago

Dear applicant,

Thank you for applying for DataCap. As a Filecoin FIL+ notary, I am screening your application and conducting due diligence.

Looking at your application, I have some questions. You are brand new on GitHub and have no history of past applications, so applying for 5PiB of DataCap seems like a lot to me. It requires comprehensive knowledge of Filecoin, data packing, data distribution, and all the requirements that come with them. Are you brand new to the Filecoin space, or have you applied for DataCap in the past under different GitHub account names?

Can you show us visible proof of the size of your data and of the storage systems you have?

As a last question, I would like you to fill out this form to give us the information we need to make an educated decision on whether to support your LDN request.

Thanks!

hhhhxiao commented 1 year ago

Hi @cryptowhizzard, thank you for your question. We have not applied for DataCap before. Below is information about our storage system. We have several other groups of similar servers, and I can provide more details if needed. (Attached image: WeChat image edit 20230215144623)

cryptowhizzard commented 1 year ago

Thanks

Will you fill out the form?

hhhhxiao commented 1 year ago

Hi @cryptowhizzard, I have filled out the form and submitted it successfully.

cryptowhizzard commented 1 year ago

Hi @hhhhxiao

Thanks for the help and the form. Well received.

I take it that you are making preparations and are building the dataset. Good to hear!

Let me know when you have SPs with whom you want to store the data. I will screen them for you to make sure they have not been involved in FIL+ abuse. You can email their info to kyc@dcent.nl

Thanks!

NDLABS-Leo commented 1 year ago

Please explain your current data details, overall data volume, weekly storage volume, etc. and provide relevant evidence

hhhhxiao commented 1 year ago

Please explain your current data details, overall data volume, weekly storage volume, etc. and provide relevant evidence

Hi @NDLABS-OFFICE, thank you for your question. The relevant information is as follows.

Data details: live & e-commerce videos
Data volume: 1100TiB
Weekly storage volume: 300TiB

(Attached image: WeChat image 20230310104934)

NDLABS-Leo commented 1 year ago

@hhhhxiao Ok, I'll do the first round of support to help you get started with storage

NDLABS-Leo commented 1 year ago

Request Proposed

Your Datacap Allocation Request has been proposed by the Notary

Message sent to Filecoin Network

bafy2bzacedhq5bfhln3vmkjx7rv2s6p5g5c7qteynbdco52cozrfwquvpdwto

Address

f16xlgzogkkd5qm44uftfprkvh44gu4ruo7dmngca

Datacap Allocated

150.00TiB

Signer Address

f1yayfsv6whu3rheviucvventj3y6t542xfpb47ei

Id

bedc95da-280c-4249-8b08-41d47191ac7e

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacedhq5bfhln3vmkjx7rv2s6p5g5c7qteynbdco52cozrfwquvpdwto

newwebgroup commented 1 year ago

I agree with ND. I have checked the GitHub history and am willing to endorse the first round. I will keep an eye on the follow-up to check whether the SPs support retrieval and whether the content matches the description.

newwebgroup commented 1 year ago

Request Approved

Your Datacap Allocation Request has been approved by the Notary

Message sent to Filecoin Network

bafy2bzacecmjusbi6oxaowvh2seoknawv3oxbtwte3m2jt6q6xfljppv2yiqu

Address

f16xlgzogkkd5qm44uftfprkvh44gu4ruo7dmngca

Datacap Allocated

150.00TiB

Signer Address

f1e77zuityhvvw6u2t6tb5qlnsegy2s67qs4lbbbq

Id

bedc95da-280c-4249-8b08-41d47191ac7e

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacecmjusbi6oxaowvh2seoknawv3oxbtwte3m2jt6q6xfljppv2yiqu

large-datacap-requests[bot] commented 1 year ago

DataCap Allocation requested

Request number 2

Multisig Notary address

f02049625

Client address

f16xlgzogkkd5qm44uftfprkvh44gu4ruo7dmngca

DataCap allocation requested

300TiB

Id

ad932ce6-319c-4fe8-b2f1-7e62e861aa83

large-datacap-requests[bot] commented 1 year ago

Stats & Info for DataCap Allocation

Multisig Notary address

f01858410

Client address

f16xlgzogkkd5qm44uftfprkvh44gu4ruo7dmngca

Rule to calculate the allocation request amount

100% of weekly dc amount requested

DataCap allocation requested

300TiB

Total DataCap granted for client so far

150TiB

Datacap to be granted to reach the total amount requested by the client (5PiB)

4.85PiB

Stats

| Number of deals | Number of storage providers | Previous DC Allocated | Top provider | Remaining DC |
| --- | --- | --- | --- | --- |
| 2373 | 2 | 150TiB | 94.61 | 43.90TiB |

filplus-checker-app[bot] commented 1 year ago

DataCap and CID Checker Report Summary[^1]

Storage Provider Distribution

⚠️ 1 storage providers sealed more than 70% of total datacap - f0130556: 90.02%

Deal Data Replication

⚠️ 100.00% of deals are for data replicated across less than 3 storage providers.

Deal Data Shared with other Clients[^3]

✔️ No CID sharing has been observed.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

[^2]: Deals from those addresses are combined into this report as they are specified with checker:manualTrigger

[^3]: To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

Full report

Click here to view the full report.
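
The two warnings in the report above can be reproduced from a plain list of deals. The following is a minimal sketch, not the checker's actual implementation: the deal tuples, the provider and CID names, and both function names are hypothetical, and weighting by deal size is an assumption. Only the thresholds (more than 70% for one provider, fewer than 3 providers per piece of data) are taken from the report text.

```python
from collections import defaultdict


def provider_shares(deals):
    """Fraction of the total deal size sealed by each provider."""
    totals = defaultdict(int)
    for provider, _cid, size in deals:
        totals[provider] += size
    grand_total = sum(totals.values())
    return {p: s / grand_total for p, s in totals.items()}


def under_replicated_fraction(deals, min_providers=3):
    """Fraction of deal size whose data sits on fewer than min_providers."""
    providers_by_cid = defaultdict(set)
    for provider, cid, _size in deals:
        providers_by_cid[cid].add(provider)
    total = sum(size for _p, _c, size in deals)
    low = sum(size for _p, cid, size in deals
              if len(providers_by_cid[cid]) < min_providers)
    return low / total


# Hypothetical deals: (provider ID, piece CID, size in GiB).
deals = [
    ("sp-1", "cid-a", 32), ("sp-1", "cid-b", 32),
    ("sp-1", "cid-c", 32), ("sp-2", "cid-a", 32),
]

shares = provider_shares(deals)
# Providers above the 70% threshold quoted in the report.
flagged = {p: s for p, s in shares.items() if s > 0.70}
print(flagged)
print(under_replicated_fraction(deals))
```

In this made-up data set one provider holds three of the four deals and no piece is stored with three providers, so both warnings would fire. Later reports in this thread use different thresholds (30% and 4 providers), which is why both are left as parameters or simple comparisons.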

hhhhxiao commented 1 year ago

The network data has probably not updated yet; we have already connected with more than 3 SPs to store the data. Colleagues mishandled the first round, in which more than 36% of the processing was affected. We will keep a close eye on the proportions in later rounds so that they meet the requirements.

METAVERSEDATAMINING commented 1 year ago

Can you share your storage plan for the next stage? Including details such as how many copies you plan to store and information about the SPs involved.

hhhhxiao commented 1 year ago

The overall plan stores 5 copies. Here is the list of SPs we have worked with, and the new SPs will be synchronized as soon as possible.

| SPID | Region | City |
| --- | --- | --- |
| f0130556 | CN | Zhanjiang |
| f02040772 | UK | London |
| f02051716 | US | Herndon |

hhhhxiao commented 1 year ago

checker:manualTrigger

filplus-checker-app[bot] commented 1 year ago

DataCap and CID Checker Report Summary[^1]

Storage Provider Distribution

✔️ Storage provider distribution looks healthy.

Deal Data Replication

✔️ Data replication looks healthy.

Deal Data Shared with other Clients[^3]

✔️ No CID sharing has been observed.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

[^2]: Deals from those addresses are combined into this report as they are specified with checker:manualTrigger

[^3]: To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

Full report

Click here to view the full report.

mikezli commented 1 year ago

checker:manualTrigger

filplus-checker-app[bot] commented 1 year ago

DataCap and CID Checker Report Summary[^1]

Storage Provider Distribution

✔️ Storage provider distribution looks healthy.

Deal Data Replication

✔️ Data replication looks healthy.

Deal Data Shared with other Clients[^3]

✔️ No CID sharing has been observed.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

[^2]: Deals from those addresses are combined into this report as they are specified with checker:manualTrigger

[^3]: To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

Full report

Click here to view the full report.

mikezli commented 1 year ago

The check looks good; I am willing to support this round.

mikezli commented 1 year ago

Request Approved

Your Datacap Allocation Request has been approved by the Notary

Message sent to Filecoin Network

bafy2bzaceabptuvyfqxxjxr4tyfr26dumazyfuwvvcsgrteeqlp2nw7e7x542

Address

f16xlgzogkkd5qm44uftfprkvh44gu4ruo7dmngca

Datacap Allocated

300.00TiB

Signer Address

f1dnb3uz7sylxk6emti3ififcvu3nlufnnsjui6ea

Id

You can check the status of the message here: https://filfox.info/en/message/bafy2bzaceabptuvyfqxxjxr4tyfr26dumazyfuwvvcsgrteeqlp2nw7e7x542

large-datacap-requests[bot] commented 1 year ago

DataCap Allocation requested

Request number 3

Multisig Notary address

f02049625

Client address

f16xlgzogkkd5qm44uftfprkvh44gu4ruo7dmngca

DataCap allocation requested

600TiB

Id

23bc3db5-cb57-4fe3-ae29-289790b36dde

large-datacap-requests[bot] commented 1 year ago

Stats & Info for DataCap Allocation

Multisig Notary address

f02049625

Client address

f16xlgzogkkd5qm44uftfprkvh44gu4ruo7dmngca

Rule to calculate the allocation request amount

200% of weekly dc amount requested

DataCap allocation requested

600TiB

Total DataCap granted for client so far

272848.4YiB

Datacap to be granted to reach the total amount requested by the client (5PiB)

-3.29B
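
The two figures above (272848.4YiB granted and -3.29B remaining) do not match the allocations recorded earlier in this thread. A minimal sketch of the expected arithmetic follows, assuming the only grants so far are the 150TiB and 300TiB allocations approved above and that 1PiB = 1024TiB. The function name is hypothetical and this is not the bot's code.

```python
TIB_PER_PIB = 1024  # binary units: 1 PiB = 1024 TiB


def allocation_progress(total_requested_pib, grants_tib):
    """Return (granted so far in TiB, remaining in PiB)."""
    granted_tib = sum(grants_tib)
    remaining_pib = total_requested_pib - granted_tib / TIB_PER_PIB
    return granted_tib, remaining_pib


# After round 1 only: agrees with the 4.85PiB reported before request 2.
granted, remaining = allocation_progress(5, [150])
print(f"{granted}TiB granted, {remaining:.2f}PiB remaining")

# After rounds 1 and 2 (assumed state when request 3 was opened).
granted, remaining = allocation_progress(5, [150, 300])
print(f"{granted}TiB granted, {remaining:.2f}PiB remaining")
```

Under these assumptions the expected values at this point would be 450TiB granted and about 4.56PiB still to be granted.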

Stats

| Number of deals | Number of storage providers | Previous DC Allocated | Top provider | Remaining DC |
| --- | --- | --- | --- | --- |
| null | null | 300TiB | null | 75.84TiB |

sxxfuture-official commented 1 year ago

checker:manualTrigger

filplus-checker-app[bot] commented 1 year ago

DataCap and CID Checker Report Summary[^1]

Storage Provider Distribution

✔️ Storage provider distribution looks healthy.

Deal Data Replication

⚠️ 100.00% of deals are for data replicated across less than 4 storage providers.

Deal Data Shared with other Clients[^3]

✔️ No CID sharing has been observed.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

[^2]: Deals from those addresses are combined into this report as they are specified with checker:manualTrigger

[^3]: To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

Full report

Click here to view the full report.

sxxfuture-official commented 1 year ago

@hhhhxiao Most of the results displayed by the CID Checker Report are satisfactory, except that the number of replicas of the data is insufficient, and more SPs need to be added to store copies of the data. But considering that the project has just gone through the second round, I hope this problem can be solved in the future.

The results of the retrieval test are as follows, deal_id from https://datacapstats.io/clients/f02060232?page=567

image

sxxfuture-official commented 1 year ago

Request Proposed

Your Datacap Allocation Request has been proposed by the Notary

Message sent to Filecoin Network

bafy2bzacedlkj7zbvmntgsgbwnc7taw6suozzhyl57wg7yrmukck36ubm3cam

Address

f16xlgzogkkd5qm44uftfprkvh44gu4ruo7dmngca

Datacap Allocated

600.00TiB

Signer Address

f1foiomqlmoshpuxm6aie4xysffqezkjnokgwcecq

Id

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacedlkj7zbvmntgsgbwnc7taw6suozzhyl57wg7yrmukck36ubm3cam

woshidama323 commented 1 year ago

checker:manualTrigger

filplus-checker-app[bot] commented 1 year ago

DataCap and CID Checker Report Summary[^1]

Storage Provider Distribution

✔️ Storage provider distribution looks healthy.

Deal Data Replication

⚠️ 100.00% of deals are for data replicated across less than 4 storage providers.

Deal Data Shared with other Clients[^3]

✔️ No CID sharing has been observed.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

[^2]: Deals from those addresses are combined into this report as they are specified with checker:manualTrigger

[^3]: To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

Full report

Click here to view the full report.

woshidama323 commented 1 year ago

image

Hope you will fix this next round

woshidama323 commented 1 year ago

Request Approved

Your Datacap Allocation Request has been approved by the Notary

Message sent to Filecoin Network

bafy2bzacebzgqvryntn4r6k2ufyf26a7vrq6cr6jhxbaxovq5toautqep357y

Address

f16xlgzogkkd5qm44uftfprkvh44gu4ruo7dmngca

Datacap Allocated

600.00TiB

Signer Address

f12tk3adljauwnd3hjbigpfxb7b7gdlj63p6afwtq

Id

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacebzgqvryntn4r6k2ufyf26a7vrq6cr6jhxbaxovq5toautqep357y

large-datacap-requests[bot] commented 1 year ago

DataCap Allocation requested

Request number 4

Multisig Notary address

f02049625

Client address

f16xlgzogkkd5qm44uftfprkvh44gu4ruo7dmngca

DataCap allocation requested

1.17PiB

Id

1b3ba6af-e26c-48c5-8674-2cfe0bcd6d38

large-datacap-requests[bot] commented 1 year ago

Stats & Info for DataCap Allocation

Multisig Notary address

f02049625

Client address

f16xlgzogkkd5qm44uftfprkvh44gu4ruo7dmngca

Rule to calculate the allocation request amount

400% of weekly dc amount requested

DataCap allocation requested

1.17PiB
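
The request sizes in this thread follow a doubling pattern relative to the 300TiB weekly amount. A small sketch of that schedule is below. The multipliers for requests 2 to 4 are the percentages quoted by the bot; the 0.5 for request 1 is an assumption inferred from the 150TiB first allocation, and the names are hypothetical.

```python
WEEKLY_TIB = 300  # weekly DataCap requested in this application

# Multiplier of the weekly amount for each request number.
# Requests 2-4 are quoted by the bot; request 1 (0.5) is inferred from 150TiB.
MULTIPLIERS = {1: 0.5, 2: 1.0, 3: 2.0, 4: 4.0}


def tranche_tib(request_number, weekly_tib=WEEKLY_TIB):
    """Size of one allocation request in TiB."""
    return MULTIPLIERS[request_number] * weekly_tib


for n in sorted(MULTIPLIERS):
    size = tranche_tib(n)
    print(f"request {n}: {size:.0f}TiB ({size / 1024:.2f}PiB)")
```

Request 4 comes to 1200TiB, which is the 1.17PiB shown above when expressed in PiB.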

Total DataCap granted for client so far

545696821063757201408.0YiB

Datacap to be granted to reach the total amount requested by the client (5PiB)

-6.59B

Stats

| Number of deals | Number of storage providers | Previous DC Allocated | Top provider | Remaining DC |
| --- | --- | --- | --- | --- |
| null | null | 600TiB | null | 149.15TiB |

hhhhxiao commented 1 year ago

checker:manualTrigger

filplus-checker-app[bot] commented 1 year ago

DataCap and CID Checker Report Summary[^1]

Storage Provider Distribution

⚠️ 1 storage providers sealed more than 30% of total datacap - f02121645: 43.40%

Deal Data Replication

⚠️ 100.00% of deals are for data replicated across less than 4 storage providers.

Deal Data Shared with other Clients[^3]

✔️ No CID sharing has been observed.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

[^2]: Deals from those addresses are combined into this report as they are specified with checker:manualTrigger

[^3]: To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

Full report

Click here to view the full report.

cryptowhizzard commented 1 year ago

Good morning,

It seems your SPs do not support retrieval of data. This is against FIL+ rules and guidelines.

./f02060232.sh Error: retrieval query for miner f02121645 failed: failed to dial 12D3KooWCK6NUYAEYrdpfH1ENc7XiFKX7qQ1RYphgLdWgYsa7Njs:

zcfil commented 1 year ago

checker:manualTrigger

filplus-checker-app[bot] commented 1 year ago

DataCap and CID Checker Report Summary[^1]

Storage Provider Distribution

⚠️ 1 storage providers sealed more than 30% of total datacap - f02121645: 43.11%

Deal Data Replication

⚠️ 100.00% of deals are for data replicated across less than 4 storage providers.

Deal Data Shared with other Clients[^3]

✔️ No CID sharing has been observed.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

[^2]: Deals from those addresses are combined into this report as they are specified with checker:manualTrigger

[^3]: To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

Full report

Click here to view the full report.