filecoin-project / filecoin-plus-large-datasets

Hub for client applications for DataCap at a large scale
110 stars 62 forks source link

[DataCap Application] Bole Digital #654

Closed BoleDigital closed 1 year ago

BoleDigital commented 2 years ago

Large Dataset Notary Application

To apply for DataCap to onboard your dataset to Filecoin, please fill out the following.

Core Information

Please respond to the questions below by replacing the text saying "Please answer here". Include as much detail as you can in your answer.

Project details

Share a brief history of your project and organization.

We are a team with a background in industrial. Our team is a mixed team of "light industrial design", "planning", "video" and "marketing". We are enthusiastic and puts 100% passion and creativity into every project to make your brand or product meaningful.

What is the primary source of funding for this project?

Own income.

What other projects/ecosystem stakeholders is this project associated with?

No.

Use-case details

Describe the data being stored onto Filecoin

Promotion videos.

Where was the data in this dataset sourced from?

We provide services such as video filming and production.

Can you share a sample of the data? A link to a file, an image, a table, etc., are good ways to do this. 

[14.zip](https://github.com/filecoin-project/filecoin-plus-large-datasets/files/9338809/14.zip)

         Confirm that this is a public dataset that can be retrieved by anyone on the Network (i.e., no specific permissions or access rights are required to view the data).

Yes.

What is the expected retrieval frequency for this data?

Once a year. We mainly store these data as copies.

For how long do you plan to keep this dataset stored on Filecoin?

2 years

DataCap allocation plan

In which geographies (countries, regions) do you plan on making storage deals?

Asia.

How will you be distributing your data to storage providers? Is there an offline data transfer process?

Offline data transfer is our first choice. 

How do you plan on choosing the storage providers with whom you will be making deals? This should include a plan to ensure the data is retrievable in the future both by you and others.

We'll use slack and github to find out suitable SP and they should approve retrieval.

How will you be distributing deals across storage providers?

Every Sp will be given under 25% whole deals.

Do you have the resources/funding to start making deals as soon as you receive DataCap? What support from the community would help you onboard onto Filecoin?

Yes.
large-datacap-requests[bot] commented 2 years ago

Thanks for your request!

Heads up, you’re requesting more than the typical weekly onboarding rate of DataCap!
large-datacap-requests[bot] commented 2 years ago

Thanks for your request! Everything looks good. :ok_hand:

A Governance Team member will review the information provided and contact you back pretty soon.

large-datacap-requests[bot] commented 2 years ago

Thanks for your request! Everything looks good. :ok_hand:

A Governance Team member will review the information provided and contact you back pretty soon.

Sunnyiscoming commented 2 years ago

Can you provide more information about organization? What's the relationship between you and organization? Can you provide more detailed information about other storage providers participated in this program, such as you can list SPs you have contacted with at present? Could you send an email to filplus-app-review@fil.org with your official domain in order to confirm your identity? Email name should includes the issue id #654.

BoleDigital commented 2 years ago

We are a team with a background in industrial. Our team is located in Taichung, Taiwan. I am an employee of Bole Digital, and my boss asked me to apply for datacap. We've contacted f0504054, which can cooperate with us in the next process. The ideal region for us to send deals to is China, Japan. Our team will attempt to find more storage providers from Asian countries. 5email

Sunnyiscoming commented 2 years ago

Received email.

raghavrmadya commented 2 years ago

Hi, we need more data samples. 5 PiBs of promotional videos is quite hard to believe and we would like to know about the scale of your business operations. Please provide at least 10 samples as supporting evidence for your requested amount of DC

BoleDigital commented 2 years ago

@raghavrmadya https://drive.google.com/file/d/1kVEXigTpda4Zs-0aVljo20pXea0c9G-I/view?usp=sharing hi, I've uploaded my dataset. Many of our designs are 4k HD videos with a large amount of data, I have only selected a small amount of sample to show.

raghavrmadya commented 2 years ago

Still, 5 PiBs of promotional videos is an insane amount of data? How much data do you have and how many copies are you storing?

BoleDigital commented 2 years ago

@raghavrmadya Hi Raghav, actually the size of 4k HD videos is usually large, and the dataset I showed is just a little part of whole data. Now we have 531.2TiB data which need to be stored as 10 copies. As it is closely related to our business, we want to keep this data in a good way by filecoin.

raghavrmadya commented 1 year ago

Datacap Request Trigger

Total DataCap requested

5PiB

Expected weekly DataCap usage rate

100TiB

Client address

f1fmdka4yaspkjp3kw65azeygbmgp5dujl6wlghti

large-datacap-requests[bot] commented 1 year ago

DataCap Allocation requested

Multisig Notary address

f02049625

Client address

f1fmdka4yaspkjp3kw65azeygbmgp5dujl6wlghti

DataCap allocation requested

50TiB

1475Notary commented 1 year ago

Request Proposed

Your Datacap Allocation Request has been proposed by the Notary

Message sent to Filecoin Network

bafy2bzacecr222t32kp426c5ej32x52fqvmsmg2732wnkfst3zurrjv7jema4

Address

f1fmdka4yaspkjp3kw65azeygbmgp5dujl6wlghti

Datacap Allocated

50.00TiB

Signer Address

f1ofq4mngy7ggcp755pfquq2gphjjnlydolf6awtq

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacecr222t32kp426c5ej32x52fqvmsmg2732wnkfst3zurrjv7jema4

liyunzhi-666 commented 1 year ago

Request Approved

Your Datacap Allocation Request has been approved by the Notary

Message sent to Filecoin Network

bafy2bzaceagzsctkgpkyzlfzzsywdrf6ixb4cdfv6bi3me6yvvdsoliwejnjy

Address

f1fmdka4yaspkjp3kw65azeygbmgp5dujl6wlghti

Datacap Allocated

50.00TiB

Signer Address

f1pszcrsciyixyuxxukkvtazcokexbn54amf7gvoq

You can check the status of the message here: https://filfox.info/en/message/bafy2bzaceagzsctkgpkyzlfzzsywdrf6ixb4cdfv6bi3me6yvvdsoliwejnjy

BDE-io commented 1 year ago

@BoleDigital Hi! Great to see you have gotten approval for DataCap and advancing the mission of preserving humanity’s most important information. If you are looking for storage providers to store these data or have any questions, please visit #bigdata-exchange on Filecoin Slack or reply here.

We have strong demand from a diverse group of SPs, who are actively looking to onboard more data.

large-datacap-requests[bot] commented 1 year ago

DataCap Allocation requested

Request number 2

Multisig Notary address

f02049625

Client address

f1fmdka4yaspkjp3kw65azeygbmgp5dujl6wlghti

DataCap allocation requested

100TiB

Id

84b88189-a2f2-4021-8833-7db62a9bda35

large-datacap-requests[bot] commented 1 year ago

Stats & Info for DataCap Allocation

Multisig Notary address

f01858410

Client address

f1fmdka4yaspkjp3kw65azeygbmgp5dujl6wlghti

Last two approvers

liyunzhi-666 & 1475Notary

Rule to calculate the allocation request amount

100% of weekly dc amount requested

DataCap allocation requested

100TiB

Total DataCap granted for client so far

50TiB

Datacap to be granted to reach the total amount requested by the client (5PiB)

4.95PiB

Stats

Number of deals Number of storage providers Previous DC Allocated Top provider Remaining DC
0 0 50TiB 0 1.12TiB
UnionLabs2020 commented 1 year ago

Request Proposed

Your Datacap Allocation Request has been proposed by the Notary

Message sent to Filecoin Network

bafy2bzacecodmniz6vhjtrmysci3s4z3xcarrbtf3jbokycy6jahvoomlw3as

Address

f1fmdka4yaspkjp3kw65azeygbmgp5dujl6wlghti

Datacap Allocated

100.00TiB

Signer Address

f17xdri3wunqgld7dm23e4f3eqsntjakwc47xjo6i

Id

84b88189-a2f2-4021-8833-7db62a9bda35

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacecodmniz6vhjtrmysci3s4z3xcarrbtf3jbokycy6jahvoomlw3as

MetaWaveInfo commented 1 year ago

Request Approved

Your Datacap Allocation Request has been approved by the Notary

Message sent to Filecoin Network

bafy2bzacecak7tkcjmbkz4s4awxgegeeguiw6mkw7dlscfb5ipigvf5c25e5s

Address

f1fmdka4yaspkjp3kw65azeygbmgp5dujl6wlghti

Datacap Allocated

100.00TiB

Signer Address

f1ktlkcxnmzxcdaoqfsunrg3vocfbmgv4n3mrn74a

Id

84b88189-a2f2-4021-8833-7db62a9bda35

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacecak7tkcjmbkz4s4awxgegeeguiw6mkw7dlscfb5ipigvf5c25e5s

large-datacap-requests[bot] commented 1 year ago

DataCap Allocation requested

Request number 3

Multisig Notary address

f02049625

Client address

f1fmdka4yaspkjp3kw65azeygbmgp5dujl6wlghti

DataCap allocation requested

200TiB

Id

72460c68-5299-402e-be3a-b6c6388402a5

large-datacap-requests[bot] commented 1 year ago

Stats & Info for DataCap Allocation

Multisig Notary address

f01858410

Client address

f1fmdka4yaspkjp3kw65azeygbmgp5dujl6wlghti

Last two approvers

MetaWaveInfo & UnionLabs2020

Rule to calculate the allocation request amount

200% of weekly dc amount requested

DataCap allocation requested

200TiB

Total DataCap granted for client so far

150TiB

Datacap to be granted to reach the total amount requested by the client (5PiB)

4.85PiB

Stats

Number of deals Number of storage providers Previous DC Allocated Top provider Remaining DC
38 1 100TiB 100.00 25TiB
filplus-checker commented 1 year ago

DataCap and CID Checker Report[^1]

If this is the first time a provider takes verified deal, it will be marked as new.

For most of the datacap application, below restrictions should apply.

⚠️ f01974746 has sealed 35.90% of total datacap.

⚠️ 89.75% of total deal sealed by f01974746 are duplicate data.

⚠️ f01974746 has unknown IP location.

⚠️ 89.90% of total deal sealed by f01962589 are duplicate data.

⚠️ f01962589 has unknown IP location.

⚠️ 79.78% of total deal sealed by f01926802 are duplicate data.

⚠️ 77.02% of total deal sealed by f01926914 are duplicate data.

⚠️ 78.91% of total deal sealed by f01926585 are duplicate data.

Provider Location Total Deals Sealed Percentage Unique Data Duplicate Deals
f01974746 Unknown 44.81 TiB 35.90% 4.59 TiB 89.75%
f01962589 Unknown 30.00 TiB 24.04% 3.03 TiB 89.90%
f01926802 Ashburn, Virginia, US 17.00 TiB 13.62% 3.44 TiB 79.78%
f01926914 Ashburn, Virginia, US 17.00 TiB 13.62% 3.91 TiB 77.02%
f01926585 Ashburn, Virginia, US 16.00 TiB 12.82% 3.38 TiB 78.91%

Provider Distribution

Deal Data Replication

The below table shows how each many unique data are replicated across storage providers.

⚠️ 100.00% of deals are for data replicated across less than 4 storage providers.

Unique Data Size Total Deals Made Number of Providers Deal Percentage
13.72 TiB 106.56 TiB 1 85.38%
2.31 TiB 18.25 TiB 2 14.62%

Replication Distribution

Deal Data Shared with other Clients

The below table shows how many unique data are shared with other clients. Usually different applications owns different data and should not resolve to the same CID.

⚠️ CID sharing has been observed.

Other Client Application Total Deals Affected Unique CIDs Verifier
f1zav4f4rak2yvbrq357z3whj224wn4dipl7eiesy Shanghai Zhixun 17.56 TiB 113 LDN v3 multisig
f1l2hkjqfh5abpd5x6lxj4tjrphaa44pnumt5ms5i Fly Brain Anatomy - Slingshot 16.50 TiB 127 LDN # 153
f1xyx3tvfoy7ue62oqdtcpud5ewcw2i7ctu4kyylq zhenshuai 608.00 GiB 4 LDN # 332
f1jsgrkuicjbw5g3u7f4tcs4b6zdctwxq4h6rw5na Neusoft 352.00 GiB 3 LDN v3 multisig
f1quofpadzcqlonmz7v3mv7dfqvm5hdztucdsjqsy HENAN 863 SOFTWARE CO., LTD 224.00 GiB 1 LDN v3 multisig

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

github-actions[bot] commented 1 year ago

This application has not seen any responses in the last 10 days. This issue will be marked with Stale label and will be closed in 4 days. Comment if you want to keep this application open.

github-actions[bot] commented 1 year ago

This application has not seen any responses in the last 14 days, so for now it is being closed. Please feel free to contact the Fil+ Gov team to re-open the application if it is still being processed. Thank you!

aggregation-and-compliance-bot[bot] commented 10 months ago
Client f01905508 does not follow the datacap usage rules. More info here. This application has been failing the requirements for 7 days. Please take appropiate action to fix the following DataCap usage problems. Criteria Treshold Reason
Cid Checker score > 25% The client has a CID checker score of 0%. This should be greater than 25%. To find out more about CID checker score please look at this issue: https://github.com/filecoin-project/notary-governance/issues/986
Shared data percent < 20% 22.46% of the clients data is shared with other clients. This should be less than 20%