filecoin-project / filecoin-plus-large-datasets

Hub for client applications for DataCap at a large scale
110 stars 62 forks source link

[DataCap Application]AOLIGEI #377

Closed aoligei-01 closed 1 year ago

aoligei-01 commented 2 years ago

Large Dataset Notary Application

To apply for DataCap to onboard your dataset to Filecoin, please fill out the following.

Core Information

Please respond to the questions below by replacing the text saying "Please answer here". Include as much detail as you can in your answer.

Project details

Share a brief history of your project and organization.

AOLIGEI Media Co., Ltd is a media platform focused on the development of artists. With accurate market positioning, we’ve won a certain market share out of many competitors. Engaged in the filed of the image promotion of internet influencers, short video planning, filming training consulting, etc. We are not only have a large number of media resources, but also create their own flow of IP with the professional operation teams, helping them achieve their own dreams!

What is the primary source of funding for this project?

Income from influencers and e-commerce products revenue

What other projects/ecosystem stakeholders is this project associated with?

Our company currently has more than 100 influencers under contract, and our current business has close ties with film and television companies, social media platforms, and e-commerce.
1、Cooperating with film and TV companies to shoot TV series and advertisement promotion works.
2、Cooperating with other media platforms to publish videos, pictures and advertisements of our contracted artists.
3、We also do e-commerce by selling a full range of products through live-stream media.

Use-case details

Describe the data being stored onto Filecoin

The data stored on Filecoin are our business videos, commercials, image and client’s video materials. 

Where was the data in this dataset sourced from?

From our own production

Can you share a sample of the data? A link to a file, an image, a table, etc., are good ways to do this.

Douying:
https://www.douyin.com/user/MS4wLjABAAAAAD-lSehi3CNwG1I8bTSfQUZ-Ak2j9Rxdulx6TZNFsAc
https://www.douyin.com/user/MS4wLjABAAAAkp1fflZfY49fdNe0G35qHPFMiIQllV9hQPzhSMQXbvk
https://www.douyin.com/user/MS4wLjABAAAAqHIH7OBGnA77xHvRJNYaNXRS5MAm-2lDxNIk3dJSHtU

Confirm that this is a public dataset that can be retrieved by anyone on the Network (i.e., no specific permissions or access rights are required to view the data).

Confirm that the data we will store is publicly available

What is the expected retrieval frequency for this data?

Just few times for check.

For how long do you plan to keep this dataset stored on Filecoin?

The data we store on filecoin is subject to the demand and business needs. It is expected that 20% of the data will be stored for 540 days and 80% of the data will be stored permanently.

DataCap allocation plan

In which geographies (countries, regions) do you plan on making storage deals?

Singapore、America

How will you be distributing your data to storage providers? Is there an offline data transfer process?

Both online and offline transfer.

How do you plan on choosing the storage providers with whom you will be making deals? This should include a plan to ensure the data is retrievable in the future both by you and others.

We choose those with real data storage experience and stable operation storage providers from the github repo.

How will you be distributing deals across storage providers?

Each storage provider will receive 15% of our dataCap, and max 2 copies per SP.

Do you have the resources/funding to start making deals as soon as you receive DataCap? What support from the community would help you onboard onto Filecoin?

Yes. We are ready to make deals. 
large-datacap-requests[bot] commented 2 years ago

Thanks for your request! Everything looks good. :ok_hand:

A Governance Team member will review the information provided and contact you back pretty soon.

Kakkouii commented 2 years ago

Can you post a video or photo on the Douyin account you've mentioned above in data sample to verify you're making videos for it?

Sunnyiscoming commented 2 years ago

I have sent a message to the above three exposed Douyin accounts by using the personal Douyin id to prove whether there is any business association with 浙 江 奥 利 给 文 化 传 媒 有 限 公 司. Douyin id: hanhanmami8310 Please check it.

aoligei-01 commented 2 years ago

@Sunnyiscoming Hello, our artist has sent a confirmation message to Douyin id: hanhanmami8310, please check. Artist Douyin id: dandanjieshuo; DY533813 Our artists have hidden works because of their personal preferences, so you can't see all the works WechatIMG5281 WechatIMG5282 WechatIMG5284

We confirm that the data we store on Filecoin is publicly available,Thank you!

Sunnyiscoming commented 2 years ago

image @aoligei-01 Can you explain about that?

aoligei-01 commented 2 years ago

@Sunnyiscoming This influencer is only cooperating with us in the online sales business. The data we want to store on the Filecoin network is just related to our business. NOT ALL OF the Influencer's VIDEO. BTW, it seems like you are also Chinese. You might know the meaning of "influencer" and how influencer works. What's the meaning of operation for those influencers' accounts. Also, It might be offensive to do this as a governance team assistant.

Sunnyiscoming commented 2 years ago

@aoligei-01 This is client due diligence, and hope you can understand it. If you are just a influencer, can you provide data storage authorization of this douyin channel?

aoligei-01 commented 2 years ago

@Sunnyiscoming Sorry, you may have misunderstood me. We are a media company, we have signed some artists and there will be a lot of video material that needs to be stored for backup. Secondly, we have business cooperation with external KOLs and there will be many business opportunities, so the video data used for online sales also need to be stored.Thank you!

Sunnyiscoming commented 2 years ago

@aoligei-01 Ok, got it. Hey. Could you send an email to filplus@fil.org with your official domain in order to confirm your identity?

aoligei-01 commented 2 years ago

Hello, @Sunnyiscoming we have sent an identity confirmation email to filplus@fil.org, please check. thank you!

galen-mcandrew commented 2 years ago

Datacap Request Trigger

Total DataCap requested

5PiB

Expected weekly DataCap usage rate

100TiB

Client address

f1unz5yuxgui573lpuh2wyxpsx5ahvw5farqb7hji

large-datacap-requests[bot] commented 2 years ago

DataCap Allocation requested

Multisig Notary address

f02049625

Client address

f1unz5yuxgui573lpuh2wyxpsx5ahvw5farqb7hji

DataCap allocation requested

50TiB

psh0691 commented 2 years ago

Request Approved

Your Datacap Allocation Request has been approved by the Notary

Message sent to Filecoin Network

bafy2bzacedb7p7jythcmp4naa6scygmspl5rsey7qlpbjbh7rcjj7g4orv5cq

Address

f1unz5yuxgui573lpuh2wyxpsx5ahvw5farqb7hji

Datacap Allocated

50.00TiB

Signer Address

f1qdko4jg25vo35qmyvcrw4ak4fmuu3f5rif2kc7i

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacedb7p7jythcmp4naa6scygmspl5rsey7qlpbjbh7rcjj7g4orv5cq

BDE-io commented 2 years ago

@aoligei-01 Hi! Great to see you have gotten approval for DataCap. If you are looking for storage providers to store these data, please visit #bigdata-exchange on Filecoin Slack or reply here.

large-datacap-requests[bot] commented 1 year ago

DataCap Allocation requested

Request number 2

Multisig Notary address

f02049625

Client address

f1unz5yuxgui573lpuh2wyxpsx5ahvw5farqb7hji

DataCap allocation requested

100TiB

Id

8e040a72-d0e1-42a2-83aa-e805d0abf233

large-datacap-requests[bot] commented 1 year ago

Stats & Info for DataCap Allocation

Multisig Notary address

f01858410

Client address

f1unz5yuxgui573lpuh2wyxpsx5ahvw5farqb7hji

Last two approvers

psh0691 & not found

Rule to calculate the allocation request amount

100% of weekly dc amount requested

DataCap allocation requested

100TiB

Total DataCap granted for client so far

350TiB

Datacap to be granted to reach the total amount requested by the client (5 PiB)

4.65PiB

Stats

Number of deals Number of storage providers Previous DC Allocated Top provider Remaining DC
11201 4 50TiB 39.20 2.34TiB
filplus-checker-app[bot] commented 1 year ago

DataCap and CID Checker Report[^1]

If this is the first time a provider takes verified deal, it will be marked as new.

For most of the datacap application, below restrictions should apply.

Since this is the 2nd allocation, the following restrictions have been relaxed:

⚠️ 20.16% of total deal sealed by f01922865 are duplicate data.

⚠️ 73.15% of total deal sealed by f01916645 are duplicate data.

⚠️ f01916645 has unknown IP location.

Provider Location Total Deals Sealed Percentage Unique Data Duplicate Deals
f01922865 Ho Chi Minh City, Ho Chi Minh, VN
Zenlayer Inc
135.66 TiB 39.02% 108.31 TiB 20.16%
f01915287 Hanoi, Hanoi, VN
Zenlayer Inc
121.44 TiB 34.93% 99.97 TiB 17.68%
f01916645 Unknown
Unknown
49.81 TiB 14.33% 13.38 TiB 73.15%
f01922893 Hanoi, Hanoi, VN
Zenlayer Inc
40.75 TiB 11.72% 40.75 TiB 0.00%

Provider Distribution

Deal Data Replication

The below table shows how each many unique data are replicated across storage providers.

Since this is the 2nd allocation, the following restrictions have been relaxed:

✔️ Data replication looks healthy.

Unique Data Size Total Deals Made Number of Providers Deal Percentage
190.03 TiB 257.28 TiB 1 74.00%
36.19 TiB 90.38 TiB 2 26.00%

Replication Distribution

Deal Data Shared with other Clients

The below table shows how many unique data are shared with other clients. Usually different applications owns different data and should not resolve to the same CID.

However, this could be possible if all below clients use same software to prepare for the exact same dataset or they belong to a series of LDN applications for the same dataset.

⚠️ CID sharing has been observed.

Other Client Application Total Deals Affected Unique CIDs Verifier
f16ioghg3qy36f6572viouwv4dqow5ejpolo4kodi Shenzhen kuaixue Education Development Co., Ltd 2.74 PiB 4,890 LDN v3 multisig
f1t3buz7oqz4fktpthqe43vauhzlnuztpgm3iyhbi Shenzhen kuaixue Education Development Co., Ltd 2.59 PiB 6,870 LDN v3 multisig
f1tb2hrxk5eaeewcesnid6xmvfkklfdrxsjr5k6iy Yisainuo 540.25 TiB 5,541 LDN v3 multisig
f1mhyxd4unemmhrw4dbhjcovivayrj3tyactezmzq GOLDEN SECURITY 83.59 TiB 427 LDN v3 multisig
f3qebbkqspq4w6deouaubtngt4bmaada76uqs3omy
3tki6hoeocpgxyplknev5u3oi5e7xnltobrvgxnpa
3qga
codex8080 - Slingshot Restore 42.84 TiB 1,371 LDN v3 multisig
f3v7x4a2aapgx6o2r477tenoin3u5oadaeqyd7kjd
sitykvf4ok7vq2utcyh34lmu5u7oybs25ff6s4dbu
dpma
LeoCheung - Slingshot Restore 22.59 TiB 723 LDN v3 multisig
f126k3tkdwfaqpflgcclkiwhqxhh73ebqqazwgcoy New Web Group 13.38 TiB 426 LDN v3 multisig
f1x7wsqpj6waymzzfqmu4hh32tyc4pbbqnpwy2ucq Glif auto verified 32.00 GiB 1 Jonathan Schwartz

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

filplus-checker-app[bot] commented 1 year ago

DataCap and CID Checker Report[^1]

If this is the first time a provider takes verified deal, it will be marked as new.

For most of the datacap application, below restrictions should apply.

Since this is the 2nd allocation, the following restrictions have been relaxed:

⚠️ 20.16% of total deal sealed by f01922865 are duplicate data.

⚠️ 73.15% of total deal sealed by f01916645 are duplicate data.

⚠️ f01916645 has unknown IP location.

Provider Location Total Deals Sealed Percentage Unique Data Duplicate Deals
f01922865 Ho Chi Minh City, Ho Chi Minh, VN
Zenlayer Inc
135.66 TiB 39.02% 108.31 TiB 20.16%
f01915287 Hanoi, Hanoi, VN
Zenlayer Inc
121.44 TiB 34.93% 99.97 TiB 17.68%
f01916645 Unknown
Unknown
49.81 TiB 14.33% 13.38 TiB 73.15%
f01922893 Hanoi, Hanoi, VN
Zenlayer Inc
40.75 TiB 11.72% 40.75 TiB 0.00%

Provider Distribution

Deal Data Replication

The below table shows how each many unique data are replicated across storage providers.

Since this is the 2nd allocation, the following restrictions have been relaxed:

✔️ Data replication looks healthy.

Unique Data Size Total Deals Made Number of Providers Deal Percentage
190.03 TiB 257.28 TiB 1 74.00%
36.19 TiB 90.38 TiB 2 26.00%

Replication Distribution

Deal Data Shared with other Clients

The below table shows how many unique data are shared with other clients. Usually different applications owns different data and should not resolve to the same CID.

However, this could be possible if all below clients use same software to prepare for the exact same dataset or they belong to a series of LDN applications for the same dataset.

⚠️ CID sharing has been observed.

Other Client Application Total Deals Affected Unique CIDs Verifier
f16ioghg3qy36f6572viouwv4dqow5ejpolo4kodi Shenzhen kuaixue Education Development Co., Ltd 2.74 PiB 4,890 LDN v3 multisig
f1t3buz7oqz4fktpthqe43vauhzlnuztpgm3iyhbi Shenzhen kuaixue Education Development Co., Ltd 2.59 PiB 6,870 LDN v3 multisig
f1tb2hrxk5eaeewcesnid6xmvfkklfdrxsjr5k6iy Yisainuo 540.25 TiB 5,541 LDN v3 multisig
f1mhyxd4unemmhrw4dbhjcovivayrj3tyactezmzq GOLDEN SECURITY 83.59 TiB 427 LDN v3 multisig
f3qebbkqspq4w6deouaubtngt4bmaada76uqs3omy
3tki6hoeocpgxyplknev5u3oi5e7xnltobrvgxnpa
3qga
codex8080 - Slingshot Restore 42.84 TiB 1,371 LDN v3 multisig
f3v7x4a2aapgx6o2r477tenoin3u5oadaeqyd7kjd
sitykvf4ok7vq2utcyh34lmu5u7oybs25ff6s4dbu
dpma
LeoCheung - Slingshot Restore 22.59 TiB 723 LDN v3 multisig
f126k3tkdwfaqpflgcclkiwhqxhh73ebqqazwgcoy New Web Group 13.38 TiB 426 LDN v3 multisig
f1x7wsqpj6waymzzfqmu4hh32tyc4pbbqnpwy2ucq Glif auto verified 32.00 GiB 1 Jonathan Schwartz

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

github-actions[bot] commented 1 year ago

This application has not seen any responses in the last 10 days. This issue will be marked with Stale label and will be closed in 4 days. Comment if you want to keep this application open.

github-actions[bot] commented 1 year ago

This application has not seen any responses in the last 14 days, so for now it is being closed. Please feel free to contact the Fil+ Gov team to re-open the application if it is still being processed. Thank you!

aggregation-and-compliance-bot[bot] commented 9 months ago
Client f01903807 does not follow the datacap usage rules. More info here. This application has been failing the requirements for 7 days. Please take appropiate action to fix the following DataCap usage problems. Criteria Treshold Reason
Shared data percent < 20% 34.49% of the clients data is shared with other clients. This should be less than 20%