filecoin-project / filecoin-plus-large-datasets

Hub for client applications for DataCap at a large scale

[DataCap Application] <The data from MCN company> #1504

Closed: dpx-office closed this issue 9 months ago

dpx-office commented 1 year ago

Data Owner Name

Sichuan Horizon Culture Communication Co. LTD

Data Owner Country/Region

China

Data Owner Industry

Information, Media & Telecommunications

Website

https://www.cddpx.com/

Social Media

https://www.cddpx.com/

Total amount of DataCap being requested

5PiB

Weekly allocation of DataCap requested

200TiB

On-chain address for first allocation

f1fbpvedozoqtijqf4fwkifjhhlg3i2c6twyct2xi

Custom multisig

Identifier

None

Share a brief history of your project and organization

The Sichuan Horizon Culture Communication Co., Ltd. is a first-class MCN agency in China, founded in 2019 and headquartered in Chengdu.
We are a full-service online culture and entertainment company focused on multi-channel monetization. Our main businesses are IP incubation, brand promotion, and video media agency operation, with short video at the core. We also provide offline media training in media operation, video creation, video shooting and editing, and network operation, as well as courses for advertising optimizers and SEO optimizers.
The company has contracted experts in multiple categories such as games, appearance, animation, and agriculture (agriculture, rural areas, and farmers), with more than 500 million total video views and over 10 million fans.

Since 2021, the company has supported the "Rural Revitalisation" plan by incubating a number of new farmers, with a focus on agricultural products, rural culture, and rural life. We train young people in rural areas in new media production methods and audience-building techniques, helping them introduce the beauty of their hometowns, recommend local cuisine, and tell the history and stories of their surroundings.
By publishing diverse, high-quality rural content, the company promotes the transformation of "local memories" into "local economy" and serves as an excellent platform for incubating "three agriculture" web celebrities.

Over many years of media operation, the company has accumulated a large number of creators' works, mainly short videos, which are currently stored on the Alibaba Cloud (Aliyun) and Tencent Cloud platforms. We were looking for a more efficient, lower-cost storage method with long-term stability when we were approached by Filecoin SPs who wanted to store the data on the Filecoin network and who introduced us to decentralized storage. On the one hand, with the help of the miners, storing this data involves no additional cost for us. On the other hand, we understand that the Filecoin network is moving forward with the Saturn Project to implement CDN capabilities, and we would like to participate in the Filecoin CDN network if possible.

Is this project associated with other projects/ecosystem stakeholders?

No

If answered yes, what are the other projects/ecosystem stakeholders

No response

Describe the data being stored onto Filecoin

Most of the data we plan to store consists of original video works by the creators we cooperate with. Video themes are planned and filmed by the company's professional team, then released by the operations team, which handles subsequent operations. These business partnerships generate a large amount of video data, and we are keen to find a decentralized storage platform like Filecoin for long-term cooperation.
The volume of data is over 1P, and we plan to store it as 5 copies.

Where was the data currently stored in this dataset sourced from

Other

If you answered "Other" in the previous question, enter the details here

Aliyun\Tencent Cloud

How do you plan to prepare the dataset

others/custom tool

If you answered "other/custom tool" in the previous question, enter the details here

Self-developed packing tool

Please share a sample of the data

agriculture, the countryside and farmers
https://1drv.ms/u/s!Asy39Y6I4odCapzqHP3iQjvufgs?e=eHK7I4
Society
https://1drv.ms/u/s!Asy39Y6I4odCa9I89fRoYUnBpAg?e=Xa8Hpk
Gastronomy
https://1drv.ms/u/s!Asy39Y6I4odCbNJwthtRRAlzfDo?e=VRhFVq
Comic
https://1drv.ms/u/s!Asy39Y6I4odCabhQ7Aa2BNFrghk?e=5TAkkN
Game
https://1drv.ms/u/s!Asy39Y6I4odCaGV2oseNaLZ2zKI?e=UW5JLg

Confirm that this is a public dataset that can be retrieved by anyone on the Network

If you chose not to confirm, what was the reason

No response

What is the expected retrieval frequency for this data

Sporadic

For how long do you plan to keep this dataset stored on Filecoin

Permanently

In which geographies do you plan on making storage deals

Greater China, Asia other than Greater China, North America, Europe, Australia (continent), Antarctica

How will you be distributing your data to storage providers

HTTP or FTP server, Shipping hard drives

How do you plan to choose storage providers

Slack

If you answered "Others" in the previous question, what is the tool or platform you plan to use

No response

If you already have a list of storage providers to work with, fill out their names and provider IDs below

f02003553、f02003731、f02004886、f02005062
We will continue to contact more SPs and update their information here.

How do you plan to make deals to your storage providers

Lotus client

If you answered "Others/custom tool" in the previous question, enter the details here

No response

Can you confirm that you will follow the Fil+ guideline

Yes

large-datacap-requests[bot] commented 1 year ago

Thanks for your request!

Heads up, you’re requesting more than the typical weekly onboarding rate of DataCap!

large-datacap-requests[bot] commented 1 year ago

Thanks for your request! Everything looks good. :ok_hand:

A Governance Team member will review the information provided and contact you back pretty soon.

Sunnyiscoming commented 1 year ago

Can you provide some of the Internet celebrity accounts that the organization has signed, along with the signed certification materials? What is the relationship between you and the organization? Can the SPs you chose support data retrieval?

dpx-office commented 1 year ago

Hi @Sunnyiscoming, I am a member of the operations staff at the company. Regarding partner (Internet celebrity) information, as privacy is involved, please provide an official email address below and I will send the relevant signed agreement for review. In addition, we have confirmed with the SPs that the data we store is public and supports retrieval, and they will cooperate by providing a good retrieval configuration.

herrehesse commented 1 year ago

Dear Applicant,

Due to the recent increase in erroneous Filecoin+ data, we feel compelled, on behalf of the entire community, to look more deeply into DataCap requests. We do this to ensure that the overall value of the Filecoin network and the Filecoin+ program increases and that the program is not abused.

Please answer the questions below as comprehensively as possible.

Customer data

We expect that for the onboarding of customers with the scale of an LDN there would have been at least multiple email and perhaps several chat conversations preceding it. A single email with an agreement does not qualify here.

Should this be solely for acquiring DataCap, it is of course out of the question. The customer must have a legitimate reason for wanting to use the Filecoin+ program, which is intended to store useful and public datasets on the network.

(As an intermediate solution Filecoin offers the FIL-E program or the glif.io website for business datasets that do not meet the requirements for a Filecoin+ dataset)

Files and Processing

Hopefully you understand the caution the overall community has for onboarding the wrong data. We understand the increased need for Filecoin+, however, we must not allow the program to be misused. Everything depends on a valuable and useful network, let's do our best to make this happen. Together.

kevzak commented 1 year ago

@herrehesse @dpx-office Just to confirm, this application is listed as a public dataset. E-Fil+ is a pilot program for private datasets.

simonkim0515 commented 1 year ago

Datacap Request Trigger

Total DataCap requested

5PiB

Expected weekly DataCap usage rate

200TiB

Client address

f1fbpvedozoqtijqf4fwkifjhhlg3i2c6twyct2xi

large-datacap-requests[bot] commented 1 year ago

DataCap Allocation requested

Multisig Notary address

f02049625

Client address

f1fbpvedozoqtijqf4fwkifjhhlg3i2c6twyct2xi

DataCap allocation requested

100TiB

Id

603acfae-2992-49ef-966d-22bc4d1353d5

filplus-checker-app[bot] commented 1 year ago

DataCap and CID Checker Report[^1]

There is no previous allocation for this issue.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

cryptowhizzard commented 1 year ago

Dear applicant,

Thank you for applying for DataCap. As a notary, I am screening your application and doing due diligence.

Looking at your application, I have some questions:

You are brand new on GitHub and have no history of past applications, and 5PB of DataCap is a lot to apply for. One needs comprehensive knowledge of Filecoin, of packing data and its requirements, and of distributing the data. Are you brand new in the Filecoin space, or have you applied for DataCap in the past under different GitHub account names?

Can you provide me with some KYC details by e-mail at kyc@dcent.nl? I would like to receive an e-mail from the original domain name / contact listed on the website that refers to this application.

Your application is for 5PB of DataCap. Can you show us some visible proof of the data size and the storage you have there?

Regarding the data itself, you mention that it is made by professionals. I presume it is copyrighted, then. How can we check that you have permission to store this copyrighted material?

Can you provide us with a distribution plan, supposing that you are granted DataCap to store? Is the list you provided still accurate? "f02003553、f02003731、f02004886、f02005062" Because if so, your LDN request would be denied.

As you can see here: https://github.com/filecoin-project/filecoin-plus-large-datasets it states:

In order for a client and their dataset to be eligible: the dataset should be public, open, and mission aligned with Filecoin and Filecoin Plus. This also means that the data should be accessible to anyone in the network, without requiring any special permissions or access requirements. Stored data should be readily retrievable on the network, and this should be regularly verified (through the use of manual or automated verification that includes retrieving data from various miners over the course of the DataCap allocation timeframe).

I did some testing and none of these SPs are eligible to store data.

lotus net connect f02003553
f02003553 -> {12D3KooWPo1AXAYVn1dVsbtZDNd6WtyCAcdtCVUdtsCqmfRVXh2f: []}
ERROR: failed to parse multiaddr "f02003553": must begin with /

root@proposals:~/api/fullauto# lotus net connect f02003731
f02003731 -> {12D3KooWDNT4YfVWbLW6r88UXCL3CHJ3TCYXUcD1BUgw7QYdk8xM: []}
ERROR: failed to parse multiaddr "f02003731": must begin with /

root@proposals:~/api/fullauto# lotus net connect f02004886
f02004886 -> {12D3KooWR3kiL1R2bbzTyNHYMQBRpahnfZLgVrAWergSW9QK2k1K: []}
ERROR: failed to parse multiaddr "f02004886": must begin with /

root@proposals:~/api/fullauto# lotus net connect f02005062
f02005062 -> {12D3KooWLz6CxBpUo6oubT49VxnxCsyoDAWFv5QDhu2rW9sxKFVk: []}
ERROR: failed to parse multiaddr "f02005062": must begin with /

To make sure that you understand the rules and guidelines I would like to receive a detailed allocation plan where the amount of data and reputational SP's are included.
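
In each result above, the provider ID is followed by a peer ID and an empty list (`[]`), and the error text says a multiaddr must begin with `/`. A minimal Python sketch for pulling those fields out of such output lines follows. The regular expression and the helper names `parse_connect_line` and `looks_like_multiaddr` are illustrative assumptions based only on the text pasted in this comment; they are not part of Lotus and do not describe a guaranteed output format.

```python
import re

# Matches text such as: f02003553 -> {12D3KooW...: []}
# Hypothetical pattern derived from the pasted output, not a Lotus format guarantee.
LINE_RE = re.compile(
    r"(?P<sp>f0\d+) -> \{(?P<peer>[A-Za-z0-9]+): \[(?P<addrs>[^\]]*)\]\}"
)


def parse_connect_line(line):
    """Return (sp_id, peer_id, addresses), or None if the line does not match."""
    match = LINE_RE.search(line)
    if match is None:
        return None
    addresses = [a for a in match.group("addrs").split() if a]
    return match.group("sp"), match.group("peer"), addresses


def looks_like_multiaddr(text):
    """Mirror the rule quoted in the error message: a multiaddr begins with '/'."""
    return text.startswith("/")


lines = [
    "f02003553 -> {12D3KooWPo1AXAYVn1dVsbtZDNd6WtyCAcdtCVUdtsCqmfRVXh2f: []}",
    "f02003731 -> {12D3KooWDNT4YfVWbLW6r88UXCL3CHJ3TCYXUcD1BUgw7QYdk8xM: []}",
]

for line in lines:
    sp_id, peer_id, addresses = parse_connect_line(line)
    status = "no addresses listed" if not addresses else ", ".join(addresses)
    print(sp_id, peer_id, status, looks_like_multiaddr(sp_id))
```

A bare provider ID such as `f02003553` fails the leading-slash check, which matches the wording of the error shown for all four providers.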

dpx-office commented 1 year ago

Hi, thank you for due diligence.

I am on the operations staff of Sichuan Horizon Culture Communication Co., Ltd. I have sent the KYC email from our company email address; please check it. We submitted this LDN application after learning thoroughly about Filecoin and Fil+. The GitHub account is new, which is normal because I am not a developer and GitHub is not commonly used in China.

The data we are going to store is more than 1P and will be stored as 5 copies. If our application is approved, we will allocate it to SPs in different regions according to Fil+ regulations. We want to store most of the data in the Greater China region, and each SP will get no more than 25% of the allocation.
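
The plan in this paragraph can be checked with simple arithmetic. The sketch below is in Python and rests on two assumptions: that "1P" means roughly 1 PiB, and a made-up split across four placeholder SPs. Only the 5 copies and the 25% cap come from the comment itself.

```python
TIB_PER_PIB = 1024

dataset_pib = 1    # assumption: "more than 1P" read as roughly 1 PiB
copies = 5         # from the comment
cap_share = 0.25   # "no more than 25%" per SP, from the comment

total_tib = dataset_pib * copies * TIB_PER_PIB   # total to store across all copies
max_per_sp_tib = total_tib * cap_share           # largest amount one SP may hold


def over_cap(plan_tib, cap=cap_share):
    """Return the SPs whose share of the whole plan exceeds the cap."""
    total = sum(plan_tib.values())
    return sorted(sp for sp, tib in plan_tib.items() if tib / total > cap)


# Hypothetical plan for illustration only; the SP names and amounts are made up.
plan = {"sp-a": 1280, "sp-b": 1280, "sp-c": 1280, "sp-d": 1280}

print(total_tib, max_per_sp_tib, over_cap(plan))
```

Under these assumptions the total matches the 5PiB requested in the application, and a 25% cap implies at least four SPs.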

Regarding the copyright, I have provided the contract with the creators and taken some of the content for verification. The accounts of these creators are "阆中放牛娃", "谢女子", and "悦悦的凉茶". Link: Contract

The application form requires us to provide information about the SPs we have already contacted, and we will update the information on the SPs who eventually choose to work with us as the application progresses. Note that sealing deals involves data transfer and provisioning of equipment. SPs need to have everything ready before sealing, and if the application changes, they are exposed to great risk. Therefore, SPs will only commit to cooperating with us once we hold the DataCap allocation. The SPs we have contacted so far have committed to providing new nodes first and will use them for sealing deals when our application is successful.

cryptowhizzard commented 1 year ago

Hello @dpx-office

Thanks for your clarification.

I can move ahead once I get the data I need (SPs + contact information).

dpx-office commented 1 year ago

> Hello @dpx-office Thanks for your clarification. I can move ahead once i get the data i need. ( Sp's + Contact information ).

@cryptowhizzard Thanks for your work. The SP info is disclosed as follows; please help us sign the first round.

SP ID ---- Region ---- Name ---- Org
f02003553 ---- CN ---- Yang ---- Chengdu Jusha Cheng Box Technology Co., LTD
f02003731 ---- SGP ---- Gaobo ---- AirBook Private Limited
f02005062 ---- HK ---- kingkuang ---- kingkuang
f02004886 ---- SGP ---- Downey ---- MetaOne Deta Ltd

dpx-office commented 1 year ago

Hello @cryptowhizzard, is there anything else you need to know about the application? What else can I do to facilitate the process? Please let me know. Many thanks.

OpenGate01 commented 1 year ago

Since this is currently the first round of applications, there are no bot check results available for review. We think this application meets the Fil+ criteria based on the applicant's description and the distribution plan.

OpenGate01 commented 1 year ago

Request Proposed

Your Datacap Allocation Request has been proposed by the Notary

Message sent to Filecoin Network

bafy2bzacedtiqixpcwjmdjht355f6igfe547gp7rbyoy6dihck7gmzcfispyk

Address

f1fbpvedozoqtijqf4fwkifjhhlg3i2c6twyct2xi

Datacap Allocated

100.00TiB

Signer Address

f1im4hmtbfzqnx7ir74kdaiu4ynjhgqh3sdi2snla

Id

603acfae-2992-49ef-966d-22bc4d1353d5

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacedtiqixpcwjmdjht355f6igfe547gp7rbyoy6dihck7gmzcfispyk

METAVERSEDATAMINING commented 1 year ago

I noticed that the SPs you provided are concentrated in Asia. According to FIL+ rules, storage needs to be distributed across at least three regions. Since this is the first allocation and there is room for improvement, I will mark this application. I will approve this round.

METAVERSEDATAMINING commented 1 year ago

Request Approved

Your Datacap Allocation Request has been approved by the Notary

Message sent to Filecoin Network

bafy2bzacebskwnt5sbnvbgraztntlbrzzza23h37amirgjd2lg2uhpholfddg

Address

f1fbpvedozoqtijqf4fwkifjhhlg3i2c6twyct2xi

Datacap Allocated

100.00TiB

Signer Address

f17idrnfnxl2mbgcgr57a6z2c6lj2qx56gvm3336i

Id

603acfae-2992-49ef-966d-22bc4d1353d5

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacebskwnt5sbnvbgraztntlbrzzza23h37amirgjd2lg2uhpholfddg

dpx-office commented 1 year ago

checker:manualTrigger

filplus-checker-app[bot] commented 1 year ago

DataCap and CID Checker Report Summary[^1]

Storage Provider Distribution

✔️ Storage provider distribution looks healthy.

Deal Data Replication

✔️ Data replication looks healthy.

Deal Data Shared with other Clients[^3]

✔️ No CID sharing has been observed.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

[^2]: Deals from those addresses are combined into this report as they are specified with checker:manualTrigger

[^3]: To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

Full report

Click here to view the full report.

dpx-office commented 1 year ago

The DataCap allocation has been used up and the bot is not working yet. Encapsulation by the SPs has been stalled for a while. I need your assistance. @fabriziogianni7

dpx-office commented 1 year ago

The bot hasn't triggered here yet. I need your help. @simonkim0515 @Sunnyiscoming @fabriziogianni7

dpx-office commented 1 year ago

How can we expedite the application process? Has the issue with the bot been resolved? @fabriziogianni7

dpx-office commented 1 year ago

checker:manualTrigger

filplus-checker-app[bot] commented 1 year ago

DataCap and CID Checker Report Summary[^1]

Storage Provider Distribution

✔️ Storage provider distribution looks healthy.

Deal Data Replication

✔️ Data replication looks healthy.

Deal Data Shared with other Clients[^3]

✔️ No CID sharing has been observed.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

[^2]: Deals from those addresses are combined into this report as they are specified with checker:manualTrigger

[^3]: To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

Full report

Click here to view the full report.

dpx-office commented 1 year ago

Please help manually trigger the allocation. The current label display is abnormal. @simonkim0515 @fabriziogianni7

large-datacap-requests[bot] commented 1 year ago

DataCap Allocation requested

Request number 2

Multisig Notary address

f02049625

Client address

f1fbpvedozoqtijqf4fwkifjhhlg3i2c6twyct2xi

DataCap allocation requested

200TiB

Id

edfbd0b6-9119-4340-8f2d-073ab49a4159

large-datacap-requests[bot] commented 1 year ago

Stats & Info for DataCap Allocation

Multisig Notary address

f02049625

Client address

f1fbpvedozoqtijqf4fwkifjhhlg3i2c6twyct2xi

Rule to calculate the allocation request amount

100% of weekly dc amount requested

DataCap allocation requested

200TiB

Total DataCap granted for client so far

100TiB

Datacap to be granted to reach the total amount requested by the client (5PiB)

4.90PiB
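
The figures in this comment are consistent with each other, as a short calculation shows. The Python sketch below uses binary units (1 PiB = 1024 TiB). It is not the bot's code; the variable names are illustrative, and the values are copied from the fields above.

```python
TIB_PER_PIB = 1024

total_requested_tib = 5 * TIB_PER_PIB   # "Total DataCap requested": 5PiB
granted_so_far_tib = 100                # "Total DataCap granted for client so far"
weekly_request_tib = 200                # "Expected weekly DataCap usage rate"

# Rule quoted above: 100% of the weekly DataCap amount requested.
next_request_tib = 1.0 * weekly_request_tib

# DataCap still to be granted to reach the total requested.
remaining_tib = total_requested_tib - granted_so_far_tib
remaining_pib = remaining_tib / TIB_PER_PIB

print(f"{next_request_tib:.0f}TiB requested, {remaining_pib:.2f}PiB remaining")
```

The remainder is 5020 TiB, which rounds to the 4.90PiB shown in the report.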

Stats

Number of deals

1600

Number of storage providers

2

Previous DC Allocated

100TiB

Top provider

76

Remaining DC

0B

dpx-office commented 1 year ago

We are currently in the waiting stage for the notary's review, and making efforts to seek their assistance.

dpx-office commented 1 year ago

Still waiting to be reviewed....

kernelogic commented 1 year ago

Request Proposed

Your Datacap Allocation Request has been proposed by the Notary

Message sent to Filecoin Network

bafy2bzacedslbm6eej3g7f7fzbwovtp2gk2tjx5ehwlbucdrdb4w5cqdv5rqw

Address

f1fbpvedozoqtijqf4fwkifjhhlg3i2c6twyct2xi

Datacap Allocated

200.00TiB

Signer Address

f1yjhnsoga2ccnepb7t3p3ov5fzom3syhsuinxexa

Id

edfbd0b6-9119-4340-8f2d-073ab49a4159

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacedslbm6eej3g7f7fzbwovtp2gk2tjx5ehwlbucdrdb4w5cqdv5rqw

NDLABS-Leo commented 1 year ago

checker:manualTrigger

filplus-checker-app[bot] commented 1 year ago

DataCap and CID Checker Report Summary[^1]

Storage Provider Distribution

⚠️ 1 storage providers sealed more than 70% of total datacap - f02031264: 75.73%

Deal Data Replication

⚠️ 100.00% of deals are for data replicated across less than 3 storage providers.

Deal Data Shared with other Clients[^3]

✔️ No CID sharing has been observed.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

[^2]: Deals from those addresses are combined into this report as they are specified with checker:manualTrigger

[^3]: To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

Full report

Click here to view the full report.
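
The two warnings in this report correspond to two simple checks: one provider's share of the total, and how many providers hold each piece of data. The Python sketch below is not the checker bot's implementation. The deal records, the provider and piece names, and the choice to weight shares by deal size are all assumptions for illustration; only the thresholds (70% and 3 providers) are taken from the report text.

```python
from collections import defaultdict


def provider_shares(deals):
    """deals: list of (provider_id, piece_id, size). Returns provider -> share of total size."""
    totals = defaultdict(int)
    for provider, _piece, size in deals:
        totals[provider] += size
    grand_total = sum(totals.values())
    return {provider: size / grand_total for provider, size in totals.items()}


def low_replication_share(deals, min_providers=3):
    """Fraction of deals whose piece is held by fewer than min_providers providers."""
    holders = defaultdict(set)
    for provider, piece, _size in deals:
        holders[piece].add(provider)
    low = sum(1 for _p, piece, _s in deals if len(holders[piece]) < min_providers)
    return low / len(deals)


# Hypothetical deals: two providers, every piece stored exactly once.
deals = [
    ("sp-x", "piece-1", 32),
    ("sp-x", "piece-2", 32),
    ("sp-x", "piece-3", 32),
    ("sp-y", "piece-4", 32),
]

shares = provider_shares(deals)
for provider, share in sorted(shares.items()):
    if share > 0.70:
        print(f"warning: {provider} holds {share:.2%} of the total")
print(f"{low_replication_share(deals):.2%} of deals have fewer than 3 replicas")
```

With only two providers, as in the stats for this allocation, no piece can reach three distinct holders, so the replication warning would apply to every deal regardless of how the data is split.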

dpx-office commented 1 year ago

This is currently the second distribution and the report information is incorrect. Next, new service providers will join in, with quotas more dispersed, and storage will be significantly improved.

NDLABS-Leo commented 1 year ago

@dpx-office Thanks for the explanation. It seems to me that differing onboarding speeds among SPs can produce this kind of distribution. I hope to see a healthier report in the next round of the checker bot. There is no data sharing, which is good, and I am willing to support.

NDLABS-Leo commented 1 year ago

Request Approved

Your Datacap Allocation Request has been approved by the Notary

Message sent to Filecoin Network

bafy2bzacecnpq2iv36jdaawo74a5r7bvffrzgkl3wefunpbyljjjvl6stvmrq

Address

f1fbpvedozoqtijqf4fwkifjhhlg3i2c6twyct2xi

Datacap Allocated

200.00TiB

Signer Address

f1yayfsv6whu3rheviucvventj3y6t542xfpb47ei

Id

edfbd0b6-9119-4340-8f2d-073ab49a4159

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacecnpq2iv36jdaawo74a5r7bvffrzgkl3wefunpbyljjjvl6stvmrq

dpx-office commented 1 year ago

status update

dpx-office commented 1 year ago

Data Prep Phase...

dpx-office commented 1 year ago

PAUSE

github-actions[bot] commented 1 year ago

This application has not seen any responses in the last 10 days. This issue will be marked with Stale label and will be closed in 4 days. Comment if you want to keep this application open.

dpx-office commented 1 year ago

Data Prep Phase...

github-actions[bot] commented 1 year ago

This application has not seen any responses in the last 10 days. This issue will be marked with Stale label and will be closed in 4 days. Comment if you want to keep this application open.

dpx-office commented 1 year ago

In preparation.

dpx-office commented 1 year ago

In preparation.

dpx-office commented 1 year ago

In preparation

dpx-office commented 1 year ago

In preparation

dpx-office commented 1 year ago

In preparation

github-actions[bot] commented 1 year ago

This application has not seen any responses in the last 10 days. This issue will be marked with Stale label and will be closed in 4 days. Comment if you want to keep this application open.

-- Commented by Stale Bot.