filecoin-project / filecoin-plus-large-datasets

Hub for client applications for DataCap at a large scale
110 stars 62 forks source link

[DataCap Application] Zhejiang Geely Holding Group - <Project Name> #1019

Closed johnhash1992 closed 1 year ago

johnhash1992 commented 1 year ago

Large Dataset Notary Application

To apply for DataCap to onboard your dataset to Filecoin, please fill out the following.

Core Information

Please respond to the questions below by replacing the text saying "Please answer here". Include as much detail as you can in your answer.

Project details

Share a brief history of your project and organization.

Geely Holding Group is committed to becoming a globally competitive and influential intelligent electric mobility and energy service technology company, with businesses covering automobiles and upstream and downstream industry chains, intelligent mobility services, green transportation, digital technology, etc. The group is headquartered in Hangzhou, and its subsidiaries Geely, Lynk & Co, Krypton, Geometry, Volvo, Polestar, Lotus, Yinglun Electric Vehicles, Long-range New Energy Commercial Vehicles, Radar New Energy Vehicles, Cao Cao Travel, etc. are actively participating in market competition around their respective brand positioning. The Group takes the electrification and intelligent transformation of the automobile industry as its core and builds a technology moat and strengthens the technology ecosystem in the frontier technology fields of new energy technology, shared mobility, vehicle networking, intelligent driving, and vehicle chips..

What is the primary source of funding for this project?

Own capital.

What other projects/ecosystem stakeholders is this project associated with?

None

Use-case details

Describe the data being stored onto Filecoin

Geely automobile product introduction and review videos.

Where was the data in this dataset sourced from?

Company’s product and marketing department.

Can you share a sample of the data? A link to a file, an image, a table, etc., are good ways to do this.

https://drive.google.com/drive/folders/1Z8NVOVj_qj7-gDqrRGH4VInn8OAQxyfR?usp=sharing

Confirm that this is a public dataset that can be retrieved by anyone on the Network (i.e., no specific permissions or access rights are required to view the data).

Yes, these are publicly available data.

What is the expected retrieval frequency for this data?

80% of our data does not need to be retrieved, and 20% has no fixed retrieval frequency, depending on the business.

For how long do you plan to keep this dataset stored on Filecoin?

Permanent storage.

DataCap allocation plan

In which geographies (countries, regions) do you plan on making storage deals?

North America.

How will you be distributing your data to storage providers? Is there an offline data transfer process?

We plan to distribute the data to SPs via online. If there are SPs in close proximity, we will consider manual transmission.

How do you plan on choosing the storage providers with whom you will be making deals? This should include a plan to ensure the data is retrievable in the future both by you and others.

We welcome any quality SPs but must ensure that retrieval requirements are met before distribution can take place. We will check this from time to time.

How will you be distributing deals across storage providers?

We will divide the data equally and send it to different miners.

Do you have the resources/funding to start making deals as soon as you receive DataCap? What support from the community would help you onboard onto Filecoin?

Yes, we are available for trading at any time.
large-datacap-requests[bot] commented 1 year ago

Thanks for your request! Everything looks good. :ok_hand:

A Governance Team member will review the information provided and contact you back pretty soon.

johnhash1992 commented 1 year ago

Hi,@galen-mcandrew @raghavrmadya 21 days passed, what else do I need to do?

raghavrmadya commented 1 year ago

You need to justify how an electric mobility and energy service technology company has 5 PiBs of data.

johnhash1992 commented 1 year ago

Hello, @raghavrmadya Zhejiang Geely Holding Group (“Geely Holding” / “the Group”) was founded in 1986. In 1997, Geely Holding entered the automotive industry and has since focused our core business on the development and production of automobiles. Since 2012, Geely Holding has ranked among the Fortune Global 500 for eleven consecutive years (ranked 229th in 2022). Geely Holding is committed to becoming a globally competitive and influential smart electric mobility technology enterprise and energy service provider, engaged in automotive, upstream and downstream industrial chains, intelligent travel services, green transportation capacity, digital technology, etc. We have over a hundred species models of commercial vehicles on sale, each of which we need to take high definition pictures as well as promotional videos of the vehicles and test videos of the vehicles to promote them on all major social media. At least 100T of data per week. Currently we have accumulated no less than 5P of data. Filecoin is the diverse storage method we expect to choose. You can view our business description through http://zgh.com/our-business/?lang=en

johnhash1992 commented 1 year ago

@raghavrmadya hello,do I need to provide something else?

raghavrmadya commented 1 year ago

Unblocking from the trigger as the client has been responsive and holistic in answer queries. Notaries are requested to please conduct due diligence openly as they seem fit.

raghavrmadya commented 1 year ago

Datacap Request Trigger

Total DataCap requested

5PiB

Expected weekly DataCap usage rate

100TiB

Client address

f1imiomfkdnrnu4g6lykbyzmepbvsnbta44jylfla

large-datacap-requests[bot] commented 1 year ago

DataCap Allocation requested

Multisig Notary address

f02049625

Client address

f1imiomfkdnrnu4g6lykbyzmepbvsnbta44jylfla

DataCap allocation requested

50TiB

Id

9e1145e7-66a5-4553-bf5b-86535474da2f

raghavrmadya commented 1 year ago

@johnhash1992, can you share what is your relationship with the company?

celine3650 commented 1 year ago

@johnhash1992, would like to get a call with you and unblock the datacap application once we can validate the relationship and understand your use case.

liyunzhi-666 commented 1 year ago

Has the client sent an email to the filplus-app-review@fil.org for the simple KYB? @raghavrmadya

raghavrmadya commented 1 year ago

@Sunnyiscoming could you please confirm the KYB?

johnhash1992 commented 1 year ago

Hi guys, I'm Zhou from the Digital Technology Department of Geely Holding Group. The dataset including public video data that we are applying for this LDN application is mainly from Digital Technology Department. Here is a confirmation that we have sent the kYB email to the Fil+ team.

PS. The other application might submitted by other colleagues, which I have no idea about that.

WechatIMG4
Sunnyiscoming commented 1 year ago

@johnhash1992 It seems that you use your private email address send this email not the email address with your official domain. Could you send an email to filplus-app-review@fil.org with your official domain in order to confirm your identity? Email name should includes the issue id #1019. It is best to include your company's business license as well as the authorization sheet that authorizes the data to be stored on the Filecoin network and stamped with the official seal.

johnhash1992 commented 1 year ago

@johnhash1992 It seems that you use your private email address send this email not the email address with your official domain. Could you send an email to filplus-app-review@fil.org with your official domain in order to confirm your identity? Email name should includes the issue id #1019. It is best to include your company's business license as well as the authorization sheet that authorizes the data to be stored on the Filecoin network and stamped with the official seal.

@Sunnyiscoming I sent kyc email on September 27. For privacy reasons, the sensitive information in the screenshot is covered

image
Defil2022 commented 1 year ago

@johnhash1992 Does your dataset have been prepared yet? When will you be able to onboard your data on the network if you get the Datacap?

johnhash1992 commented 1 year ago

Hi,@Defil2022 YES,we have prepared datasets ready to transfer data

Defil2022 commented 1 year ago

Well, thanks for your reply. @johnhash1992 since you have done the KYB, I'd love to see more famous enterprises join in the Filecoin network. will support it. and I will keep in touch about this ticket.

Defil2022 commented 1 year ago

Request Proposed

Your Datacap Allocation Request has been proposed by the Notary

Message sent to Filecoin Network

bafy2bzacecmiuxusd6ahdzpju3phqf23oyo3cgjdgweq4arov4z4zppvaff2c

Address

f1imiomfkdnrnu4g6lykbyzmepbvsnbta44jylfla

Datacap Allocated

50.00TiB

Signer Address

f1dnb3uz7sylxk6emti3ififcvu3nlufnnsjui6ea

Id

9e1145e7-66a5-4553-bf5b-86535474da2f

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacecmiuxusd6ahdzpju3phqf23oyo3cgjdgweq4arov4z4zppvaff2c

YuanHeHK commented 1 year ago

Zhejiang Geely Holding Group is a very well-known car manufacturing company in China. The application applicant contacted me and showed me sample data, promising that the data was authentic and retrievable. So I will support this signing and continue to follow the app.

YuanHeHK commented 1 year ago

Request Approved

Your Datacap Allocation Request has been approved by the Notary

Message sent to Filecoin Network

bafy2bzacedqx3ahdsdjjk5bnwyiy7s4h2nn5uwz6pmcflhxjcxrhtviwkqmzc

Address

f1imiomfkdnrnu4g6lykbyzmepbvsnbta44jylfla

Datacap Allocated

50.00TiB

Signer Address

f1fg6jkxsr3twfnyhdlatmq36xca6sshptscds7xa

Id

9e1145e7-66a5-4553-bf5b-86535474da2f

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacedqx3ahdsdjjk5bnwyiy7s4h2nn5uwz6pmcflhxjcxrhtviwkqmzc

Sunnyiscoming commented 1 year ago

@YuanHeHK Have you do KYB for this client? Can you share some details?

raghavrmadya commented 1 year ago

There is an another application open. one of these is fake - https://github.com/filecoin-project/filecoin-plus-large-datasets/issues/1067

Notaries are requested to pause signing and conduct further due diligenc

YuanHeHK commented 1 year ago

@Sunnyiscoming @raghavrmadya I checked the domain name before I signed http://zgh.com/. I have also checked the company information corresponding to the domain name. 1 2

I very much hope that such influential large enterprises can enter the Filecoin storage system. But now that #1019 and #1067 look a bit controversial, @johnhash1992 perhaps you should provide more information to illustrate it.

NDLABS-Leo commented 1 year ago

As far as I know, Geely is a large enterprise, they may be applications for different projects submitted by different organizations. @raghavrmadya

filplus-checker commented 1 year ago

DataCap and CID Checker Report[^1]

If this is the first time a provider takes verified deal, it will be marked as new.

For most of the datacap application, below restrictions should apply.

⚠️ f01983523 has sealed 62.11% of total datacap.

⚠️ 86.47% of total deal sealed by f01983523 are duplicate data.

⚠️ f01983523 has unknown IP location.

⚠️ f01924258 has sealed 37.89% of total datacap.

⚠️ 84.35% of total deal sealed by f01924258 are duplicate data.

⚠️ f01924258 has unknown IP location.

⚠️ All storage providers are located in the same region.

Provider Location Total Deals Sealed Percentage Unique Data Duplicate Deals
f01983523 Unknown 11.78 TiB 62.11% 1.59 TiB 86.47%
f01924258 Unknown 7.19 TiB 37.89% 1.13 TiB 84.35%

Provider Distribution

Deal Data Replication

The below table shows how each many unique data are replicated across storage providers.

⚠️ 100.00% of deals are for data replicated across less than 4 storage providers.

Unique Data Size Total Deals Made Number of Providers Deal Percentage
2.72 TiB 18.97 TiB 1 100.00%

Replication Distribution

Deal Data Shared with other Clients

The below table shows how many unique data are shared with other clients. Usually different applications owns different data and should not resolve to the same CID.

⚠️ CID sharing has been observed.

Other Client Application Total Deals Affected Unique CIDs Verifier
f1mhyxd4unemmhrw4dbhjcovivayrj3tyactezmzq GOLDEN SECURITY 7.25 TiB 40 LDN v3 multisig

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

Sunnyiscoming commented 1 year ago

There is an another application open. one of these is fake - https://github.com/filecoin-project/filecoin-plus-large-datasets/issues/1067

Notaries are requested to pause signing and conduct further due diligenc

@johnhash1992 Can you send KYB email again to filplus-app-review@fil.org?

johnhash1992 commented 1 year ago

@raghavrmadya @Sunnyiscoming , HI, I send KYB email again.

image

1067 i guess it's a apply from another department

raghavrmadya commented 1 year ago

@Defil2022 @YuanHeHK , please check client's dal making behavior highlighted by CID checker bot and client is also rquested to send email from again. We can't see email in the email sent image

large-datacap-requests[bot] commented 1 year ago

DataCap Allocation requested

Request number 2

Multisig Notary address

f02049625

Client address

f1imiomfkdnrnu4g6lykbyzmepbvsnbta44jylfla

DataCap allocation requested

100TiB

Id

5771aa9d-b20e-4d45-84fd-3bee20badb61

large-datacap-requests[bot] commented 1 year ago

Stats & Info for DataCap Allocation

Multisig Notary address

f01858410

Client address

f1imiomfkdnrnu4g6lykbyzmepbvsnbta44jylfla

Last two approvers

fireflyHZ & DeFIL123

Rule to calculate the allocation request amount

100% of weekly dc amount requested

DataCap allocation requested

100TiB

Total DataCap granted for client so far

50TiB

Datacap to be granted to reach the total amount requested by the client (5 PiB)

4.95PiB

Stats

Number of deals Number of storage providers Previous DC Allocated Top provider Remaining DC
1596 2 50TiB 75.94 128GiB
filplus-checker-app[bot] commented 1 year ago

DataCap and CID Checker Report[^1]

Storage Provider Distribution

The below table shows the distribution of storage providers that have stored data for this client.

If this is the first time a provider takes verified deal, it will be marked as new.

For most of the datacap application, below restrictions should apply.

Since this is the 3rd allocation, the following restrictions have been relaxed:

⚠️ f01983523 has sealed 83.99% of total datacap.

⚠️ 27.01% of total deal sealed by f01983523 are duplicate data.

⚠️ f01983523 has unknown IP location.

⚠️ 84.35% of total deal sealed by f01924258 are duplicate data.

⚠️ f01924258 has unknown IP location.

⚠️ All storage providers are located in the same region.

Provider Location Total Deals Sealed Percentage Unique Data Duplicate Deals
f01983523 Unknown
Unknown
37.72 TiB 83.99% 27.53 TiB 27.01%
f01924258 Unknown
Unknown
7.19 TiB 16.01% 1.13 TiB 84.35%

Provider Distribution

Deal Data Replication

The below table shows how each many unique data are replicated across storage providers.

Since this is the 3rd allocation, the following restrictions have been relaxed:

⚠️ 100.00% of deals are for data replicated across less than 3 storage providers.

Unique Data Size Total Deals Made Number of Providers Deal Percentage
28.66 TiB 44.91 TiB 1 100.00%

Replication Distribution

Deal Data Shared with other Clients

The below table shows how many unique data are shared with other clients. Usually different applications owns different data and should not resolve to the same CID.

However, this could be possible if all below clients use same software to prepare for the exact same dataset or they belong to a series of LDN applications for the same dataset.

⚠️ CID sharing has been observed.

Other Client Application Total Deals Affected Unique CIDs Approvers
f1mhyxd4unemmhrw4dbhjcovivayrj3tyactezmzq GOLDEN SECURITY 7.25 TiB 40 Unknown

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

cryptowhizzard commented 1 year ago

Cid sharing with https://github.com/filecoin-project/filecoin-plus-large-datasets/issues/1019 ( Golden Security ).

Please explain.

johnhash1992 commented 1 year ago

Hi,@cryptowhizzard

I'm sorry to hear that. We understand that this is because the sp we contacted was distributing the data and some of our data was distributed incorrectly due to technical issues. We will be adjusting our current collaboration for a short period of time to ensure that this issue does not occur in the next data storage.

cryptowhizzard commented 1 year ago

Hi @johnhash1992

I would like you to fill out this form to provide us with the necessary information to make a educated decision on your LDN request if we would like to support it.

johnhash1992 commented 1 year ago

@cryptowhizzard form submited

cryptowhizzard commented 1 year ago

Hi @johnhash1992

Thanks for filling in.

Can you let us know who is taking care of the data packing now as you indicated that you own the data and are the customer. If you can please send me their contact info to kyc@dcent.nl

The miner distribution that you have sent us looks ok. Miners have an OK reputation.

johnhash1992 commented 1 year ago

@cryptowhizzard Hello sir, because our business-level regulations do not allow us to do so. Our requirements for storage service providers are also written in the cooperation agreement, so the storage service providers we contact will ensure that our data is stored in accordance with the rules. As a client, I also make the data public. Hope to get your support.

herrehesse commented 1 year ago

Hello @johnhash1992 please explain why your "business-level regulations" do not allow you to explain who is packing your data?

johnhash1992 commented 1 year ago

@herrehesse Because we signed a non-disclosure agreement

lvschouwen commented 1 year ago

@johnhash1992 So you confirm this LDN is not public data that cannot be retrieve and you are not able to disclose any information because of signed NDAs.

Shall we stop this nonsense and close this LDN as we can conclude this is all bogus?

herrehesse commented 1 year ago

@johnhash1992 Could you please provide full disclosure of the miner ID's, the business names, and the entity responsible for the data preparation for the FIL+ program? As the community places a high value on transparency, the NDA you signed regarding the data preparation is not relevant to us.

github-actions[bot] commented 1 year ago

This application has not seen any responses in the last 10 days. This issue will be marked with Stale label and will be closed in 4 days. Comment if you want to keep this application open.

github-actions[bot] commented 1 year ago

This application has not seen any responses in the last 14 days, so for now it is being closed. Please feel free to contact the Fil+ Gov team to re-open the application if it is still being processed. Thank you!

aggregation-and-compliance-bot[bot] commented 9 months ago
Client f01984645 does not follow the datacap usage rules. More info here. This application has been failing the requirements for 7 days. Please take appropiate action to fix the following DataCap usage problems. Criteria Treshold Reason
Percent of used DataCap stored with top provider < 75 The percent of Data from the client that is stored with their top provider is 75.75%. This should be less than 75%
large-datacap-requests[bot] commented 8 months ago

Thanks for your request! :exclamation: We have found some problems in the information provided. We could not find Website \/ Social Media field in the information provided We could not find Total amount of DataCap being requested (between 500 TiB and 5 PiB) field in the information provided We could not find Weekly allocation of DataCap requested (usually between 1-100TiB) field in the information provided We could not find On-chain address for first allocation field in the information provided We could not find Data Type of Application field in the information provided

Please, take a look at the request and edit the body of the issue providing all the required information.
large-datacap-requests[bot] commented 6 months ago

RootKeyHolders have approved multisig account. You can now request first datacap release