filecoin-project / filecoin-plus-large-datasets

Hub for client applications for DataCap at a large scale

[DataCap Application] - < IvyPal > #1084

Closed sam001nnnhotmail closed 1 year ago

sam001nnnhotmail commented 2 years ago

Large Dataset Notary Application

To apply for DataCap to onboard your dataset to Filecoin, please fill out the following.

Core Information

Please respond to the questions below by replacing the text saying "Please answer here". Include as much detail as you can in your answer.

Project details

Share a brief history of your project and organization.

IvyPal is an international education brand for children under Yijia Intelligent Technology (Shanghai) Co., Ltd. It is a 1-1 online learning and social network program, which provides a platform for Chinese children to learn not only language from native English speakers but also real-life experiences.

Founded in 2012, Yijia Intelligent Technology (Shanghai) Co., Ltd. is a high-tech enterprise focusing on collaborative interactive software technology and product development. The flying broadcast collaborative learning platform built by the company uses the world's leading mobile whiteboard technology to achieve high-performance two-way synchronization of audio, video and handwriting between mobile terminals.

IvyPal was co-founded by a number of Silicon Valley elites, and their children also have backgrounds in prestigious American schools. Providing the best role models for children and helping them realize their dreams of a prestigious school is one of their original intentions for founding IvyPal.

What is the primary source of funding for this project?

Own funds and company business income

What other projects/ecosystem stakeholders is this project associated with?

Nope

Use-case details

Describe the data being stored onto Filecoin

The data we want to store on filecoin are internet dataset which can be shared with the public. They are publicly distributable teaching courses videos/live telecast.

Where was the data in this dataset sourced from?

self-produced videos by the team

Can you share a sample of the data? A link to a file, an image, a table, etc., are good ways to do this.


All our learning and social activities are 100% online across continents. DataCap would be used to store the recorded activities. You can check some public channels as below:
- Facebook: https://www.facebook.com/public/Ivy-Pal
- Instagram: https://www.instagram.com/ivy.pal/?hl=en
- Wechat: Ivypal-cn

There are also quite a few articles about Ivypal if you search Google, Baidu, etc.

Confirm that this is a public dataset that can be retrieved by anyone on the Network (i.e., no specific permissions or access rights are required to view the data).

Yes

What is the expected retrieval frequency for this data?

Frequently

For how long do you plan to keep this dataset stored on Filecoin?

We hope it will be a permanent archive, and we will keep updating our data stored on Filecoin.

DataCap allocation plan

In which geographies (countries, regions) do you plan on making storage deals?

China and North America

How will you be distributing your data to storage providers? Is there an offline data transfer process?

Offline

How do you plan on choosing the storage providers with whom you will be making deals? This should include a plan to ensure the data is retrievable in the future both by you and others.

We need support from North American miners and Greater China miners. Currently we prefer local SPs in China. If everything goes well, we will extend to other regions for more support.

How will you be distributing deals across storage providers?

fairly

Do you have the resources/funding to start making deals as soon as you receive DataCap? What support from the community would help you onboard onto Filecoin?

yes
large-datacap-requests[bot] commented 2 years ago

Thanks for your request! :exclamation: We have found some problems in the information provided.
- We could not find your Name in the information provided
- We could not find any Region in the information provided

Please, take a look at the request and edit the body of the issue providing all the required information.
large-datacap-requests[bot] commented 2 years ago

Thanks for your request! Everything looks good. :ok_hand:

A Governance Team member will review the information provided and contact you back pretty soon.

raghavrmadya commented 2 years ago

What's your relationship with IvyPal?

sam001nnnhotmail commented 2 years ago

What's your relationship with IvyPal?

Hi RG, I am a staff member at IvyPal.

sam001nnnhotmail commented 2 years ago

Hi, is there any further comment?

simonkim0515 commented 1 year ago

Please provide a breakdown of the 5 PiBs, sample of the data, and SPs you have been in touch with as well as a detailed allocation plan.

sam001nnnhotmail commented 1 year ago

Hi Simon

Appreciate your reply.

The data we are going to store on Filecoin is made up of:
- publicly distributable teaching course videos
- live telecast recordings
- funny teaching materials we are working with
- English movies/cartoons we use in teaching

Sample of Data:

All our learning and social activities are 100% online across continents.

You can check some public channels as below:

Facebook: https://www.facebook.com/public/Ivy-Pal

Instagram: https://www.instagram.com/ivy.pal/?hl=en

Wechat: Ivypal藤师亦友

https://mp.weixin.qq.com/s/J4lddl5d7aq1EL9R9KVRfw

https://mp.weixin.qq.com/s/_nFI6RTgSwD7iGXIiPC4sw

There are also quite a few articles about Ivypal if you search Google, Baidu, etc.

Our allocation plan is to reach out to some SPs in China first, and then possibly extend to some global SPs in North America.

In total, 5-8 SPs are in our plan to decentralize storage, and we will distribute the allocation as fairly as possible, with no more than 25% in one SP.

Thanks Sam


simonkim0515 commented 1 year ago

Datacap Request Trigger

Total DataCap requested

5PiB

Expected weekly DataCap usage rate

100TiB

Client address

f15nhptyoatd36b6oa3maw53brb5amkdxjkpxt3ha

large-datacap-requests[bot] commented 1 year ago

DataCap Allocation requested

Multisig Notary address

f02049625

Client address

f15nhptyoatd36b6oa3maw53brb5amkdxjkpxt3ha

DataCap allocation requested

50TiB

Id

175062c6-a4ef-4e3c-9298-45506a98dd39

newwebgroup commented 1 year ago

Hey Client,

About KYC & KYB:

1. Could you send an email to filplus-app-review@fil.org? The content should include the number of the LDN application. If possible, please attach copies of the business license and other valid certificates.

2. Can you provide more detailed information about the other storage providers participating in this program? For example, can you list the SPs you have contacted so far? In addition, please specify your criteria for selecting SPs. I see that your retrieval frequency is frequent. If possible, please provide specific plans and what you plan to do with SPs to meet the high frequency of retrieval.

3. How large is your existing dataset? How much is the data growth per month?

In addition, I observed that your WeChat official account is very slow to update and seems very inactive. Please explain.

4. Please provide more data samples to prove that you need 5 PiB of storage space. There is very little data on social media, which is insufficient to support such a large amount of application demand.
sam001nnnhotmail commented 1 year ago

Hi There

Sorry for the late reply; I have had COVID-19 these past few days. Please see the answers to your concerns below.

  1. Could you send an email to @.**@.> ?

Have done that.

  2. Can you provide more detailed information about other storage providers participated in this program, such as you can list SPs you have contacted with at present? In addition, please specify your criteria for selecting SPs.

I have explained some of this in the application. Regarding the criteria for selecting SPs, given the high frequency of retrieval, we prefer SPs with a stable network first and speed second.

  3. How large is your existing dataset? How much is the data growth per month?

Currently it is around 3P, and monthly growth might be around 100T.

  4. Wechat issue

I will check with the tech team.

Thanks


sam001nnnhotmail commented 1 year ago

[Image attachment: FB06A9F2-AD00-4FBD-8500-46D68D9E6FD0]

newwebgroup commented 1 year ago

  1. Please list the IDs of the currently confirmed SPs, and list the network bandwidth of each SP.

  2. Please provide screenshots or any evidence to prove that you have a 3P dataset.

sam001nnnhotmail commented 1 year ago

Hi There

1. We are approaching some SPs within the top 200 on the filescan and have not made a final decision yet. If you have a good suggestion, kindly let me know.

2. FYI, see the attached image as partial evidence. Btw, we plan to store 2-3 copies.

[Image attachment: 335658C5-44CF-48CF-9E92-7502DE9DE148]

kernelogic commented 1 year ago

Request Proposed

Your Datacap Allocation Request has been proposed by the Notary

Message sent to Filecoin Network

bafy2bzacedviv35nfnok5pgh6gguncocnzpbp6hwb7gcqfk3y77yv7ryvvlc4

Address

f15nhptyoatd36b6oa3maw53brb5amkdxjkpxt3ha

Datacap Allocated

50.00TiB

Signer Address

f1yjhnsoga2ccnepb7t3p3ov5fzom3syhsuinxexa

Id

175062c6-a4ef-4e3c-9298-45506a98dd39

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacedviv35nfnok5pgh6gguncocnzpbp6hwb7gcqfk3y77yv7ryvvlc4

kernelogic commented 1 year ago

The client has provided enough info for me to support.

stcloudlisa commented 1 year ago

Request Approved

Your Datacap Allocation Request has been approved by the Notary

Message sent to Filecoin Network

bafy2bzacecuu7suvpi6o6iq4ab3hz6qqtqprxn2i4gtjfkncaziveuuercg5u

Address

f15nhptyoatd36b6oa3maw53brb5amkdxjkpxt3ha

Datacap Allocated

50.00TiB

Signer Address

f1jvvltduw35u6inn5tr4nfualyd42bh3vjtylgci

Id

175062c6-a4ef-4e3c-9298-45506a98dd39

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacecuu7suvpi6o6iq4ab3hz6qqtqprxn2i4gtjfkncaziveuuercg5u

herrehesse commented 1 year ago

Dear Applicant,

Due to the recent increase in erroneous/wrong Filecoin+ data, on behalf of the entire community, we feel compelled to go deeper into datacap requests, to ensure that the overall value of the Filecoin network and Filecoin+ program increases and that the program is not abused.

Please answer the questions below as comprehensively as possible.

Customer data

We expect that for the onboarding of customers with the scale of an LDN there would have been at least multiple email and perhaps several chat conversations preceding it. A single email with an agreement does not qualify here.

Should this be solely for acquiring datacap, this is of course out of the question. The customer must have a legitimate reason for wanting to use the Filecoin+ program, which is intended as a program to store useful and public datasets on the network.

(As an intermediate solution Filecoin offers the FIL-E program or the glif.io website for business datasets that do not meet the requirements for a Filecoin+ dataset)

Files and Processing

Hopefully you understand the caution the overall community has for onboarding the wrong data. We understand the increased need for Filecoin+, however, we must not allow the program to be misused. Everything depends on a valuable and useful network, let's do our best to make this happen. Together.

sam001nnnhotmail commented 1 year ago

Hi There

I welcome the concerns here. I may not understand all of them clearly, but I would like to answer them as well as I can. I hope it is useful. Although there are various questions, I prefer to sort them into 2 groups and then answer each group comprehensively. Thanks

Group 1:

- Could you demonstrate exactly how and to what extent customer contact occurred?
- Did the customer specify the amount of data involved in this relevant correspondence?
- Why does the customer in question want to use the Filecoin+ program?
- Why is the customer data considered Filecoin+ eligible?

Answer: All videos/materials are produced and owned by ourselves. As for why we chose Filecoin+: we previously took part in another project on Filecoin and had a smooth experience and great support, as expected. So we decided to try Filecoin+ after evaluation - we are qualified.

Group 2:

- Could you please demonstrate to us how you envision processing and transporting the customer data in question to any location for preparation?
- Would you demonstrate to us that the customer, the preparer and the intended storage providers all have adequate bandwidth to process the set with its corresponding size?
- Would you tell us how the data set preparer takes into account the prevention of duplicates in order to prevent data cap abuse?

Answer: China and North America first, and then it depends on the resources in hand. 10G bandwidth is required. If you mean how many copies of the data we plan to store: 3 copies.

Hopefully you understand the caution the overall community has for onboarding the wrong data. We understand the increased need for Filecoin+, however, we must not allow the program to be misused. Everything depends on a valuable and useful network, let's do our best to make this happen. Together.

Response: This is the common goal we share together. Appreciate that.

large-datacap-requests[bot] commented 1 year ago

DataCap Allocation requested

Request number 2

Multisig Notary address

f02049625

Client address

f15nhptyoatd36b6oa3maw53brb5amkdxjkpxt3ha

DataCap allocation requested

100TiB

Id

1a14c25f-0a7b-4e8d-8387-9601f3f745b2

large-datacap-requests[bot] commented 1 year ago

Stats & Info for DataCap Allocation

Multisig Notary address

f01858410

Client address

f15nhptyoatd36b6oa3maw53brb5amkdxjkpxt3ha

Last two approvers

1LISA2 & kernelogic

Rule to calculate the allocation request amount

100% of weekly dc amount requested

DataCap allocation requested

100TiB

Total DataCap granted for client so far

50TiB

Datacap to be granted to reach the total amount requested by the client (5PiB)

4.95PiB

Stats

| Number of deals | Number of storage providers | Previous DC Allocated | Top provider | Remaining DC |
| --- | --- | --- | --- | --- |
| 821 | 5 | 50TiB | 25.82 | 9.78TiB |
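
The 4.95PiB figure in the stats above follows from the total requested minus what has been granted so far. A minimal sketch of that arithmetic, assuming binary units (1 PiB = 1024 TiB); the function name `remaining_pib` is illustrative and not part of the bot:

```python
TIB_PER_PIB = 1024  # assumption: binary units, 1 PiB = 1024 TiB


def remaining_pib(total_requested_pib: float, granted_tib: float) -> float:
    """Return the DataCap still to be granted, in PiB."""
    return total_requested_pib - granted_tib / TIB_PER_PIB


# Values taken from the stats above: 5PiB requested, 50TiB granted so far.
remaining = remaining_pib(5, 50)
print(f"{remaining:.2f} PiB")
```

Under these assumptions the result rounds to the 4.95PiB reported by the bot.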
filplus-checker-app[bot] commented 1 year ago

DataCap and CID Checker Report[^1]

Storage Provider Distribution

The below table shows the distribution of storage providers that have stored data for this client.

If this is the first time a provider takes verified deal, it will be marked as new.

For most of the datacap application, below restrictions should apply.

Since this is the 3rd allocation, the following restrictions have been relaxed:

⚠️ f01972110 has unknown IP location.

| Provider | Location | Total Deals Sealed | Percentage | Unique Data | Duplicate Deals |
| --- | --- | --- | --- | --- | --- |
| f02006374 | Hong Kong, Central and Western, HK (7Road International HK Limited) | 2.09 TiB | 9.41% | 2.09 TiB | 0.00% |
| f01890456 | Los Angeles, California, US (Zenlayer Inc) | 6.44 TiB | 28.93% | 6.44 TiB | 0.00% |
| f01854772 | Los Angeles, California, US (Zenlayer Inc) | 6.38 TiB | 28.65% | 6.38 TiB | 0.00% |
| f01834291 | Los Angeles, California, US (Zenlayer Inc) | 5.88 TiB | 26.40% | 5.88 TiB | 0.00% |
| f01972110 | Unknown (Unknown) | 1.47 TiB | 6.60% | 1.47 TiB | 0.00% |
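
The applicant earlier stated a plan to keep no more than 25% of the allocation with any one SP. A minimal sketch of how the sealed sizes in the table could be checked against such a cap; the helper names `shares` and `over_cap` are hypothetical, the 25% threshold is the applicant's stated target rather than a checker rule, and shares recomputed from the rounded sizes may differ from the reported percentages in the second decimal:

```python
# Sealed sizes in TiB, copied from the provider table above.
sealed_tib = {
    "f02006374": 2.09,
    "f01890456": 6.44,
    "f01854772": 6.38,
    "f01834291": 5.88,
    "f01972110": 1.47,
}


def shares(sealed: dict[str, float]) -> dict[str, float]:
    """Return each provider's share of the total sealed data, in percent."""
    total = sum(sealed.values())
    return {provider: 100 * size / total for provider, size in sealed.items()}


def over_cap(sealed: dict[str, float], cap_percent: float = 25.0) -> list[str]:
    """Return the providers whose share exceeds the cap, sorted by ID."""
    return sorted(p for p, pct in shares(sealed).items() if pct > cap_percent)


for provider, pct in shares(sealed_tib).items():
    print(f"{provider}: {pct:.2f}%")
print("Over 25% cap:", over_cap(sealed_tib))
```

With these figures, the three Los Angeles providers each hold more than a quarter of the sealed data.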

Provider Distribution

Deal Data Replication

The below table shows how the unique data are replicated across storage providers.

Since this is the 3rd allocation, the following restrictions have been relaxed:

⚠️ 100.00% of deals are for data replicated across less than 3 storage providers.

| Unique Data Size | Total Deals Made | Number of Providers | Deal Percentage |
| --- | --- | --- | --- |
| 22.25 TiB | 22.25 TiB | 1 | 100.00% |

Replication Distribution

Deal Data Shared with other Clients

The below table shows how many unique data are shared with other clients. Usually different applications own different data and should not resolve to the same CID.

However, this could be possible if all below clients use same software to prepare for the exact same dataset or they belong to a series of LDN applications for the same dataset.

⚠️ CID sharing has been observed.

| Other Client | Application | Total Deals Affected | Unique CIDs | Approvers |
| --- | --- | --- | --- | --- |
| f1ed7vkc4uqrjlj4pzoo7pdytlbh6wphleptud2ji | SuperMeofficial | 224.00 GiB | 7 | 1Defil2022, 1psh0691 |
| f16efumcgpjamyhfrflxsu2brd7wrhniqwqssfpzq | F00TAGE | 128.00 GiB | 4 | 1kernelogic, 1xingjitansuo |
| f1gmiepn73zoa5gz2oiqyugjmrsecwj5qxd42vmyi | `` | 32.00 GiB | 1 | Unknown |
| f1ng6g57r4u62q67u6lm33ftijfsggyzzjzb2l4cy | NFTSTAR | 32.00 GiB | 1 | 11ane-1, 1Defil2022, 2newwebgroup, 1psh0691, 1stcouldlisa, 2Tom-OriginStorage |
| f1px4vu4r5nvtiz6y774b7gjshmgy643jgtztw4wa | 2amok | 32.00 GiB | 1 | 1stcouldlisa, 1xingjitansuo |

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

herrehesse commented 1 year ago

@sam001nnnhotmail Thank you for your answer.

"after evaluation - we are qualified." Self-produced videos by your team do not qualify for datacap. You claim to have private data, and this is just not the right place for you.

Maybe you can try to store your data on Filecoin through regular deals? Can I help?

cryptowhizzard commented 1 year ago

Hello,

You stated "The data we want to store on filecoin are internet dataset which can be shared with the public. They are publicly distributable teaching courses videos/live telecast."

However, I see you are storing data from NFTSTAR, and that application #960 is blacklisted because of CID sharing and fraudulent behaviour. Please explain what the goal of this application is and what you intend to store.

I would also like to see a list of CIDs of the dataset you have built so that we can see the data that you are going to distribute.

sam001nnnhotmail commented 1 year ago

Hey

Regarding the data sharing issue you mentioned, we have checked, and the main cause is a technical oversight in some steps when the SP processed the data. We will no longer work with the SPs that were found to have mishandled data. We will reach out to qualified SPs that support public retrieval and provide CIDs. We will require SPs to do due diligence with multi-layer verification for further data processing.

Thanks for your timely callout. I guarantee there will be no more data sharing in future, and I look forward to further support from the community and you.


cryptowhizzard commented 1 year ago

Hello @sam001nnnhotmail

I would also like to see a list of CIDs of the dataset you have built so that we can see the data that you are going to distribute.

sam001nnnhotmail commented 1 year ago

Sure. We will share the CIDs of the stored dataset on GitHub.

Thanks for your comments and support.


github-actions[bot] commented 1 year ago

This application has not seen any responses in the last 10 days. This issue will be marked with Stale label and will be closed in 4 days. Comment if you want to keep this application open.

github-actions[bot] commented 1 year ago

This application has not seen any responses in the last 14 days, so for now it is being closed. Please feel free to contact the Fil+ Gov team to re-open the application if it is still being processed. Thank you!

large-datacap-requests[bot] commented 10 months ago

Thanks for your request! :exclamation: We have found some problems in the information provided.
- We could not find Website / Social Media field in the information provided
- We could not find Total amount of DataCap being requested (between 500 TiB and 5 PiB) field in the information provided
- We could not find Weekly allocation of DataCap requested (usually between 1-100TiB) field in the information provided
- We could not find On-chain address for first allocation field in the information provided
- We could not find Data Type of Application field in the information provided

Please, take a look at the request and edit the body of the issue providing all the required information.
large-datacap-requests[bot] commented 8 months ago

RootKeyHolders have approved multisig account. You can now request first datacap release