fidlabs / Open-Data-Pathway

[DataCap Application] <K12 International Education Platform> - < datastorage> #37

Closed 1613557499 closed 2 months ago

1613557499 commented 5 months ago

Data Owner Name

K12 International Education Platform

Data Owner Country/Region

Singapore

Data Owner Industry

Education & Training

Website

http://www.topschools.cn/index?_l=en

Social Media Handle

顶思

Social Media Type

WeChat

What is your role related to the dataset

Dataset Owner

Total amount of DataCap being requested

5PiB

Expected size of single dataset (one copy)

512TiB

Number of replicas to store

4

Weekly allocation of DataCap requested

512TiB
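
As a quick consistency check on the figures above, the following sketch multiplies the single-copy size by the replica count and compares the result with the total requested. It is only illustrative arithmetic on the numbers in this form; the helper name is hypothetical.

```python
TIB_PER_PIB = 1024


def datacap_needed_tib(single_copy_tib: int, replicas: int) -> int:
    """DataCap needed to store every replica of one dataset copy."""
    return single_copy_tib * replicas


# Values taken from the form above.
needed_tib = datacap_needed_tib(single_copy_tib=512, replicas=4)
requested_tib = 5 * TIB_PER_PIB
weeks_at_requested_rate = needed_tib / 512  # weekly allocation requested: 512 TiB

print(f"needed:    {needed_tib} TiB ({needed_tib / TIB_PER_PIB} PiB)")
print(f"requested: {requested_tib} TiB ({requested_tib / TIB_PER_PIB} PiB)")
print(f"weeks to use the needed amount: {weeks_at_requested_rate}")
```

With these inputs, four replicas of one 512 TiB copy come to 2 PiB, so the 5 PiB request exceeds that figure by 3 PiB.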

On-chain address for first allocation

f1dvulxgyfvnv3fqwjpelfhictonbf3vytdgao4xy

Data Type of Application

Public, Open Commercial/Enterprise

Custom multisig

Identifier

No response

Share a brief history of your project and organization

TopSchools was founded on August 28, 2016. As a resource aggregation platform for K12 international education, TopSchools is committed to supporting the sustainable development of the international school market through community communication, data research, teacher training, and talent recruitment. TopSchools is a close partner of international schools and international education institutions, and provides the industry with information on international education policies, operations, investments, schools, and people. We have organized various online and offline activities, cooperated with examination bureaus, top global universities, research institutions, embassies, and consulates, and provided services and support for more than 600 schools across the country. We are also linked to thousands of quality school service providers around the world, offering a variety of solutions for schools.

Is this project associated with other projects/ecosystem stakeholders?

No

If answered yes, what are the other projects/ecosystem stakeholders

No response

Describe the data being stored onto Filecoin

Our dataset is very large. It consists mainly of teaching videos, course materials, and documents for all kinds of courses.

Where was the data currently stored in this dataset sourced from

My Own Storage Infra

If you answered "Other" in the previous question, enter the details here

No response

If you are a data preparer. What is your location (Country/Region)

China

If you are a data preparer, how will the data be prepared? Please include tooling used and technical details?

We plan to work with professional data preparers to ensure the data is safe and reliable, and to arrange short-term or long-term storage that preserves the integrity of the data.

If you are not preparing the data, who will prepare the data? (Provide name and business)

No response

Has this dataset been stored on the Filecoin network before? If so, please explain and make the case why you would like to store this dataset again to the network. Provide details on preparation and/or SP distribution.

We previously applied for 1 PiB of DataCap. That data is only a small part of our total data. The company is developing steadily, our data keeps growing, and storage costs keep rising, so we need the support of the Filecoin network, which in turn helps promote Filecoin to the traditional Internet industry.

Please share a sample of the data

https://pan.baidu.com/s/1vOQ-h6lCCLvCXAY_PtsP1A?pwd=6jt9 
https://pan.baidu.com/s/1CbRHk1E-tiXULoJIgLawMw?pwd=iapx 
https://pan.baidu.com/s/1_qSCseiHfUPyXFumfgvsLQ?pwd=fgis

Confirm that this is a public dataset that can be retrieved by anyone on the Network

If you chose not to confirm, what was the reason

No response

What is the expected retrieval frequency for this data

Monthly

For how long do you plan to keep this dataset stored on Filecoin

Permanently

In which geographies do you plan on making storage deals

Greater China, Asia other than Greater China, Africa, North America, South America, Europe, Australia (continent)

How will you be distributing your data to storage providers

Cloud storage (i.e. S3), HTTP or FTP server, IPFS, Shipping hard drives, Lotus built-in data transfer

How did you find your storage providers

Slack, Filmine, Big Data Exchange, Partners

If you answered "Others" in the previous question, what is the tool or platform you used

No response

Please list the provider IDs and location of the storage providers you will be working with.

f02850067
fo2830476
fo2941888
fo2950965
fo2830451
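
The IDs listed above can be sanity-checked before they are looked up anywhere. The sketch below assumes that a mainnet storage provider ID is an ID address written as `f0` followed by decimal digits; the function name is hypothetical, and the check only covers the format, not whether the provider exists.

```python
import re

# Assumption: provider IDs are mainnet ID addresses, i.e. "f0" + decimal digits.
MINER_ID_RE = re.compile(r"f0[0-9]+")


def is_valid_miner_id(value: str) -> bool:
    """Return True if the string has the shape of a provider ID address."""
    return MINER_ID_RE.fullmatch(value) is not None


# IDs exactly as written in the application.
listed = ["f02850067", "fo2830476", "fo2941888", "fo2950965", "fo2830451"]
malformed = [sp for sp in listed if not is_valid_miner_id(sp)]

print("malformed IDs:", malformed)
```

Four of the five entries use the letter `o` where the digit `0` is expected, so they fail the format check as written.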

How do you plan to make deals to your storage providers

Boost client, Lotus client, Droplet client, Bidbot, Singularity

If you answered "Others/custom tool" in the previous question, enter the details here

No response

Can you confirm that you will follow the Fil+ guideline

Yes

datacap-bot[bot] commented 5 months ago

Application is waiting for allocator review

kevzak commented 5 months ago

Hello @1613557499 please clarify who are the SPs you are involved with:

You need to confirm a minimum of two entities and two locations: f02850067 fo2830476 fo2941888 fo2950965 fo2830451

kevzak commented 5 months ago

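Hello @1613557499 this application will require a business check to confirm your connection to K12 International Education Platform and their data. There are two options:

  • complete a third party KYB check via https://efilplus.synaps.me/signup (cost is $100) and they will check business license and complete KYC
  • complete a virtual zoom call with me, the data owner, and the lead Storage Provider.

Let me know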

1613557499 commented 5 months ago

> Hello @1613557499 this application will require a business check to confirm your connection to K12 International Education Platform and their data. There are two options:
>
>   • complete a third party KYB check via https://efilplus.synaps.me/signup (cost is $100) and they will check business license and complete KYC
>   • complete a virtual zoom call with me, the data owner, and the lead Storage Provider.
>
> Let me know

This was not required before. Is it necessary, and why do I need it? If I choose the second option, what will the call cover and what questions will be asked? My understanding is that we could hold the call and send you a recording, is that right? I still don't understand why this is so complicated. I look forward to your reply, thank you.

kevzak commented 5 months ago

Yes, this is an unknown user to me. I would need proof that @1613557499 is somehow associated with this dataset and that this company is in fact a registered legit company.

If you prefer, first fill this form with client info https://form.jotform.com/240786057753667 and I can review and contact client directly for more information.

Also, please complete a KYC sanctions check via https://filplus.storage/kyc you can login with this github and get verified. Thank you.

1613557499 commented 5 months ago

> Yes, this is an unknown user to me. I would need proof that @1613557499 is somehow associated with this dataset and that this company is in fact a registered legit company.
>
> If you prefer, first fill this form with client info https://form.jotform.com/240786057753667 and I can review and contact client directly for more information.
>
> Also, please complete a KYC sanctions check via https://filplus.storage/kyc you can login with this github and get verified. Thank you.

[image]

I have submitted the application, and I am still following up with the remaining SPs by chat.

kevzak commented 5 months ago

I sent an email @1613557499 please confirm

1613557499 commented 5 months ago

> I sent an email @1613557499 please confirm

Thank you for your reply. I just got a message that the recipient has replied. Could you approve my application?

kevzak commented 5 months ago

> > I sent an email @1613557499 please confirm
>
> Thank you for your reply. I just got the message that the recipient has replied, Could you approve my application?

They did not provide any of the information that I'm asking for.

To approve this application we need to:

  • review the dataset and size
  • confirm ownership (proof of employment, employer signoff, sharing the business license); we can set up a call as I mentioned
  • validate that storage of the data by the client/applicant is approved and that a contract with the SP(s) is in place

So far I have only been provided 3 SP IDs. None are set up for retrievals. Can you explain more about this and when it will be available?

  • f01970622 HongKong
  • f0832131 YangZhou
  • f01146045 Singapore

kevzak commented 5 months ago

You can see all the pathway guidelines here. https://github.com/fidlabs/Open-Data-Pathway/blob/main/README.md

I'm not trying to make this hard on you, just need to follow the guidelines.

datacap-bot[bot] commented 5 months ago

KYC has been requested. Please complete KYC at https://kyc.allocator.tech/?owner=fidlabs&repo=Open-Data-Pathway&client=f1dvulxgyfvnv3fqwjpelfhictonbf3vytdgao4xy&issue=37

1613557499 commented 5 months ago

[image]

  • f03143698, HongKong
  • f03143705, Shanghai
  • f0870354, Singapore
  • f01989372, HongKong

Hello,

  1. I have tried my best to meet your requirements. I was told that the materials have been sent to kevin@fidl.tech from the email address you confirmed earlier. Please check.
  2. In addition, sealing volume has dropped recently and our SPs have been unstable. The new SPs listed above have all expressed an intention to cooperate, and we are still seeking cooperation with other SPs. If my DataCap application is approved, it can be used immediately.
  3. I saw that there is a Gitcoin-based KYC, which I have already completed.

Looking forward to your reply, thank you.

1613557499 commented 5 months ago

> KYC has been requested. Please complete KYC at https://kyc.allocator.tech/?owner=fidlabs&repo=Open-Data-Pathway&client=f1dvulxgyfvnv3fqwjpelfhictonbf3vytdgao4xy&issue=37

@kevzak Hi, my account only has 2.99 points, which is all I could get from all the apps I tried.

kevzak commented 5 months ago

OK. Then the other available KYC option is:

  1. Go to filplus.storage
  2. Log in with your GitHub account (top right corner)
  3. Click on your Profile Icon (next to a bell icon)
  4. Choose "Confirm Identity"
  5. Scroll to about the middle of the page and follow the external link to toggle the third party check.
  6. You will need a mobile phone and an ID.

1613557499 commented 5 months ago

> OK. Then the other available KYC option is:
>
>   1. Go to filplus.storage
>   2. Log in with your GitHub account (top right corner)
>   3. Click on your Profile Icon (next to a bell icon)
>   4. Choose "Confirm Identity"
>   5. Scroll to about the middle of the page and follow the external link to toggle the third party check.
>   6. You will need a mobile phone and an ID.

[image]

@kevzak Hi, I've completed the verification. It was a very difficult process.

kevzak commented 5 months ago

OK great, thank you @1613557499 .

Please confirm SPs with retrievals that will be used for this dataset and we can move forward with a first allocation

1613557499 commented 5 months ago

  • f03143698, HongKong
  • f03143705, Shanghai
  • f0870354, Singapore
  • f01989372, HongKong

@kevzak We have communicated with these SPs and will prioritize them in the first round.
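
The "two locations minimum" part of the requirement stated earlier in the thread can be checked mechanically against the list above. This is a minimal sketch with a hypothetical helper name. It does not cover the "two entities" part, because the thread does not say which organization operates each SP.

```python
from collections import Counter

# SP IDs and locations exactly as listed in the comment above.
proposed_sps = [
    ("f03143698", "HongKong"),
    ("f03143705", "Shanghai"),
    ("f0870354", "Singapore"),
    ("f01989372", "HongKong"),
]


def count_by_location(providers):
    """Count how many proposed SPs sit in each stated location."""
    return Counter(location for _, location in providers)


locations = count_by_location(proposed_sps)
meets_location_minimum = len(locations) >= 2

print(dict(locations))
print("at least two locations:", meets_location_minimum)
```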

kevzak commented 5 months ago

These SPs do not have retrievals enabled; their IDs do not appear on the Spark Dashboard: https://spacemeridian.grafana.net/public-dashboards/32c03ae0d89748e3b08e0f08121caa14?orgId=1

Can you explain why?

1613557499 commented 4 months ago

> These SPs do not enable retrievals, the IDs are not available on Spark Dashboard: https://spacemeridian.grafana.net/public-dashboards/32c03ae0d89748e3b08e0f08121caa14?orgId=1
>
> can you explain why?

[image]

@kevzak I asked the SPs about this. They are newly opened SPs that have only just started sealing, and they hope to receive support in the first round. Let's look forward to the next steps and the results.

kevzak commented 4 months ago

@1613557499 I can allocate 50 TiB as a first allocation. Let's ensure SPs match and retrievals are enabled. Thank you

datacap-bot[bot] commented 4 months ago

Datacap Request Trigger

Total DataCap requested

3PiB

Expected weekly DataCap usage rate

512TiB

DataCap Amount - First Tranche

50TiB

Client address

f1dvulxgyfvnv3fqwjpelfhictonbf3vytdgao4xy

datacap-bot[bot] commented 4 months ago

DataCap Allocation requested

Multisig Notary address

Client address

f1dvulxgyfvnv3fqwjpelfhictonbf3vytdgao4xy

DataCap allocation requested

50TiB

Id

6aa94f4b-8f3f-44b5-8a0b-884a50590695

datacap-bot[bot] commented 4 months ago

Application is ready to sign

datacap-bot[bot] commented 4 months ago

Request Approved

Your Datacap Allocation Request has been approved by the Notary

Message sent to Filecoin Network

bafy2bzacedef3ibxya4uuboihshlovsaoodw5y7dovpjf65gtc6smceirggrc

Address

f1dvulxgyfvnv3fqwjpelfhictonbf3vytdgao4xy

Datacap Allocated

50TiB

Signer Address

f1v24knjbqv5p6qrmfjj5xmlaoddzqnon2oxkzkyq

Id

6aa94f4b-8f3f-44b5-8a0b-884a50590695

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacedef3ibxya4uuboihshlovsaoodw5y7dovpjf65gtc6smceirggrc

datacap-bot[bot] commented 4 months ago

Application is Granted

datacap-bot[bot] commented 4 months ago

Issue has been modified. Changes below:

(OLD vs NEW)

Total Requested Amount: 5PiB vs 3PiB

State: ChangesRequested vs Granted

datacap-bot[bot] commented 4 months ago

Issue has been modified. Changes below:

(OLD vs NEW)

Total Requested Amount: 3PiB vs 5PiB

datacap-bot[bot] commented 4 months ago

Issue has been modified. Changes below:

(OLD vs NEW)

Total Requested Amount: 5PiB vs 3PiB

kevzak commented 4 months ago

Also, can you please include documentation on how the data is transformed into deals for Filecoin?

When a deal is sampled for verification, how will we be able to confirm that it is part of this dataset? (How is it chunked into CAR files?)
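
One way to document the chunking step asked about above is to record which source files go into which chunk before the chunks are turned into CAR files. The sketch below is only an illustration under stated assumptions: the file names and sizes are hypothetical, the packing rule (fill a chunk in order until the next file would overflow it) is a simple stand-in, and building the actual CAR files and computing piece CIDs is left to the data preparation tool.

```python
from dataclasses import dataclass, field


@dataclass
class Chunk:
    index: int
    files: list = field(default_factory=list)  # list of (path, size_bytes)
    size_bytes: int = 0


def pack_files(files, max_chunk_bytes):
    """Pack files in order, starting a new chunk when the next file would overflow."""
    chunks = []
    current = Chunk(index=0)
    for path, size in files:
        if size > max_chunk_bytes:
            raise ValueError(f"{path} exceeds one chunk and would need splitting")
        if current.files and current.size_bytes + size > max_chunk_bytes:
            chunks.append(current)
            current = Chunk(index=len(chunks))
        current.files.append((path, size))
        current.size_bytes += size
    if current.files:
        chunks.append(current)
    return chunks


# Hypothetical inputs; real chunks would be sized to fit a sector.
example_files = [
    ("courses/math/lesson01.mp4", 600),
    ("courses/math/lesson01.pdf", 100),
    ("courses/physics/lesson01.mp4", 500),
]
chunks = pack_files(example_files, max_chunk_bytes=1000)

# The manifest maps every source file to the chunk that contains it.
manifest = {path: chunk.index for chunk in chunks for path, _ in chunk.files}
print(manifest)
```

Publishing a manifest like this alongside the piece CID of each chunk would let a reviewer trace a sampled deal back to the files it contains.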

1613557499 commented 3 months ago

> Also can you please include documentation on how the data is transformed into deals for filecoin?
>
> when a deal is sampled for verification, how will we be able to confirm that it is part of this dataset? (how is it chunked into car files?)

I don't quite understand this. I asked the SP, who said they use the normal packaging process.

1613557499 commented 3 months ago

checker:manualTrigger

datacap-bot[bot] commented 3 months ago

DataCap and CID Checker Report[^1]

No active deals found for this client.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

1613557499 commented 3 months ago

> No active deals found for this client.

Why are there no active deals yet? How do I start the next round of applications?

kevzak commented 3 months ago

@1613557499

We have successfully seen datasets be prepared for Filecoin, ranging from the Internet Archive, to Wikipedia, to the chain itself. You can either store the metadata structure above the individual file chunks, as is done by e.g. web3 storage, or you can have a separate well-advertised metadata layer via e.g. a website.

We want to see how a client could be able to make use of this dataset. Can you share details?

  • this could be a client script for how to iterate through / process over the data
  • this could be a web site allowing browsing / identification of specific pieces of data from the dataset as stored
  • this could be identification of clients making use of the data
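
As an illustration of the first option, a client script could read a published manifest and work out which stored piece holds a given file. The sketch below is hypothetical throughout: the manifest format, the file paths, and the placeholder piece CIDs are assumptions for the example, not something the applicant has published.

```python
import json

# Hypothetical manifest that a data owner might publish on their website.
MANIFEST_JSON = """
[
  {"path": "courses/math/lesson01.mp4", "piece_cid": "<piece-cid-A>"},
  {"path": "courses/math/lesson01.pdf", "piece_cid": "<piece-cid-A>"},
  {"path": "courses/physics/lesson01.mp4", "piece_cid": "<piece-cid-B>"}
]
"""


def load_manifest(text):
    """Index manifest entries by file path."""
    return {entry["path"]: entry for entry in json.loads(text)}


def find_piece(manifest, path):
    """Return the piece CID that contains the given file, or None if unknown."""
    entry = manifest.get(path)
    return entry["piece_cid"] if entry else None


def files_in_piece(manifest, piece_cid):
    """List every file stored in one piece, e.g. to verify a sampled deal."""
    return sorted(p for p, e in manifest.items() if e["piece_cid"] == piece_cid)


manifest = load_manifest(MANIFEST_JSON)
print(find_piece(manifest, "courses/physics/lesson01.mp4"))
print(files_in_piece(manifest, "<piece-cid-A>"))
```

The same lookup works in both directions: a client starts from a file name, while a reviewer sampling a deal starts from the piece CID.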

1613557499 commented 3 months ago

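> @1613557499
>
> We have successfully seen datasets be prepared for filecoin, ranging from internet archive, to wikipedia, to the chain itself. You can either store the metadata structure above the individual file chunks as is done by e.g. web3 storage, or you can have a separate well-advertised metadata layer via e.g. a website.
>
> We want to see how a client could be able to make use of this dataset. Can you share details?
>
> this could be a client script for how to iterate through / process over the data; this could be a web site allowing browsing / identification of specific pieces of data from the dataset as stored; this could be identification of clients making use of the data

We have packaged less than 50 TiB so far, so there is no way for clients to search the data yet.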

kevzak commented 3 months ago

> No active deals found for this client.
>
> Why is there no active trading yet? How do I start the next round of applications

Hi @1613557499 https://datacapstats.io/clients?filter=f1dvulxgyfvnv3fqwjpelfhictonbf3vytdgao4xy shows the deal usage thus far.

1613557499 commented 3 months ago

checker:manualTrigger

datacap-bot[bot] commented 3 months ago

DataCap and CID Checker Report[^1]

No active deals found for this client.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

1613557499 commented 3 months ago

@kevzak There is a transaction record, so why can't I open a new round of applications? I have already sealed more than 75%.

kevzak commented 3 months ago

@1613557499 this comment https://github.com/fidlabs/Open-Data-Pathway/issues/37#issuecomment-2271074326 is not asking for transaction record. We are asking how the client can retrieve the data. Please provide detail

1613557499 commented 3 months ago

> @1613557499 this comment #37 (comment) is not asking for transaction record. We are asking how the client can retrieve the data. Please provide detail

@kevzak Thank you for your reply. So at the moment you are not concerned with transaction records, and the result of the manual trigger is normal, right? As for how a client can retrieve the data: we can obtain the content by CID through the command line and then pass it to a front end for querying. The process is somewhat complicated and we need time to work on it. The end result will be that the content can be queried on our official website.

datacap-bot[bot] commented 3 months ago

Issue information change request has been approved.

datacap-bot[bot] commented 3 months ago

Client used 75% of the allocated DataCap. Consider allocating next tranche.

1613557499 commented 3 months ago

@kevzak I see that the information has changed, but there is no hint about the next step. What do I need to do next?

kevzak commented 3 months ago

@1613557499 normally after the client uses 75% of DataCap allocation, I complete my diligence checks.
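
The 75% threshold mentioned above comes down to a simple ratio. The following is only an illustrative sketch: the function names and the example remaining balances are hypothetical, and it does not reflect how datacap-bot is actually implemented.

```python
def usage_ratio(allocated_tib: float, remaining_tib: float) -> float:
    """Fraction of an allocation that has already been spent on deals."""
    if allocated_tib <= 0:
        raise ValueError("allocation must be positive")
    return (allocated_tib - remaining_tib) / allocated_tib


def ready_for_review(allocated_tib: float, remaining_tib: float,
                     threshold: float = 0.75) -> bool:
    """True once usage reaches the threshold that prompts the next diligence check."""
    return usage_ratio(allocated_tib, remaining_tib) >= threshold


first_tranche_tib = 50.0  # first allocation granted in this thread
for remaining in (20.0, 12.5, 5.0):  # hypothetical remaining balances
    print(remaining, ready_for_review(first_tranche_tib, remaining))
```

For a 50 TiB tranche, the threshold is reached once 12.5 TiB or less remains unspent.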

1613557499 commented 3 months ago

> @1613557499 normally after the client uses 75% of DataCap allocation, I complete my diligence checks.

@kevzak More than 75% of my allocated DataCap has been sealed. I haven't seen any further instructions, and the allocation for the next round has not been signed. Do I still need to find a notary? How much longer do I have to wait, or who do I need to talk to in order to get to the next step?

kevzak commented 3 months ago

@1613557499 can you confirm the SP miner IDs that received deals in this initial tranche?

1613557499 commented 3 months ago

@kevzak The client address is f1dvulxgyfvnv3fqwjpelfhictonbf3vytdgao4xy.

The SP's ID is 3143698.

datacap-bot[bot] commented 3 months ago

Application is in Refill

datacap-bot[bot] commented 3 months ago

Request Approved

Your Datacap Allocation Request has been approved by the Notary

Message sent to Filecoin Network

bafy2bzacecjyowxmxaew2dwyk35u54bs3h2ndhxnncita2cvqo3eajirwz6bw

Address

f1dvulxgyfvnv3fqwjpelfhictonbf3vytdgao4xy

Datacap Allocated

50TiB

Signer Address

f1v24knjbqv5p6qrmfjj5xmlaoddzqnon2oxkzkyq

Id

2a87c0c6-30d5-4323-bb05-e3505c94c25c

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacecjyowxmxaew2dwyk35u54bs3h2ndhxnncita2cvqo3eajirwz6bw