fidlabs / Open-Data-Pathway

6 stars 8 forks source link

[DataCap Application] <顶思TopSchools> - <Educational Data> #78

Open Atonia12 opened 2 months ago

Atonia12 commented 2 months ago

Data Owner Name

sun

Data Owner Country/Region

China

Data Owner Industry

Education & Training

Website

http://www.topschools.cn/schooldatabase

Social Media Handle

http://www.topschools.cn/schooldatabase

Social Media Type

WeChat

What is your role related to the dataset

Dataset Owner

Total amount of DataCap being requested

100TiB

Expected size of single dataset (one copy)

32G

Number of replicas to store

4

Weekly allocation of DataCap requested

50TiB

On-chain address for first allocation

f1oe3xl5llyiu42t6ewyfhen3efoa7a7xwk5346uq

Data Type of Application

Public, Open Commercial/Enterprise

Custom multisig

Identifier

No response

Share a brief history of your project and organization

TopTalents focuses on providing precise recruitment of talents in K12 international education industry and consulting services on human resources and talent development of international schools, and builds HR learning community by regularly holding international school human resources summit, researching human resources related topics and releasing reports on international school human resources and salary system. Provide talent increment for the development of international schools, advocate the healthy flow of talents, and support the optimization of human resource management of schools.

Is this project associated with other projects/ecosystem stakeholders?

No

If answered yes, what are the other projects/ecosystem stakeholders

No response

Describe the data being stored onto Filecoin

Educational resources video online courses for schools in various countries and regions

Where was the data currently stored in this dataset sourced from

My Own Storage Infra

If you answered "Other" in the previous question, enter the details here

No response

If you are a data preparer. What is your location (Country/Region)

China

If you are a data preparer, how will the data be prepared? Please include tooling used and technical details?

Our current data is stored on our server in the computer room
We send data to SP according to their needs, including sending hard drives, network transmission

If you are not preparing the data, who will prepare the data? (Provide name and business)

No response

Has this dataset been stored on the Filecoin network before? If so, please explain and make the case why you would like to store this dataset again to the network. Provide details on preparation and/or SP distribution.

No data is stored on the filecoin

Please share a sample of the data

链接: https://pan.baidu.com/s/1z8A7c9FKWIbnCe1pI5oCIg?pwd=6scf 提取码: 6scf 
链接: https://pan.baidu.com/s/1-9u07uutFct_kLK_RPT5xw?pwd=d6r9 提取码: d6r9 
链接: https://pan.baidu.com/s/16KIg3i5ZEX9HRo93Ln4Zhw?pwd=cjms 提取码: cjms

Confirm that this is a public dataset that can be retrieved by anyone on the Network

If you chose not to confirm, what was the reason

We self-check the data of customers visiting our website

What is the expected retrieval frequency for this data

Yearly

For how long do you plan to keep this dataset stored on Filecoin

Permanently

In which geographies do you plan on making storage deals

Greater China, North America, Australia (continent)

How will you be distributing your data to storage providers

Cloud storage (i.e. S3), HTTP or FTP server, IPFS, Shipping hard drives, Lotus built-in data transfer, Venus built-in data transfer

How did you find your storage providers

Slack, Filmine, Big Data Exchange, Partners

If you answered "Others" in the previous question, what is the tool or platform you used

No response

Please list the provider IDs and location of the storage providers you will be working with.

f03143698,HongKong
f03143705,Shanghai
f0870354,Singapore
f01989372,HongKong

How do you plan to make deals to your storage providers

Boost client, Lotus client, Droplet client, Bidbot, Singularity

If you answered "Others/custom tool" in the previous question, enter the details here

No response

Can you confirm that you will follow the Fil+ guideline

Yes

datacap-bot[bot] commented 2 months ago

Application is waiting for allocator review

martapiekarska commented 2 months ago

Hi! Thank you for your application. This looks identical to https://github.com/fidlabs/Open-Data-Pathway/issues/37 - how is your application different? Thanks!

Atonia12 commented 2 months ago

Hi! Thank you for your application. This looks identical to #37 - how is your application different? Thanks!

Thank you for your reply. We are the participants of filecoin ecosystem. After learning from experienced clients and distributors, we have learned how to use it properly. We want to redo this client, store the data on Filecoin, and work this data first with an experienced SP with a good track record.

Atonia12 commented 2 months ago

Hi! Thank you for your application. This looks identical to #37 - how is your application different? Thanks!

@martapiekarska We complete data storage through small data sets, trust our sincerity

martplo commented 1 month ago

@Atonia12 First, you apply to the Open-Data Pathway. As the name suggests, this data should be publicly available to everyone, but you indicated in the form that it is a private dataset. The SPs you listed probably do not support Spark. Their retrieval is unavailable. The data samples you provided are inconsistent; one link does not work, and one contains content other than declared.

Atonia12 commented 1 month ago

@Atonia12 First, you apply to the Open-Data Pathway. As the name suggests, this data should be publicly available to everyone, but you indicated in the form that it is a private dataset. The SPs you listed probably do not support Spark. Their retrieval is unavailable. The data samples you provided are inconsistent; one link does not work, and one contains content other than declared.

I have changed private to public, and the sp given above is their new id. According to the information I have learned, they used the latest DDO package before, resulting in no retrieval rate, and novices need time to adapt. I believe the ecology is getting better and better

f03179555、f03179570、f03178144

martplo commented 1 month ago

@Atonia12 Those SPs look good. Try to find another one so you use at least 4 SPs. okay, I'll give you 50 TiB and check your progress. Please follow the rules.

datacap-bot[bot] commented 1 month ago

KYC has been requested. Please complete KYC at https://kyc.allocator.tech/?owner=fidlabs&repo=Open-Data-Pathway&client=f1oe3xl5llyiu42t6ewyfhen3efoa7a7xwk5346uq&issue=78

martplo commented 1 month ago

@Atonia12 I still want to clarify the data issue. I already mentioned that it is not fully compliant (see my comment on this topic), but I have a few additional concerns.

As you mentioned, this is Open-Data-Pathway, which means that this data should be available for download by anyone, for example from aws or another storage service.

What you shared looks like files selected by you from local storage, which is what you write here:

Our current data is stored on our server in the computer room We send data to SP according to their needs, including sending hard drives, network transmission

Before we start working together, I want you to correct your data to be compliant with the rules.

By the way, please read our policy https://github.com/fidlabs/Open-Data-Pathway/wiki/Policies

Atonia12 commented 1 month ago

已请求 KYC。请在https://kyc.allocator.tech/?owner=fidlabs&repo=Open-Data-Pathway&client=f1oe3xl5llyiu42t6ewyfhen3efoa7a7xwk5346uq&issue=78完成 KYC

I have some data is there, but not given validation I am not sure what is going on ![Uploading 10-5.png…]()

root@master:/data1# du -sh * 33T 8_24post_105.163_32T ![Uploading 10-5-2.png…]()

I can't seem to upload a photo. What's wrong?

martplo commented 1 month ago

@Atonia12 Maybe the images didn't upload before you hit the comment button. Try adding the images again and wait for them to appear in the comment preview.

Atonia12 commented 1 month ago

@Atonia12 Maybe the images didn't upload before you hit the comment button. Try adding the images again and wait for them to appear in the comment preview.

10-5-2 10-5 I switched to a different browser and it worked,

martplo commented 1 month ago

@Atonia12, you need to have a score of at least 20 to pass the KYC check. Please, try again.

Atonia12 commented 1 month ago

https://kyc.allocator.tech/?owner=fidlabs&repo=Open-Data-Pathway&client=f1oe3xl5llyiu42t6ewyfhen3efoa7a7xwk5346uq&issue=78完成 KYC

image image

Hi,My account only has 3 points, which is all I can get for all the apps I tried,My app has an account but it doesn't score,Is there any other way to verify it?

martplo commented 1 month ago

@Atonia12 Gitcoin passport is our KYC method. If you don't know how to go through the steps, try referring to the guides in the documentation Without verifying your identity using the official method, we won't be able to proceed. Sorry.

Atonia12 commented 1 month ago

@Atonia12 Gitcoin passport is our KYC method. If you don't know how to go through the steps, try referring to the guides in the documentation Without verifying your identity using the official method, we won't be able to proceed. Sorry.

image image

@martapiekarska I have babt, and he's not connecting, so I think my score is a perfect 20

martplo commented 1 month ago

@Atonia12 , we do not accept any other KYC than the one set in our rules. I already mentioned that if you cannot reach a score of 20 with a gitcoin passport, we cannot cooperate. I suggest you try to follow tutorial from the documentation of how to add BABT into gitcoin passport, but if no luck I'll close this application. Sorry.

Atonia12 commented 1 month ago

@Atonia12 , we do not accept any other KYC than the one set in our rules. I already mentioned that if you cannot reach a score of 20 with a gitcoin passport, we cannot cooperate. I suggest you try to follow tutorial from the documentation of how to add BABT into gitcoin passport, but if no luck I'll close this application. Sorry.

OK,I keep trying.

Atonia12 commented 1 month ago

@Atonia12 , we do not accept any other KYC than the one set in our rules. I already mentioned that if you cannot reach a score of 20 with a gitcoin passport, we cannot cooperate. I suggest you try to follow tutorial from the documentation of how to add BABT into gitcoin passport, but if no luck I'll close this application. Sorry.

OK,I keep trying.

I don't know why I have to have this point. Doesn't filecoin have its own verification method? This is the point of my confusion. Anyway, now that I've reached the minimum score, can you help me move forward?Thanks!

image
martplo commented 1 month ago

@Atonia12 I don't fully understand your doubts. I hope that when applying to us you read our rules for accepting clients. We clearly wrote that we perform KYC using kyc.allocator.tech.

I am glad that you managed to achieve a score above 20 with gitcoin passport. Please complete the process by connecting your wallet to the application. You can do this at the link in this comment: https://github.com/fidlabs/Open-Data-Pathway/issues/78#issuecomment-2383002729

Atonia12 commented 1 month ago

https://kyc.allocator.tech/?owner=fidlabs&repo=Open-Data-Pathway&client=f1oe3xl5llyiu42t6ewyfhen3efoa7a7xwk5346uq&issue=78

What I mean is that it is not reasonable for us to spend one month on verification. The certification platform has nothing to do with filecoin, so it can't help filecoin achieve any merit. This is a waste of time. We can join filecoin's warehouse on github in accordance with the previous certification, which will help filecoin's ecological development grow with data and make the public have greater confidence in filecoin

image image

Why are the same wallet scores different?

martplo commented 1 month ago

You tell me why this is different. If you connected the same wallet, it shouldn't be a difference. Are you using the same wallet as the one connected to the gitcoin? I won't accept your KYC based on the score presented with a screenshot.

The previous KYC method is no longer available. A Gitcoin passport is now the official KYC method. Also, if you were previously accepted but now use a different wallet, you still need to pass the KYC.

Atonia12 commented 1 month ago

You tell me why this is different. If you connected the same wallet, it shouldn't be a difference. Are you using the same wallet as the one connected to the gitcoin? I won't accept your KYC based on the score presented with a screenshot.

The previous KYC method is no longer available. A Gitcoin passport is now the official KYC method. Also, if you were previously accepted but now use a different wallet, you still need to pass the KYC.

I changed a different browser to try, the same wallet address to get the score is inconsistent, I do not know what the reason, can you help me to see?I can provide my wallet address information.

My address:0xE24e77660f46968042d04ed3B154479d93183BF1

image image image

These signs all show that my address belongs to me, and my wallet scored 20 points in passport, but it was less than 20 points in kyc. I only applied for 100T, and all my information is true and valid. I think this difference is a bug in kyc process, and I am willing to accept any investigation

martplo commented 1 month ago

@Atonia12, please go to https://app.passport.xyz/#/dashboard. At the bottom, there is a button titled "bring passport onchain." Please, click it. That should do the work. Then, go back to the KYC.allocator.tech if it won't finish automatically.

Atonia12 commented 1 month ago

@Atonia12,请访问https://app.passport.xyz/#/dashboard。底部有一个名为“将护照带上链”的按钮。请点击它。这应该可以完成工作。然后,如果它不会自动完成,请返回 KYC.allocator.tech。

image

@martapiekarska Finally, it can be achieved, and it takes a sum of gas to achieve it. I hope my 100T can proceed smoothly. Thank you for your careful guidance all the time

martplo commented 1 month ago

@Atonia12 Thank you for your efforts. Did you have a chance to take a look at this comment? https://github.com/fidlabs/Open-Data-Pathway/issues/78#issuecomment-2385492042

martplo commented 3 weeks ago

@Atonia12 any news?

Atonia12 commented 3 weeks ago

@Atonia12 Thank you for your efforts. Did you have a chance to take a look at this comment? #78 (comment)

Http://107.174.172.102:/DC_data.tar.gz There are too many things to deal with recently, I didn't notice your reply. Sorry, this is our data I put on a cloud, I would like to ask if 100T also needs to be distributed according to 5 rounds?

willscott commented 3 weeks ago

Hi,

This is a link to a 1.5GB tar.gz file. This is neither the format that you would be using to store data on filecoin, nor a link to the overall dataset as far as I can tell.

Atonia12 commented 3 weeks ago

Hi,

This is a link to a 1.5GB tar.gz file. This is neither the format that you would be using to store data on filecoin, nor a link to the overall dataset as far as I can tell.

Have you opened the zip file to see what's inside? Each company has different storage methods, or if you give me detailed steps, the first step and the second step should be done. I have applied for 100T for several months, how can you promote the development of the project with such a centralized process, and who else can use filecoin? You should study how to easily and quickly use the retrieval.

willscott commented 3 weeks ago

You have applied to a 'public open data pathway' that is meant for storing existing public online data.

when asked what and how the data will be stored, you provided a compressed link without explanation.

This is not sufficient for us as the allocator to justify that we are giving out datacap fulfilling the purpose of this pathway.

Atonia12 commented 2 weeks ago

Hi,

This is a link to a 1.5GB tar.gz file. This is neither the format that you would be using to store data on filecoin, nor a link to the overall dataset as far as I can tell.

The format stored on file needs sp to deal with it. I just need to transfer the data to him. What do you mean by the data pointing to the data set? The customer group of my company is students, this is the data we use daily, I just randomly selected part of the data

martplo commented 6 days ago

@Atonia12 Sorry for not getting back to you sooner.

The source dataset must already be online - the allocator must be able to browse to and observe any part of that dataset at their discretion. The dataset must be accessible for any web viewer without gating on credentials.

We strive to make the data you want to store in Filecoin publicly available. This means that everyone should have access to it, be able to look through it all, choose a set, and download it. We would like to be able to choose this random set ourselves and check that the amount of data you declare is consistent with the actual state.

When a piece is downloaded from the stored dataset, there must be a way to identify what part of the original dataset that piece represents and confirm that it is indeed valid data from the dataset. (this could be e.g. provided via a log file indicating the mapping of offsets / files into stored deals)

This is what "data pointing to the dataset" refers to.

aliff1322 commented 16 hours ago

hi