filecoin-project / filecoin-plus-large-datasets

Hub for client applications for DataCap at a large scale

[DataCap Application] <CAO> - <Sentinel-1 SLC dataset-02> #2096

Closed caoxinhe0108 closed 1 year ago

caoxinhe0108 commented 1 year ago

Data Owner Name

LiveEO

What is your role related to the dataset

Dataset Owner

Data Owner Country/Region

Afghanistan

Data Owner Industry

Life Science / Healthcare

Website

https://www.live-eo.com/

Social Media

Twitter---https://twitter.com/liveeo_space
Linkedin---https://www.linkedin.com/company/liveeo/

Total amount of DataCap being requested

10PiB

Expected size of single dataset (one copy)

64G

Number of replicas to store

10

Weekly allocation of DataCap requested

1PiB

On-chain address for first allocation

f1ynv3f5ne5k7xxr7z5q7coro7rqpls3ilhuz3ndy

Data Type of Application

Public, Open Dataset (Research/Non-Profit)

Custom multisig

Identifier

No response

Share a brief history of your project and organization

LiveEO was born from the idea that satellite data, a vast and almost untapped resource, has enormous potential to transform our lives, and the world around us, for the better. Sven and Daniel, two NewSpace enthusiasts with a keen interest in A.I., founded LiveEO in 2018 with a simple vision: to apply this incredible trove of data about our planet to improve the way we do business, and live our lives.

Is this project associated with other projects/ecosystem stakeholders?

Yes

If answered yes, what are the other projects/ecosystem stakeholders

No response

Describe the data being stored onto Filecoin

LiveEO was born from the idea that satellite data, a vast and almost untapped resource, has enormous potential to transform our lives, and the world around us, for the better. Sven and Daniel, two NewSpace enthusiasts with a keen interest in A.I., founded LiveEO in 2018 with a simple vision: to apply this incredible trove of data about our planet to improve the way we do business, and live our lives.

Where was the data currently stored in this dataset sourced from

AWS Cloud

If you answered "Other" in the previous question, enter the details here

No response

How do you plan to prepare the dataset

No response

If you answered "other/custom tool" in the previous question, enter the details here

No response

Please share a sample of the data

aws s3 ls --no-sign-request s3://sentinel1-slc/
Total Objects: 5961934
1.1 PiB

Confirm that this is a public dataset that can be retrieved by anyone on the Network

If you chose not to confirm, what was the reason

No response

What is the expected retrieval frequency for this data

Yearly

For how long do you plan to keep this dataset stored on Filecoin

2 to 3 years

In which geographies do you plan on making storage deals

Greater China, Asia other than Greater China, North America, South America

How will you be distributing your data to storage providers

Cloud storage (i.e. S3)

How do you plan to choose storage providers

Slack, Big Data Exchange

If you answered "Others" in the previous question, what is the tool or platform you plan to use

No response

If you already have a list of storage providers to work with, fill out their names and provider IDs below

No response

How do you plan to make deals to your storage providers

No response

If you answered "Others/custom tool" in the previous question, enter the details here

No response

Can you confirm that you will follow the Fil+ guideline

Yes

large-datacap-requests[bot] commented 1 year ago

Thanks for your request! Everything looks good. :ok_hand:

A Governance Team member will review the information provided and contact you back pretty soon.

caoxinhe0108 commented 1 year ago

The bot was not triggered on #2093, so I am resubmitting here.

Although this project already has many applications, there are almost no fully stored applications, and many of them have low retrieval rates. Therefore, I want to comprehensively store this project and provide a good retrieval experience to contribute to FIL+. You can check all the LDNs and health levels I have previously applied for. Thank you.

herrehesse commented 1 year ago

@caoxinhe0108, kindly ensure that you close your previous application before opening a duplicate. The argument you presented lacks substance as we are currently engaged in facilitating the HTTP retrieval for SPs and clients. It is not valid to claim that all other stored copies are flawed and therefore require a 10x repetition from yourself.

Your previous allocations were flawed and primarily stored within the same region. In regards to your commitment about the geographies in which you plan to make storage deals, they include Greater China, other parts of Asia excluding Greater China, North America, and South America.

I request that all notaries refrain from signing at this time.

Sunnyiscoming commented 1 year ago

Best practice for storing large datasets is, ideally, to store them in 3 or more regions, with 4 or more storage provider operators or owners. Please list the Miner ID, business entity, and location of the SPs you will cooperate with.

caoxinhe0108 commented 1 year ago

  • f02050599 Bang Sai, Phra Nakhon Si Ayutthaya, TH
  • f02063202 Mueang Nonthaburi, Nonthaburi, TH
  • f02064089 Singapore, Singapore, SG
  • f02045964 Sham Shui Po, Sham Shui Po, HK
  • f02058557 Kuala Lumpur, Kuala Lumpur, MY
  • f02055638 Kuala Lumpur, Kuala Lumpur, MY
  • f02048990 Hong Kong, Central and Western, HK
  • f02041085 Tokyo, Tokyo, JP
  • f02056391 Hong Kong, Central and Western, HK
  • f02053449 Hong Kong, Central and Western, HK
  • f02044834 Singapore, Singapore, SG
  • f02046736 Curug, Banten, ID
  • f02059055 Seoul, Seoul, KR
  • f02029895 Seoul, Seoul, KR
  • f01170282 Hong Kong, Central and Western, HK
  • f02030031 Singapore, Singapore, SG
  • f02042992 Singapore, Singapore, SG

These are the SPs we are currently in contact with, along with their details. Thank you for your review.
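For reviewers who want to compare the list against the regional-distribution guidance, here is a minimal sketch that tallies the listed miner IDs by country code. The miner IDs and country codes are copied from the list above; the variable and function names are illustrative only. It counts countries, not Fil+ regions, and it cannot say anything about operators or owners, because the list does not include business entities.

```python
from collections import Counter

# Miner IDs and country codes as given in the list above.
SPS = [
    ("f02050599", "TH"), ("f02063202", "TH"), ("f02064089", "SG"),
    ("f02045964", "HK"), ("f02058557", "MY"), ("f02055638", "MY"),
    ("f02048990", "HK"), ("f02041085", "JP"), ("f02056391", "HK"),
    ("f02053449", "HK"), ("f02044834", "SG"), ("f02046736", "ID"),
    ("f02059055", "KR"), ("f02029895", "KR"), ("f01170282", "HK"),
    ("f02030031", "SG"), ("f02042992", "SG"),
]


def count_by_country(sps):
    """Return a Counter mapping country code -> number of listed miner IDs."""
    return Counter(country for _miner_id, country in sps)


counts = count_by_country(SPS)
print(f"{len(SPS)} miner IDs across {len(counts)} country codes")
for country, n in counts.most_common():
    print(f"{country}: {n}")
```

Every country code in the list is in Asia, so the tally does not by itself show coverage of the North or South American geographies named in the application.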

caoxinhe0108 commented 1 year ago

Hello, I checked the historical application records before submitting this application. You can view those records yourself, as well as my own application records and packaging reports. I am also contacting SPs in North America, where a backup copy will be stored as well. Thank you for your question.

herrehesse commented 1 year ago

What are their respective SP names?

caoxinhe0108 commented 1 year ago

Due to various reasons, not all SPs are willing to disclose their information, so I am not entirely aware of it. However, if you can retrieve the information of these nodes, I will also check their information. If they do not meet our standards, we will not cooperate.

herrehesse commented 1 year ago

@caoxinhe0108 It is not your standards that matter, it's the community standards that matter, including rules and guidelines from the Filecoin+ ecosystem.

caoxinhe0108 commented 1 year ago

Yes, I have always followed the community standards. Thank you. Let us uphold them together so that FIL+ can keep moving forward.

Sunnyiscoming commented 1 year ago

  • Have you prepared enough tokens for sector pledge?
  • Are you a data preparer? What is your previous experience as a data-preparer? List previous applications and client IDs
  • How will the data be prepared? Please include tooling used and technical details
  • If you are not preparing the data, who will prepare the data? (Name and Business)
  • Has this dataset been stored on Filecoin before? If so, why are you choosing to store it again?

caoxinhe0108 commented 1 year ago

Hello,

1: We have prepared approximately 50% of the tokens and are still seeking other collaborations.
2: We have a collaborative data preparation team; you can check #1652, which I applied for previously.
3: We will first download the data from AWS and then package it with the Singularity tool, which automatically packs the source data into CAR files that do not exceed 64GB. Singularity's order frequency will be controlled based on the packaging speed of each SP. In addition, all SPs need to use Boost to import the CAR files and turn on the retrieval function.
4: I have checked the previously stored LDNs, and most of them have stored only a small part of the data. Some have not even started yet. You can check the LDNs I previously applied for; most of those have been completed, so I want to complete the storage of this dataset as well.

Thank you for your review.
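To make the packaging step in point 3 concrete, here is a minimal sketch of grouping source files into batches that stay under a 64GB ceiling, one batch per CAR file. This is not the packaging tool's actual algorithm or API. The function name `plan_batches`, the greedy in-order strategy, and reading "64GB" as 64 GiB are all assumptions made for illustration.

```python
from typing import Iterable, List, Tuple

# Assumption: the "64GB" limit is interpreted as 64 GiB.
MAX_BATCH_BYTES = 64 * 1024**3

FileEntry = Tuple[str, int]  # (object key, size in bytes)


def plan_batches(files: Iterable[FileEntry],
                 limit: int = MAX_BATCH_BYTES) -> List[List[FileEntry]]:
    """Greedily group files, in listing order, into batches of at most `limit` bytes.

    Hypothetical helper: a real preparation tool would typically split
    oversized files; this sketch simply rejects them.
    """
    batches: List[List[FileEntry]] = []
    current: List[FileEntry] = []
    current_size = 0
    for name, size in files:
        if size > limit:
            raise ValueError(f"{name} is larger than the batch limit")
        # Start a new batch when adding this file would exceed the limit.
        if current and current_size + size > limit:
            batches.append(current)
            current, current_size = [], 0
        current.append((name, size))
        current_size += size
    if current:
        batches.append(current)
    return batches


GIB = 1024**3
example = [("a.zip", 40 * GIB), ("b.zip", 30 * GIB), ("c.zip", 20 * GIB)]
for i, batch in enumerate(plan_batches(example)):
    total_gib = sum(size for _name, size in batch) / GIB
    print(f"batch {i}: {[name for name, _size in batch]} ({total_gib:.0f} GiB)")
```

Keeping the files in listing order makes the batch plan reproducible from the same object listing, which helps when someone else needs to verify which source files ended up in which batch.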

herrehesse commented 1 year ago

@caoxinhe0108,

I can confirm that there is at least one complete copy, as we collaborated on one of the previous efforts.

Your assertion that you have thoroughly checked all LDNs and are certain that only a small portion is stored is a falsehood. You do not possess the preparer's index files, which are necessary for verifying the integrity of a fully stored dataset. Why are you engaging in such deceptive behavior?

As a community, we must never tolerate or condone this kind of behaviour on our chain. It is crucial that we maintain the integrity and trustworthiness of our ecosystem.

caoxinhe0108 commented 1 year ago

If you have a relatively complete copy, please list your LDN and the ID of the LDN you collaborated on. I said I had read most of them, so what did I deceive you about? Why do you so often verbally attack others and act as if you are always right? Please pay attention to your wording! You can check the comments the rest of the community has made about you. Are these really other people's problems?

caoxinhe0108 commented 1 year ago

@Sunnyiscoming Please check my answer to see if there are any further questions?

herrehesse commented 1 year ago

You assert that you have thoroughly researched all copies and can confidently affirm that there are no complete versions available. However, you expect me to provide evidence to support my claims.

The community strictly prohibits a multitude of copies and asks notaries to abstain from involvement.

ghost commented 1 year ago

Hello @caoxinhe0108 per the new guidelines https://github.com/filecoin-project/notary-governance/issues/922 for Open Dataset applicants, please complete the following Fil+ registration form to identify yourself as the applicant and also please add the contact information of the SP entities you are working with to store copies of the data.

This information will be reviewed by Fil+ Governance team to confirm validity and then the application will be triggered for notary review. Let us know if you have any questions.

caoxinhe0108 commented 1 year ago

Haven't you noticed the inconsistency in your own statements? You said that you collaborated on storing this data. Now that you cannot show it yourself, isn't this fraudulent behavior? To prove that it is not, please list the LDNs you collaborated on and show them to us.

herrehesse commented 1 year ago

@caoxinhe0108, I wish you the best of luck attempting to deceive me with gaslighting. However, let's clarify the situation moving forward:

To justify the need for additional copies, provide a clear explanation or present substantial evidence of the failures of previous storage attempts. Remember, 10PiB is a significant amount and should not be taken lightly by anyone.

Thanks for being transparent; this is the only way to get into the quality phase of Filecoin+.

caoxinhe0108 commented 1 year ago

@herrehesse Why can't I apply for 10P for this public data set? You can all apply for 120P LDN. Why can't you allow others to apply for 10P? For almost all applications, you are blocking. I hope you can stop this behavior that harms the community. Finally, I hope you can answer my above questions.

herrehesse commented 1 year ago

@caoxinhe0108 Why aren't you answering any of my questions here?

cryptowhizzard commented 1 year ago

Hello @caoxinhe0108 per the new guidelines filecoin-project/notary-governance#922 for Open Dataset applicants, please complete the following Fil+ registration form to identify yourself as the applicant and also please add the contact information of the SP entities you are working with to store copies of the data.

This information will be reviewed by Fil+ Governance team to confirm validity and then the application will be triggered for notary review. Let us know if you have any questions.

Let's focus back on this please.

caoxinhe0108 commented 1 year ago

That's what I said to you. Please take a look at the record above. Didn't I answer your question? It is you who have been deceiving and preventing others from obtaining the quota, which fully demonstrates that you are indiscriminately accusing and affecting the entire community. Isn't anyone in charge of this kind of thing? @raghavrmadya @galen-mcandrew

cryptowhizzard commented 1 year ago

@caoxinhe0108

New issues need to go through PL for verification.

Per the filecoin-project/notary-governance#922 for Open, Public Dataset applicants, please complete the following Fil+ registration form to identify yourself as the applicant and also please add the contact information of the SP entities you are working with to store copies of the data. This information will be reviewed by Fil+ Governance team to confirm validity and then the application will be approved for notary review. Let us know if you have any questions.

@Filplus-govteam

Stanton15 commented 1 year ago

Ignore these two liars, they're just trying to sell DC to Chinese SPs.

Sunnyiscoming commented 1 year ago

Have you submitted the Fil+ registration form?

Sunnyiscoming commented 1 year ago

Close for no reply.