filecoin-project / filecoin-plus-large-datasets

Hub for client applications for DataCap at a large scale

[DataCap Application] StoneMoonGalaxy - <USGS-Environmental Monitoring Dataset> [1/6] #1755

Closed StoneMoonGalaxy closed 10 months ago

StoneMoonGalaxy commented 1 year ago

Data Owner Name

USGS

Data Owner Country/Region

United States

Data Owner Industry

Environment

Website

https://earthexplorer.usgs.gov/ ; https://glovis.usgs.gov/

Social Media

https://twitter.com/StoneMoonGalaxy

Total amount of DataCap being requested

5PiB

Weekly allocation of DataCap requested

450TiB

On-chain address for first allocation

f1zz7tilbyzlrrzcgzrac4yuyv6ykxgdc5tfll5ei

Custom multisig

Identifier

none

Share a brief history of your project and organization

Stone Moon is a platform that provides storage solutions, focusing on Filecoin storage and FVM smart contract application development.
Since relaunching the project in Australia, we have integrated a large number of storage providers and infrastructure service providers in order to help build the Filecoin ecosystem, promote the storage and retrieval of real data, meet strong market demand, and bring more decentralized storage enthusiasts into the network. To this end, we will work tirelessly to find new and useful data and bring it to the network.

Is this project associated with other projects/ecosystem stakeholders?

No

If answered yes, what are the other projects/ecosystem stakeholders

none

Describe the data being stored onto Filecoin

The dataset is a collection of public data provided by the U.S. Geological Survey (USGS), including geological, environmental, and especially natural disaster data. Each collection contains a different type of data. The collections include:

Landsat: This dataset includes high-resolution satellite imagery of the Earth's surface, collected by the Landsat program. It covers over 40 years of imagery, and is useful for studying changes in land use, natural disasters, and more.

3D Elevation Program (3DEP): This dataset provides high-resolution elevation data for the entire United States, including both bare earth and surface features such as vegetation and buildings.

Sentinel-2 series: The USGS manages and distributes data from the Sentinel-2 mission through its EarthExplorer and GloVis portals, providing free access to images for researchers, scientists and the public. It is also incorporated into the Landsat Archives of the United States Geological Survey, and both data sets are used for a variety of applications, such as land use and land cover mapping, environmental monitoring, and natural resource management.

Where was the data currently stored in this dataset sourced from

AWS Cloud

If you answered "Other" in the previous question, enter the details here

No response

How do you plan to prepare the dataset

others/custom tool

If you answered "other/custom tool" in the previous question, enter the details here

Self-developed packing tool

Please share a sample of the data

Landsat:
s3://usgs-landsat/collection02/
s3://deafrica-landsat/
s3://astrogeo-ard/mars/mo/themis/controlled_mosaics/
s3://astrogeo-ard/jupiter/europa/galileo_voyager/usgs_controlled_mosaics/
s3://astrogeo-ard/jupiter/europa/galileo_voyager/usgs_controlled_observations/
s3://usgs-lidar-uscities/
USGS 3DEP
s3://usgs-lidar-public/
Sentinel-2
s3://sentinel-s2-l1c/
s3://sentinel-cogs/
s3://esa-worldcover/
s3://deafrica-services/gm_s2_annual/
s3://deafrica-sentinel-2/
s3://deafrica-services/crop_mask/
s3://bdc-sentinel-2/
s3://io-10m-annual-lulc/
s3://sentinel-s2-l2a-mosaic-120/
s3://radiant-mlhub/c2smsfloods/
s3://sentinel-products-ca-mirror/

Confirm that this is a public dataset that can be retrieved by anyone on the Network

If you chose not to confirm, what was the reason

none

What is the expected retrieval frequency for this data

Sporadic

For how long do you plan to keep this dataset stored on Filecoin

1.5 to 2 years

In which geographies do you plan on making storage deals

Greater China, Asia other than Greater China, North America, Europe, Australia (continent)

How will you be distributing your data to storage providers

Cloud storage (i.e. S3), HTTP or FTP server, Shipping hard drives, Others

How do you plan to choose storage providers

Slack, Partners, Others

If you answered "Others" in the previous question, what is the tool or platform you plan to use

none

If you already have a list of storage providers to work with, fill out their names and provider IDs below

| MinerID | org | region |
| --- | --- | --- |
| f02058976 | Open Gate | AU |
| f0442671 | Open Gate | AU |
| f0515461 | Friday engine inc. | US |
| f0818235 | xu xiong | CN |
| f0443184 | WITMIND SYSTEM TECHNOLOGY LIMITED | HK |
| f02003553 | WITMIND SYSTEM TECHNOLOGY LIMITED | HK |

How do you plan to make deals to your storage providers

Lotus client

If you answered "Others/custom tool" in the previous question, enter the details here

none

Can you confirm that you will follow the Fil+ guideline

Yes

DirectionTechnology commented 1 year ago

According to the applicant's past records, the current allocation is normal and reasonable. I support this batch.

DirectionTechnology commented 1 year ago

Request Proposed

Your Datacap Allocation Request has been proposed by the Notary

Message sent to Filecoin Network

bafy2bzacea3hhqsb3utxb43ab25zpgi7gake4rnwgjzddmohtsvva3ansb4fw

Address

f1zz7tilbyzlrrzcgzrac4yuyv6ykxgdc5tfll5ei

Datacap Allocated

1.75PiB

Signer Address

f1inkdoatsbfumdvpctxbgcatscewr3rus5pxmsgi

Id

d1b4e5ff-54a7-45fd-a58f-c6a3eeb10e0e

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacea3hhqsb3utxb43ab25zpgi7gake4rnwgjzddmohtsvva3ansb4fw

METAVERSEDATAMINING commented 1 year ago

I will continue to monitor the next storage round based on the applicant's explanation. The retrieval feedback is good, so I support this round.

METAVERSEDATAMINING commented 1 year ago

Request Approved

Your Datacap Allocation Request has been approved by the Notary

Message sent to Filecoin Network

bafy2bzacecql76dgefzhsn7vcit23qpp7jt6x7acwrtvuuoratxd2jy377kng

Address

f1zz7tilbyzlrrzcgzrac4yuyv6ykxgdc5tfll5ei

Datacap Allocated

1.75PiB

Signer Address

f17idrnfnxl2mbgcgr57a6z2c6lj2qx56gvm3336i

Id

d1b4e5ff-54a7-45fd-a58f-c6a3eeb10e0e

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacecql76dgefzhsn7vcit23qpp7jt6x7acwrtvuuoratxd2jy377kng

StoneMoonGalaxy commented 1 year ago

Preparing data...

StoneMoonGalaxy commented 1 year ago

Preparing data...

herrehesse commented 1 year ago

checker:manualTrigger

filplus-checker-app[bot] commented 1 year ago

DataCap and CID Checker Report Summary[^1]

Retrieval Statistics

Storage Provider Distribution

✔️ Storage provider distribution looks healthy.

Deal Data Replication

✔️ Data replication looks healthy.

Deal Data Shared with other Clients[^3]

✔️ No CID sharing has been observed.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

[^2]: Deals from those addresses are combined into this report as they are specified with checker:manualTrigger

[^3]: To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

Full report

Click here to view the CID Checker report. Click here to view the Retrieval report.

kevzak commented 1 year ago

Hi @StoneMoonGalaxy you mentioned these were the SP entities you'd be storing with above.

| MinerID | org | region |
| --- | --- | --- |
| f02058976 | Open Gate | AU |
| f0442671 | Open Gate | AU |
| f0515461 | Friday engine inc. | US |
| f0818235 | xu xiong | CN |
| f0443184 | WITMIND SYSTEM TECHNOLOGY LIMITED | HK |
| f02003553 | WITMIND SYSTEM TECHNOLOGY LIMITED | HK |

From CID report, I see one that matches from the initial list. f0442671 and it shows in HK, not AU.

Can you explain?

Can you explain which other SP businesses you are using and whether they are using a VPN?

kevzak commented 1 year ago

Also we have a new, optional KYC ID check available on filplus.storage. If you want to add additional layer of trust you can complete an ID check and verify this GitHub account. See details: LINK

StoneMoonGalaxy commented 1 year ago

Hello, @kevzak based on the issue you raised, we have conducted a retrospective investigation and contacted the previous SPs to verify the situation one by one. The problem has been identified. Initially, when filling out the application, we recorded the complete information of each SP, including the miner ID, miner organization, organization region, and IP region, as shown in the table below:

| MinerID | org | org region | ip region |
| --- | --- | --- | --- |
| f02058976 | Open Gate | AU | AU |
| f0442671 | Open Gate | AU | HK |
| f0515461 | Friday engine inc. | US | US |
| f0818235 | xu xiong | CN | CN |
| f0443184 | WITMIND SYSTEM TECHNOLOGY LIMITED | HK | HK |
| f02003553 | WITMIND SYSTEM TECHNOLOGY LIMITED | HK | HK |

However, a deviation occurred during the final submission, which was an unintentional transcription error.

StoneMoonGalaxy commented 1 year ago

Currently, we are still in the data preparation stage and are also in continuous contact with the next batch of potential partner SPs.

herrehesse commented 1 year ago

@StoneMoonGalaxy, it is evident from our analysis that you are engaged in self-dealing by exclusively providing data to your own miners through VPN, falsely creating the appearance of global distribution. Deal amounts are nearly identical and the miner IDs follow one another closely. The chance that these are separate entities is zero.

Additionally, the merging of datasets violates our guidelines.

| sp | client | totaldeals | ip address | location | vpnresults | percent |
| --- | --- | --- | --- | --- | --- | --- |
| f0442671 | f02063280 | 109882443300864 | 18.162.90.101 | US, Central and Western, Hong Kong | TRUE with fraudscore 75 | 5 |
| f02031264 | f02063280 | 109951162777600 | 52.77.205.10 | SG, Singapore, Singapore | TRUE with fraudscore 75 | 5 |
| f02032453 | f02063280 | 503919922905088 | 3.28.28.100 | AE, Dubai, Dubai | TRUE with fraudscore 75 | 23 |
| f02048808 | f02063280 | 503919922905088 | 54.207.232.14 | BR, Sao Paulo, São Paulo | TRUE with fraudscore 75 | 23 |
| f02052252 | f02063280 | 503919922905088 | 44.231.31.109 | US, Oregon, Boardman | TRUE with fraudscore 75 | 23 |
| f01980952 | f02063280 | 505294312439808 | 183.222.164.239 | CN, Sichuan, Chengdu | FALSE with fraudscore 0 | 23 |

We recommend closing your application and revoking your remaining datacap.
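For anyone checking the table, the `totaldeals` column appears to be a byte count and the `percent` column appears to be each provider's share of the listed total, rounded to a whole number. A minimal Python sketch under those two assumptions (the names `total_deals`, `to_tib`, and `shares` are illustrative and not part of any checker tooling):

```python
# Assumption: totaldeals is in bytes and percent is the rounded share
# of the sum over the six listed providers.
TIB = 2**40  # bytes per TiB

# Values copied from the sp / totaldeals columns of the table.
total_deals = {
    "f0442671": 109882443300864,
    "f02031264": 109951162777600,
    "f02032453": 503919922905088,
    "f02048808": 503919922905088,
    "f02052252": 503919922905088,
    "f01980952": 505294312439808,
}


def to_tib(num_bytes: int) -> float:
    """Convert a byte count to TiB."""
    return num_bytes / TIB


def shares(deals: dict[str, int]) -> dict[str, int]:
    """Return each provider's share of the total, rounded to a whole percent."""
    total = sum(deals.values())
    return {sp: round(100 * size / total) for sp, size in deals.items()}


if __name__ == "__main__":
    pct = shares(total_deals)
    for sp, size in total_deals.items():
        print(f"{sp}: {to_tib(size):.4f} TiB, {pct[sp]}%")
```

Under these assumptions the rounded shares reproduce the `percent` column, and three providers hold exactly 458.3125 TiB each.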

github-actions[bot] commented 1 year ago

This application has not seen any responses in the last 10 days. This issue will be marked with Stale label and will be closed in 4 days. Comment if you want to keep this application open.

StoneMoonGalaxy commented 1 year ago

Already responded

github-actions[bot] commented 1 year ago

This application has not seen any responses in the last 10 days. This issue will be marked with Stale label and will be closed in 4 days. Comment if you want to keep this application open.

StoneMoonGalaxy commented 1 year ago

The discussion regarding this application has undergone multiple rounds of appeals, and no further objections have been raised. Therefore, we will proceed with the subsequent project processes.

StoneMoonGalaxy commented 1 year ago

WIP

StoneMoonGalaxy commented 1 year ago

WIP

large-datacap-requests[bot] commented 1 year ago

DataCap Allocation requested

Request number 5

Multisig Notary address

f02049625

Client address

f1zz7tilbyzlrrzcgzrac4yuyv6ykxgdc5tfll5ei

DataCap allocation requested

1.71PiB

Id

0a0e9bae-feb1-467e-a9d1-11bec6e7d6dd

large-datacap-requests[bot] commented 1 year ago

Stats & Info for DataCap Allocation

Multisig Notary address

f02049625

Client address

f1zz7tilbyzlrrzcgzrac4yuyv6ykxgdc5tfll5ei

Rule to calculate the allocation request amount

400% of weekly dc amount requested

DataCap allocation requested

1.71PiB

Total DataCap granted for client so far

1.629814505577088e+36YiB

Datacap to be granted to reach the total amount requested by the client (5PiB)

1.629814505577088e+36YiB

Stats

| Number of deals | Number of storage providers | Previous DC Allocated | Top provider | Remaining DC |
| --- | --- | --- | --- | --- |
| 47034 | 8 | 1.75PiB | 18.48 | 879.96TiB |

cryptowhizzard commented 1 year ago

Dear StoneMoonGalaxy,

As notary I am doing due diligence on your LDN. I could not get retrieval to work. Can you please upload the car file of CID baga6ea4seaqksy4yj47zfgbeiwc2vmrhd3ptatw7gvw5z4p24vpny3qvn2n2opy ?

You can use our upload system at http://send.datasetcreators.com. Please select 7 days for the system to keep the file and post the link you received here so I (and other notaries) can download your content.

StoneMoonGalaxy commented 1 year ago

WIP

github-actions[bot] commented 1 year ago

This application has not seen any responses in the last 10 days. This issue will be marked with Stale label and will be closed in 4 days. Comment if you want to keep this application open.

-- Commented by Stale Bot.

StoneMoonGalaxy commented 1 year ago

WIP

StoneMoonGalaxy commented 1 year ago

WIP

github-actions[bot] commented 12 months ago

This application has not seen any responses in the last 10 days. This issue will be marked with Stale label and will be closed in 4 days. Comment if you want to keep this application open.

-- Commented by Stale Bot.

StoneMoonGalaxy commented 12 months ago

Please keep it open.

github-actions[bot] commented 11 months ago

This application has not seen any responses in the last 10 days. This issue will be marked with Stale label and will be closed in 4 days. Comment if you want to keep this application open.

-- Commented by Stale Bot.

StoneMoonGalaxy commented 11 months ago

Please keep it open.

StoneMoonGalaxy commented 11 months ago

WIP

StoneMoonGalaxy commented 11 months ago

WIP

Sunnyiscoming commented 10 months ago

Hello, @StoneMoonGalaxy per the https://github.com/filecoin-project/notary-governance/issues/922 for Open, Public Dataset applicants, please complete the following Fil+ registration form to identify yourself as the applicant and also please add the contact information of the SP entities you are working with to store copies of the data.

This information will be reviewed by Fil+ Governance team to confirm validity and then the application will be allowed to move forward for additional notary review.

StoneMoonGalaxy commented 10 months ago

checker:manualTrigger

filplus-checker-app[bot] commented 10 months ago

DataCap and CID Checker Report Summary[^1]

Storage Provider Distribution

✔️ Storage provider distribution looks healthy.

Deal Data Replication

✔️ Data replication looks healthy.

Deal Data Shared with other Clients[^3]

✔️ No CID sharing has been observed.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

[^2]: Deals from those addresses are combined into this report as they are specified with checker:manualTrigger

[^3]: To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

Full report

Click here to view the CID Checker report. Click here to view the Retrieval Dashboard.

StoneMoonGalaxy commented 10 months ago

The form has been updated. Here is the additional info. image

ghost commented 10 months ago

SPs taking deals:
f02052252 | Boardman, Oregon, US | Amazon.com, Inc. | 458.31 TiB | 14.21% | 458.31 TiB | 0.00%
f02048808 | São Paulo, São Paulo, BR | Amazon.com, Inc. | 458.19 TiB | 14.20% | 458.19 TiB | 0.00%
f02032453 | Abu Dhabi, Abu Dhabi, AE | Amazon.com, Inc. | 457.75 TiB | 14.19% | 457.75 TiB | 0.00%
f02363300 | Tokyo, Tokyo, JP | Amazon.com, Inc. | 262.78 TiB | 8.15% | 262.78 TiB | 0.00%
f02031264 | Singapore, Singapore, SG | Amazon.com, Inc. | 99.94 TiB | 3.10% | 99.94 TiB | 0.00%
f0442671 | Hong Kong, Central and Western, HK | Amazon.com, Inc. | 99.88 TiB | 3.10% | 99.88 TiB | 0.00%
f01980952 | Chengdu, Sichuan, CN | China Mobile Communications Group Co., Ltd. | 459.56 TiB | 14.25% | 459.44 TiB | 0.03%
f02829749 | Shenzhen, Guangdong, CN | CTGNet | 262.03 TiB | 8.12% | 262.03 TiB | 0.00%
f02830476 | Shenzhen, Guangdong, CN | CTGNet | 258.09 TiB | 8.00% | 258.09 TiB | 0.00%
f02829748 | Shenzhen, Guangdong, CN | CTGNet | 227.88 TiB | 7.06% | 227.88 TiB | 0.00%
f02122388 | Shenzhen, Guangdong, CN | CTGNet | 181.56 TiB | 5.63% | 181.56 TiB | 0.00%

SPs don't match. Closing until a clear SP entity and distribution is provided
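The mismatch can be checked mechanically. A minimal Python sketch, assuming the provider IDs declared in the application table and the IDs, network operators, and first TiB figure per provider listed in this comment are transcribed correctly, and assuming that first TiB figure is the provider's total deal size (all names in the code are illustrative):

```python
# Provider IDs declared in the application's storage provider table.
declared = {
    "f02058976", "f0442671", "f0515461",
    "f0818235", "f0443184", "f02003553",
}

# (provider ID, network operator, first TiB figure) as listed in this comment.
# Assumption: the TiB figure is the provider's total deal size.
observed = [
    ("f02052252", "Amazon.com, Inc.", 458.31),
    ("f02048808", "Amazon.com, Inc.", 458.19),
    ("f02032453", "Amazon.com, Inc.", 457.75),
    ("f02363300", "Amazon.com, Inc.", 262.78),
    ("f02031264", "Amazon.com, Inc.", 99.94),
    ("f0442671", "Amazon.com, Inc.", 99.88),
    ("f01980952", "China Mobile Communications Group Co., Ltd.", 459.56),
    ("f02829749", "CTGNet", 262.03),
    ("f02830476", "CTGNet", 258.09),
    ("f02829748", "CTGNet", 227.88),
    ("f02122388", "CTGNet", 181.56),
]


def overlap(declared_ids: set[str], rows: list[tuple[str, str, float]]) -> set[str]:
    """Return provider IDs that appear both in the declared and observed lists."""
    return declared_ids & {sp for sp, _, _ in rows}


def tib_by_operator(rows: list[tuple[str, str, float]]) -> dict[str, float]:
    """Sum the TiB figures per network operator."""
    totals: dict[str, float] = {}
    for _, operator, tib in rows:
        totals[operator] = totals.get(operator, 0.0) + tib
    return totals


if __name__ == "__main__":
    print("declared and observed:", sorted(overlap(declared, observed)))
    for operator, tib in tib_by_operator(observed).items():
        print(f"{operator}: {tib:.2f} TiB")
```

With the figures as listed, only f0442671 appears in both lists, and the remaining providers group under three network operators.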

StoneMoonGalaxy commented 10 months ago

@Filplus-govteam @Sunnyiscoming

Please verify again

image

StoneMoonGalaxy commented 10 months ago

Hope to make the SP list field in the form editable, so that it can be dynamically adjusted based on the quantity.

StoneMoonGalaxy commented 10 months ago

@Filplus-govteam @Sunnyiscoming

ghost commented 10 months ago

@StoneMoonGalaxy it's hard to confirm SP entity and location with just a list of names and cities. Please provide a way to validate the location of the SPs.

Also, how many copies of this data are you planning to store? There are 6 applications for 30 PiB. Let's combine them into one application; it's easier to manage.