filecoin-project / filecoin-plus-large-datasets

Hub for client applications for DataCap at a large scale

[DataCap Application] Mazidon Inc. Large Dataset Notary Application #1 #421

Closed mazidoninc closed 2 years ago

mazidoninc commented 2 years ago

Large Dataset Notary Application

To apply for DataCap to onboard your dataset to Filecoin, please fill out the following.

Core Information

Project details

Share a brief history of your project and organization.

Mazidon Inc is an MSP that specializes in AI & Security and works mostly with oil & gas customers. We have accumulated a large number of CAD & SEGY files (an image format used for seismic imaging) that we will back up with the Filecoin network.

What is the primary source of funding for this project?

Mazidon Inc.

What other projects/ecosystem stakeholders is this project associated with?

The primary stakeholders are Mazidon customers. These include drilling & engineering firms and oil and gas companies that need to retain seismic and engineering data long-term.

Use-case details

Describe the data being stored onto Filecoin

Seismic data images are critical to understanding how the earth's crust is evolving over time.

Where was the data in this dataset sourced from?

Archived customer data 

Can you share a sample of the data? A link to a file, an image, a table, etc., are good ways to do this.

A private link will be sent

Confirm that this is a public dataset that can be retrieved by anyone on the Network (i.e., no specific permissions or access rights are required to view the data).

Not currently open to the public. This data is in the process of being preserved by Mazidon Inc. with an eye toward future public use.

What is the expected retrieval frequency for this data?

Near zero, currently 

For how long do you plan to keep this dataset stored on Filecoin?

5 years

DataCap allocation plan

In which geographies (countries, regions) do you plan on making storage deals?

North America, EU, and Australia if possible.

How will you be distributing your data to storage providers? Is there an offline data transfer process?

Offline data ingest by shipping hard drives initially, then hosting an S3 bucket for SPs to use.

How do you plan on choosing the storage providers with whom you will be making deals? This should include a plan to ensure the data is retrievable in the future both by you and others.

We will verify the other SPs we choose to work with. Mazidon will make the data available for S3 ingest to those SPs.

How will you be distributing deals across storage providers?

S3 bucket

Do you have the resources/funding to start making deals as soon as you receive DataCap? What support from the community would help you onboard onto Filecoin?


Yes, and none currently.

large-datacap-requests[bot] commented 2 years ago

Thanks for your request! :exclamation: We have found some problems in the information provided.

cryptowhizzard commented 2 years ago

Can you please take some time to give us some insight into your company? I took the liberty of looking up your website, but it seems you are not in the oil and gas industry.

[Screenshot: 2022-06-20 10:24:35]

https://web.archive.org/web/20220211093107/http://mazidon.com/

[Screenshot: 2022-06-20 10:29:28]

Sunnyiscoming commented 2 years ago

@mazidoninc

  1. The website cannot be accessed. [image]
  2. Please provide an on-chain address for the first allocation.
  3. Please provide data samples to prove that you have 4 PB of data.

large-datacap-requests[bot] commented 2 years ago

Thanks for your request! Everything looks good. :ok_hand:

A Governance Team member will review the information provided and get back to you soon.

mazidoninc commented 2 years ago

Hello,

We have completed our site migration. Mazidon is a managed service provider; our primary focus is oil and gas companies in the western United States. We recently learned about Filecoin and have added it to our service offerings.

We have updated our DataCap request to 5 copies of 230 TiB (1,150 TiB, about 1.12 PiB) to start this project. Additionally, the other DataCap requests have been closed. We have 4 PB of raw data in total that we would like to seal; however, we have learned that 230 TiB is a much more manageable starting point.

Wallet address: f1qc66l4h5bmcc3wvxm7laky5ixevfjofhg5r3aka

What is the best way to share sample files?
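For anyone checking the numbers, the requested total (5 copies of 230 TiB) can be sanity-checked with a short calculation. This is only an illustrative sketch: the function name `total_datacap_bytes` is hypothetical, and the point is simply that binary units (TiB, PiB) and decimal units (PB) give different figures for the same amount.

```python
# Illustrative sketch only; the helper name below is hypothetical.
TIB = 2**40   # bytes in one tebibyte (binary unit)
PIB = 2**50   # bytes in one pebibyte (binary unit)
PB = 10**15   # bytes in one petabyte (decimal unit)


def total_datacap_bytes(copies: int, tib_per_copy: int) -> int:
    """Return the total requested size in bytes for N copies of a dataset."""
    return copies * tib_per_copy * TIB


total = total_datacap_bytes(copies=5, tib_per_copy=230)

print(total // TIB)           # total in TiB: 1150
print(round(total / PIB, 3))  # total in PiB: 1.123
print(round(total / PB, 3))   # total in decimal PB: 1.264
```

The same request therefore reads as 1,150 TiB, roughly 1.12 PiB, or roughly 1.26 PB, depending on the unit used.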

Sunnyiscoming commented 2 years ago

Perhaps more than 5 data samples of seismic data images, each smaller than 1 GB, so that they can be checked easily.

mazidoninc commented 2 years ago

@Sunnyiscoming okay, we will prepare a few samples. I will update the thread when they are ready.

mazidoninc commented 2 years ago

We have prepared a sample data set for review with a public GDrive link. There are two different projects with more than the requested 5 SEGY sample files, a total of 8 GB of data. Please let me know when you have reviewed the files.

https://drive.google.com/drive/folders/1Ug8ythKP9TsSln7pETRYARdICyYtAqcT?usp=sharing

Additionally, we are looking for trustworthy storage providers to partner with in the United States, EU, and Australia. Can you please recommend a few providers to partner with?

Sunnyiscoming commented 2 years ago

Mazidon Inc is an MSP that specializes in AI & Security and works mostly with oil & gas customers. We have accumulated a large number of CAD & SEGY files (an image format used for seismic imaging) that we will back up with the Filecoin network.

As you said, Mazidon Inc is an MSP that specializes in AI & Security. So are these data samples from your customers? Have you been authorized by them to share the data?

Sunnyiscoming commented 2 years ago

Any update here?

raghavrmadya commented 2 years ago

@mazidoninc, the governance team does not have a high enough level of confidence to move forward with this application. If you still wish to pursue it, kindly open a new application and provide sufficient evidence for KYC. An email from an official company email address to filplus-app-review@fil.org, along with information establishing your relationship with the company, is needed. Thanks

[Screenshot: 2022-08-21 3:37:46 PM]