filecoin-project / filecoin-plus-large-datasets

Hub for client applications for DataCap at a large scale

[DataCap Allocation] - DataCap Application #1101

Closed StorageLabs closed 1 year ago

StorageLabs commented 1 year ago

name: Large Dataset Notary application
about: Clients should use this application form to request a DataCap allocation via an LDN for a dataset
title: "DataCap Application"
labels: 'application, Phase: Diligence'
assignees: ''


Large Dataset Notary Application

To apply for DataCap to onboard your dataset to Filecoin, please fill out the following.

Core Information

Please respond to the questions below by replacing the text saying "Please answer here". Include as much detail as you can in your answer.

Project details

Share a brief history of your project and organization.

StorageLabs is a non-profit organization that generates large volumes of data from experiments, observations, and calculations led by our scientific research team. We have carried out several international projects, such as the American Illustris Project and the British EAGLE project, and are currently focused on the ELUCID project. ELUCID (Exploring the Local Universe with the Reconstructed Initial Density Field) reconstructs the initial linear density field from an input nonlinear density field, employing the Hamiltonian Markov Chain Monte Carlo (HMC) algorithm combined with particle-mesh (PM) dynamics. The project produces large volumes of data computationally, based on physical formulas; the data are mainly snapshots of parameters.

What is the primary source of funding for this project?

We will fund this project with Slingshot rewards and social donations.

What other projects/ecosystem stakeholders is this project associated with?

No other stakeholders involved.

Use-case details

Describe the data being stored onto Filecoin

The data is mainly about galaxy formation and cosmological evolution. Using cosmological modeling, we can predict what the universe looks like at different ages since the Big Bang. Considerable scientific effort has gone into producing huge datasets that describe the predicted universe at different redshifts. For now, the data consists mainly of parameter snapshots from the ELUCID project.

Where was the data in this dataset sourced from?

Researchers generate large volumes of scientific data from experiments, observations, and calculations with the help of computers.
Part of the data is currently stored in a centralized data center, but most of it has to be discarded because of limited storage capacity. We are therefore looking for decentralized storage providers that offer more space and easy access.

Can you share a sample of the data? A link to a file, an image, a table, etc., are good ways to do this.

https://universe.storagelabs.io/data/snapshot_099/
https://cpb-us-e1.wpmucdn.com/sites.northwestern.edu/dist/6/84/files/2013/12/FIREbox_z2_512_slow-ry595s.mp4
https://fire.northwestern.edu/files/2013/12/z2_512_gas_stars_fahrt1-1aks7mn.m4v?_=2

Confirm that this is a public dataset that can be retrieved by anyone on the Network (i.e., no specific permissions or access rights are required to view the data).

We confirm that the public has access to the dataset without any restriction.

What is the expected retrieval frequency for this data?

1-2 times per year.

For how long do you plan to keep this dataset stored on Filecoin?

The current plan is to keep the dataset on Filecoin for 1-3 years; the duration may be extended afterwards.

DataCap allocation plan

In which geographies (countries, regions) do you plan on making storage deals?

Please answer here.

How will you be distributing your data to storage providers? Is there an offline data transfer process?

Please answer here.

How do you plan on choosing the storage providers with whom you will be making deals? This should include a plan to ensure the data is retrievable in the future both by you and others.

Please answer here.

How will you be distributing deals across storage providers?

Please answer here.

Do you have the resources/funding to start making deals as soon as you receive DataCap? What support from the community would help you onboard onto Filecoin?

Please answer here.

large-datacap-requests[bot] commented 1 year ago

Thanks for your request! Everything looks good. :ok_hand:

A Governance Team member will review the information provided and get back to you soon.

large-datacap-requests[bot] commented 7 months ago

Thanks for your request! :exclamation: We have found some problems in the information provided.
We could not find Organization Name field in the information provided.
We could not find Website / Social Media field in the information provided.
We could not find Total amount of DataCap being requested (between 500 TiB and 5 PiB) field in the information provided.
We could not find Weekly allocation of DataCap requested (usually between 1-100TiB) field in the information provided.
We could not find On-chain address for first allocation field in the information provided.
We could not find Data Type of Application field in the information provided.

Please take a look at the request and edit the body of the issue to provide all the required information.