liyunzhi-666 / TianjiStudio-Fil

0 stars 0 forks source link

[DataCap Application] <LAMOST DR8 public data> #6

Open zhangmiao112257 opened 1 day ago

zhangmiao112257 commented 1 day ago

Data Owner Name

zhang miao

Data Owner Country/Region

China

Data Owner Industry

Environment

Website

http://www.lamost.org/dr8/

Social Media Handle

http://www.lamost.org/dr8/

Social Media Type

Other

What is your role related to the dataset

Data Preparer

Total amount of DataCap being requested

18PiB

Expected size of single dataset (one copy)

2.5PiB

Number of replicas to store

7

Weekly allocation of DataCap requested

1000TiB

On-chain address for first allocation

f1ifpe2rmywletnc5zwtntrf3tmf5abey4j5hu5ca

Data Type of Application

Public, Open Dataset (Research/Non-Profit)

Custom multisig

Identifier

No response

Share a brief history of your project and organization

The Large Sky Area Multi-Object Fiber Spectroscopic Telescope (LAMOST) is a Chinese national scientific research facility operated by the National Astronomical Observatories, Chinese Academy of Sciences. It is a special reflecting Schmidt telescope with 4000 fibers in a field of view of 20 deg2 in the sky. Until July 2019, LAMOST has completed its pilot survey, which was launched in October 2011 and ended in June 2012, and the regular survey of the first seven years, which was initiated on September 2012[1-7]. In this data release, there are totally 10,431,197 low resolution spectra published, which satisfy the selection criteria that the LAMOST LRS General Catalog also used. The data products of this release can be available from the website http://www.lamost.org/dr8/.

Guoshoujing Telescope (the Large Sky Area Multi-Object Fiber Spectroscopic Telescope LAMOST) is a National Major Scientific Project built by the Chinese Academy of Sciences. Funding for the project has been provided by the National Development and Reform Commission. LAMOST is operated and managed by the National Astronomical Observatories, Chinese Academy of Sciences.

Is this project associated with other projects/ecosystem stakeholders?

No

If answered yes, what are the other projects/ecosystem stakeholders

No response

Describe the data being stored onto Filecoin

1. LAMOST Data includes Three Major Types:
Type (I):
Raw Data: All original data as well as original provenance information (for example, the observing log files, calibration files, software versions used, etc.), and the batch reduced two-dimensional spectra.
Type (II):
1D Spectral Data: One-dimensional spectra of observed objects, reduced through standardized reduction pipelines. Some provenance information is included with the 1D spectra, including the input catalog information, selection criteria and observing information such as exposure time, observation quality, seeing, weather conditions, and so on).
Type (III):
Catalog Data: Objective physical quantities with errors, derived from the spectral data and input catalog. The catalog includes the coordinates, magnitudes, radial velocities, effective temperature, surface gravity, elemental abundances, warning flags and so on.

Where was the data currently stored in this dataset sourced from

AWS Cloud

If you answered "Other" in the previous question, enter the details here

No response

If you are a data preparer. What is your location (Country/Region)

None

If you are a data preparer, how will the data be prepared? Please include tooling used and technical details?

No response

If you are not preparing the data, who will prepare the data? (Provide name and business)

No response

Has this dataset been stored on the Filecoin network before? If so, please explain and make the case why you would like to store this dataset again to the network. Provide details on preparation and/or SP distribution.

No response

Please share a sample of the data

http://www.lamost.org/dr8/

Confirm that this is a public dataset that can be retrieved by anyone on the Network

If you chose not to confirm, what was the reason

No response

What is the expected retrieval frequency for this data

Sporadic

For how long do you plan to keep this dataset stored on Filecoin

1.5 to 2 years

In which geographies do you plan on making storage deals

Greater China, Asia other than Greater China, Africa, North America, South America, Europe, Australia (continent), Antarctica

How will you be distributing your data to storage providers

HTTP or FTP server, Shipping hard drives

How did you find your storage providers

Slack, Big Data Exchange, Partners

If you answered "Others" in the previous question, what is the tool or platform you used

No response

Please list the provider IDs and location of the storage providers you will be working with.

f01518369
f01889668
f03151456
f03179572
f03214937
f03178144
f03178077
f01106668 
f0870558 
f03151449
f03151456
f03229932 
f03229933
f03151449
f1315096 
f03055005 
f03055018 
f03055029

How do you plan to make deals to your storage providers

Boost client, Lotus client, Singularity

If you answered "Others/custom tool" in the previous question, enter the details here

No response

Can you confirm that you will follow the Fil+ guideline

Yes

datacap-bot[bot] commented 1 day ago

Application is waiting for allocator review

liyunzhi-666 commented 1 day ago

I have a few more questions for you to confirm before reviewing your application. @zhangmiao112257 1.Have you prepared enough token for sector pledge? 2.Are you a data preparer? What is your previous experience as a data-preparer? List previous applications and client IDs 3.How will the data be prepared? Please include tooling used and technical details 4.If you are not preparing the data, who will prepare the data? (Name and Business) 5.Has this dataset been stored on Filecoin before? If so, why are you choosing to store it again? 6.Best practice for storing large datasets includes ideally, storing it in 3 or more regions, with 4 or more storage provider operators or owners.You should list Miner ID, Business Entity, Location of sps you will cooperate with.

  1. Why are you applying for 18 PiB of Datacap? Please explain further.

Please send an email to verify authenticity, which should include your application and name. patricckk886@gmail.com

zhangmiao112257 commented 18 hours ago

As far as I know, the SPs I contacted have already prepared at least 80% of the filecoin pledged coins, as a data preparer, mainly through the official tools (Singularity, boost, lotus), the dataset I applied for has been stored relatively little so far, and I asked the SPs about it, and they told me that they hadn't stored that dataset. The SPs being approached are already listed in the application list.

zhangmiao112257 commented 18 hours ago
image

We've looked at the data size of one spectrum, which is around 180MB, and according to publicly available data DR8 has a total of 10633515 + 4510193 = 15143708 15143708180MB/32G7 backup=17.77PiB Datacap

That is: 2.5PiB per data, 7 backups, totaling about 18PiB data

zhangmiao112257 commented 18 hours ago
image

please check the E-mail.

liyunzhi-666 commented 8 hours ago

This looks good, I have received your KYC email and am willing to support the first round, as per Allocator approval rules, the first round will be approved for 18PiB*5% = 921.6TiB which I will round up and issue 900TiB.

image image

Given the large amount for the first round, I hope you are operating in compliance, otherwise I will limit the approval for the second round.

datacap-bot[bot] commented 8 hours ago

Datacap Request Trigger

Total DataCap requested

18PiB

Expected weekly DataCap usage rate

1000TiB

DataCap Amount - First Tranche

900TiB

Client address

f1ifpe2rmywletnc5zwtntrf3tmf5abey4j5hu5ca

datacap-bot[bot] commented 8 hours ago

DataCap Allocation requested

Multisig Notary address

Client address

f1ifpe2rmywletnc5zwtntrf3tmf5abey4j5hu5ca

DataCap allocation requested

900TiB

Id

613d6585-7d64-4e1d-8950-3f40cd3ba60a

datacap-bot[bot] commented 8 hours ago

Application is ready to sign

datacap-bot[bot] commented 8 hours ago

Request Approved

Your Datacap Allocation Request has been approved by the Notary

Message sent to Filecoin Network

bafy2bzacecmfjbzxm7l2osnonduxpqgu72adw73j2svidkbwk2r35quxw7u2w

Address

f1ifpe2rmywletnc5zwtntrf3tmf5abey4j5hu5ca

Datacap Allocated

900TiB

Signer Address

f12ytronnjfel3otbml2xrycb64pbexvdi5ecysda

Id

613d6585-7d64-4e1d-8950-3f40cd3ba60a

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacecmfjbzxm7l2osnonduxpqgu72adw73j2svidkbwk2r35quxw7u2w

datacap-bot[bot] commented 8 hours ago

Application is Granted