liyunzhi-666 / TianjiStudio-Fil


[DataCap Application] <LAMOST DR8 public data> #6

Open zhangmiao112257 opened 1 day ago

zhangmiao112257 commented 1 day ago

Data Owner Name

zhang miao

Data Owner Country/Region


Data Owner Industry



Social Media Handle

Social Media Type


What is your role related to the dataset

Data Preparer

Total amount of DataCap being requested


Expected size of single dataset (one copy)


Number of replicas to store


Weekly allocation of DataCap requested


On-chain address for first allocation


Data Type of Application

Public, Open Dataset (Research/Non-Profit)

Custom multisig


No response

Share a brief history of your project and organization

The Large Sky Area Multi-Object Fiber Spectroscopic Telescope (LAMOST) is a Chinese national scientific research facility operated by the National Astronomical Observatories, Chinese Academy of Sciences. It is a special reflecting Schmidt telescope with 4000 fibers in a field of view of 20 deg² on the sky. By July 2019, LAMOST had completed its pilot survey, which ran from October 2011 to June 2012, and the first seven years of its regular survey, which began in September 2012 [1-7]. This data release publishes a total of 10,431,197 low-resolution spectra that satisfy the selection criteria also used by the LAMOST LRS General Catalog. The data products of this release are available from the website

Guoshoujing Telescope (the Large Sky Area Multi-Object Fiber Spectroscopic Telescope LAMOST) is a National Major Scientific Project built by the Chinese Academy of Sciences. Funding for the project has been provided by the National Development and Reform Commission. LAMOST is operated and managed by the National Astronomical Observatories, Chinese Academy of Sciences.

Is this project associated with other projects/ecosystem stakeholders?


If answered yes, what are the other projects/ecosystem stakeholders

No response

Describe the data being stored onto Filecoin

1. LAMOST Data includes Three Major Types:
Type (I):
Raw Data: All original data as well as original provenance information (for example, the observing log files, calibration files, software versions used, etc.), and the batch reduced two-dimensional spectra.
Type (II):
1D Spectral Data: One-dimensional spectra of observed objects, reduced through standardized reduction pipelines. Some provenance information is included with the 1D spectra, including the input catalog information, selection criteria, and observing information such as exposure time, observation quality, seeing, and weather conditions.
Type (III):
Catalog Data: Objective physical quantities with errors, derived from the spectral data and input catalog. The catalog includes the coordinates, magnitudes, radial velocities, effective temperature, surface gravity, elemental abundances, warning flags and so on.

Where was the data currently stored in this dataset sourced from

AWS Cloud

If you answered "Other" in the previous question, enter the details here

No response

If you are a data preparer. What is your location (Country/Region)


If you are a data preparer, how will the data be prepared? Please include tooling used and technical details?

No response

If you are not preparing the data, who will prepare the data? (Provide name and business)

No response

Has this dataset been stored on the Filecoin network before? If so, please explain and make the case why you would like to store this dataset again to the network. Provide details on preparation and/or SP distribution.

No response

Please share a sample of the data

Confirm that this is a public dataset that can be retrieved by anyone on the Network

If you chose not to confirm, what was the reason

No response

What is the expected retrieval frequency for this data


For how long do you plan to keep this dataset stored on Filecoin

1.5 to 2 years

In which geographies do you plan on making storage deals

Greater China, Asia other than Greater China, Africa, North America, South America, Europe, Australia (continent), Antarctica

How will you be distributing your data to storage providers

HTTP or FTP server, Shipping hard drives

How did you find your storage providers

Slack, Big Data Exchange, Partners

If you answered "Others" in the previous question, what is the tool or platform you used

No response

Please list the provider IDs and location of the storage providers you will be working with.


How do you plan to make deals to your storage providers

Boost client, Lotus client, Singularity

If you answered "Others/custom tool" in the previous question, enter the details here

No response

Can you confirm that you will follow the Fil+ guideline


datacap-bot[bot] commented 1 day ago

Application is waiting for allocator review

liyunzhi-666 commented 1 day ago

I have a few more questions for you to confirm before reviewing your application. @zhangmiao112257

1. Have you prepared enough tokens for the sector pledge?
2. Are you a data preparer? What is your previous experience as a data preparer? List previous applications and client IDs.
3. How will the data be prepared? Please include tooling used and technical details.
4. If you are not preparing the data, who will prepare the data? (Name and Business)
5. Has this dataset been stored on Filecoin before? If so, why are you choosing to store it again?
6. Best practice for storing large datasets ideally includes storing the data in 3 or more regions, with 4 or more storage provider operators or owners. You should list the Miner ID, Business Entity, and Location of the SPs you will cooperate with.
7. Why are you applying for 18 PiB of DataCap? Please explain further.

Please send an email to verify authenticity, which should include your application and name.

zhangmiao112257 commented 18 hours ago

As far as I know, the SPs I contacted have already prepared at least 80% of the Filecoin pledge tokens. As a data preparer, I will prepare the data mainly with the official tools (Singularity, Boost, Lotus). The dataset I applied for has seen relatively little storage so far; I asked the SPs about it, and they told me that they had not stored it. The SPs being approached are already listed in the application.

zhangmiao112257 commented 18 hours ago

We've looked at the data size of one spectrum, which is around 180MB, and according to publicly available data DR8 has a total of 10633515 + 4510193 = 15143708 spectra.

15143708 × 180MB / 32G × 7 backups = 17.77PiB DataCap

That is: about 2.5PiB per copy, 7 backups, totaling about 18PiB of data.
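The estimate above can be re-checked with a short sketch. It uses only the figures quoted in the comment (spectrum count, 180 MB per spectrum, 7 copies, 32 G sectors) and assumes the units are binary (MiB, GiB, PiB), which the comment does not state. The function names are illustrative, and the sector count ignores any padding or packing overhead.

```python
import math

# Figures quoted in the comment; binary units are an assumption.
SPECTRA_COUNT = 10_633_515 + 4_510_193   # spectra in DR8, per the comment
SPECTRUM_SIZE_MIB = 180                  # "around 180MB", read here as MiB
REPLICAS = 7                             # "7 backups"
SECTOR_SIZE_GIB = 32                     # "32G", read here as GiB

MIB_PER_GIB = 1024
MIB_PER_PIB = 1024 ** 3


def single_copy_pib() -> float:
    """Size of one copy of the dataset in PiB."""
    return SPECTRA_COUNT * SPECTRUM_SIZE_MIB / MIB_PER_PIB


def total_datacap_pib() -> float:
    """Size of all replicas together in PiB."""
    return single_copy_pib() * REPLICAS


def sectors_per_copy() -> int:
    """32 GiB sectors needed for one copy, ignoring packing overhead."""
    total_mib = SPECTRA_COUNT * SPECTRUM_SIZE_MIB
    return math.ceil(total_mib / (SECTOR_SIZE_GIB * MIB_PER_GIB))


if __name__ == "__main__":
    print(f"spectra:          {SPECTRA_COUNT:,}")
    print(f"one copy:         {single_copy_pib():.2f} PiB")
    print(f"{REPLICAS} copies:         {total_datacap_pib():.2f} PiB")
    print(f"sectors per copy: {sectors_per_copy():,}")
```

Under these assumptions one copy comes to about 2.54 PiB and seven copies to about 17.77 PiB, which matches the figures in the comment.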

zhangmiao112257 commented 18 hours ago

Please check your email.

liyunzhi-666 commented 8 hours ago

This looks good. I have received your KYC email and am willing to support the first round. Per the Allocator approval rules, the first round will be approved for 18PiB × 5% = 921.6TiB, which I will round down and issue as 900TiB.
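As a quick check of the first-round figure, here is a minimal sketch of the arithmetic. The 5% share and the 18 PiB total come from the comment. The rounding step of 100 TiB is a hypothetical choice made here to reproduce the 900 TiB result; the comment does not say how the rounding was done. Function names are illustrative.

```python
import math

TIB_PER_PIB = 1024


def first_tranche_tib(total_pib: float, percent: float = 5) -> float:
    """First-round allocation in TiB as a percentage of the total request."""
    return total_pib * TIB_PER_PIB * percent / 100


def round_down_to_step(value_tib: float, step_tib: int = 100) -> int:
    """Round down to a multiple of step_tib (the step size is an assumption)."""
    return math.floor(value_tib / step_tib) * step_tib


if __name__ == "__main__":
    raw = first_tranche_tib(18)
    print(f"5% of 18 PiB: {raw:.1f} TiB")
    print(f"issued:       {round_down_to_step(raw)} TiB")
```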


Given the large amount for the first round, I hope you are operating in compliance; otherwise, I will limit the approval for the second round.

datacap-bot[bot] commented 8 hours ago

Datacap Request Trigger

Total DataCap requested


Expected weekly DataCap usage rate


DataCap Amount - First Tranche


Client address


datacap-bot[bot] commented 8 hours ago

DataCap Allocation requested

Multisig Notary address

Client address


DataCap allocation requested




datacap-bot[bot] commented 8 hours ago

Application is ready to sign

datacap-bot[bot] commented 8 hours ago

Request Approved

Your Datacap Allocation Request has been approved by the Notary

Message sent to Filecoin Network




Datacap Allocated


Signer Address




You can check the status of the message here:

datacap-bot[bot] commented 8 hours ago

Application is Granted