filecoin-project / filecoin-plus-large-datasets

Hub for client applications for DataCap at a large scale
110 stars 62 forks source link

[DataCap Application] <SXX Future Data> - LAMOST DR4 v2 <1/3> #1748

Closed sxxfuture-official closed 8 months ago

sxxfuture-official commented 1 year ago

Data Owner Name

SXX Future Data

Data Owner Country/Region

China

Data Owner Industry

Not-for-Profit

Website

https://www.sxxfuture.com/

Social Media

Twitter: @sxxfuture
Slack: @Jintao - Sxxfuture

Total amount of DataCap being requested

5PiB

Weekly allocation of DataCap requested

500TiB

On-chain address for first allocation

f1pgjhe5egpian6yjy4oak2s6e2nqdjean663ujxi

Custom multisig

Identifier

No response

Share a brief history of your project and organization

Focusing on distributed big data storage, disaster data protection, encryption algorithm research, encrypted data product application and Web3.0 new-generation Internet research and development.
SXX Future Data provides government enterprises and individual users with products and service system with data value as the core to meet the ever-expanding demand for mass data storage, management and application.

Is this project associated with other projects/ecosystem stakeholders?

No

If answered yes, what are the other projects/ecosystem stakeholders

No response

Describe the data being stored onto Filecoin

The Large Sky Area Multi-Object Fiber Spectroscopic Telescope (LAMOST) is a Chinese national scientific research facility operated by the National Astronomical Observatories, Chinese Academy of Sciences. It is a special reflecting Schmidt telescope with 4000 fibers in a field of view of 20 deg2
 in the sky. Until July 2016, LAMOST has completed its pilot survey which was launched in October 2011 and ended in June 2012, and the first four years of regular survey which was initiated on September 2012. After this five-year-survey, we totally obtain 7,617,035 spectra, which consist of stars, galaxies, quasars and other unknown objects[1−7]
. Now, the fourth data release (DR4) has published online (http://dr4.lamost.org/), and released data products include:

Spectra. - In general, there are 7,617,035 flux- and wavelength-calibrated, sky-subtracted spectra in DR4, including 6,943,865 stars, 117,254 galaxies, 36,575 quasars, and 519,341 unknown objects, and these spectra cover the wavelength range of 3690-9100 angstrom with a resolution of 1800[2−3]
 at the 5500 angstrom.
Spectroscopic Parameters Catalogs. - In this data release, six spectroscopic parameters catalogs are also published,they are the LAMOST general catalog, the A, F, G and K type star catalog, the A type star catalog, the M dwarf catalog, the observed plate information catalog, and the input catalog respectively. For the first four catalogs, they all include 36 columns of basic spectroscopic information, for example, right ascension, declination, signal to noise ratio, magnitude, classification and redshift. Also, the A type star catalog publish line indices of six spectral lines and four balmer line widths at 20% below the local continua, the A, F, G and K type star catalog provides effective temperature, surface gravity, and metallicity, and the M dwarf catalog releases the equivalent width of Halpha line, ten line indices, one metallicity sensitive parameter and a flag that indicates whether or not exist magnetic activity. For the observed plate information catalog, it mainly contains nine basic plate information for all published plates. At last, the input catalog includes 24 basic fields mentioned above, and three new fields which are not included in the above catalogs.

Guoshoujing Telescope (the Large Sky Area Multi-Object Fiber Spectroscopic Telescope LAMOST) is a National Major Scientific Project built by the Chinese Academy of Sciences. Funding for the project has been provided by the National Development and Reform Commission. LAMOST is operated and managed by the National Astronomical Observatories, Chinese Academy of Sciences.

Where was the data currently stored in this dataset sourced from

Other

If you answered "Other" in the previous question, enter the details here

http://dr4.lamost.org/

How do you plan to prepare the dataset

lotus, singularity, graphsplit

If you answered "other/custom tool" in the previous question, enter the details here

No response

Please share a sample of the data

http://dr4.lamost.org/v2/sas/catalog/
http://dr4.lamost.org/v2/sas/fits/B5591606/
http://dr4.lamost.org/v2/sas/fits/B5591705/
http://dr4.lamost.org/v2/sas/fits/B5591804/
http://dr4.lamost.org/v2/sas/fits/B5591903/
http://dr4.lamost.org/v2/sas/fits/B5591906/
http://dr4.lamost.org/v2/sas/fits/B5592004/
http://dr4.lamost.org/v2/sas/png/B5591606/
http://dr4.lamost.org/v2/sas/png/B5591705/
http://dr4.lamost.org/v2/sas/png/B5591804/
http://dr4.lamost.org/v2/sas/png/B5591906/
http://dr4.lamost.org/v2/sas/png/B5592004/

Confirm that this is a public dataset that can be retrieved by anyone on the Network

If you chose not to confirm, what was the reason

No response

What is the expected retrieval frequency for this data

Yearly

For how long do you plan to keep this dataset stored on Filecoin

2 to 3 years

In which geographies do you plan on making storage deals

Greater China, Asia other than Greater China, North America

How will you be distributing your data to storage providers

HTTP or FTP server, Shipping hard drives, Others

How do you plan to choose storage providers

Slack, Big data exchange, Others

If you answered "Others" in the previous question, what is the tool or platform you plan to use

No response

If you already have a list of storage providers to work with, fill out their names and provider IDs below

No response

How do you plan to make deals to your storage providers

Boost client, Lotus client

If you answered "Others/custom tool" in the previous question, enter the details here

No response

Can you confirm that you will follow the Fil+ guideline

Yes

large-datacap-requests[bot] commented 1 year ago

Thanks for your request!

Heads up, you’re requesting more than the typical weekly onboarding rate of DataCap!
large-datacap-requests[bot] commented 1 year ago

Thanks for your request! Everything looks good. :ok_hand:

A Governance Team member will review the information provided and contact you back pretty soon.

Sunnyiscoming commented 1 year ago

image Copyright © 2005-2017, National Astronomical Observatories, Chinese Academy of Sciences Can you provide supporting documents from authorized storage of these two organizations?

large-datacap-requests[bot] commented 1 year ago

Thanks for your request!

Heads up, you’re requesting more than the typical weekly onboarding rate of DataCap!
large-datacap-requests[bot] commented 1 year ago

Thanks for your request! Everything looks good. :ok_hand:

A Governance Team member will review the information provided and contact you back pretty soon.

sxxfuture-official commented 1 year ago

@Sunnyiscoming Thank for review my application! We have no relationship with the two organizations you provided. But, this is a large public data set, and we will add it to the FIL+ project for reliable storage on the chain. This is a data disclosure plan of an international astronomical project, refer to LAMOST DATA POLICY image

We added this statement in the application disclosure - "Describe the data being stored onto Filecoin", which complies with its usage rules.

Here you can see the public time node table of its data. http://www.lamost.org/lmusers/ image

Sunnyiscoming commented 1 year ago

image There are too many rules for usage of this dataset.

sxxfuture-official commented 1 year ago

image @Sunnyiscoming
If you look carefully at the terms, you should be able to see that the rules you have shown belong to "The Usage Policy for Pre-Release Data", and the subject of this LDN storage - LAMOST DR4, belongs to Release Data, so I included relevant statements can.

Here you can see the public time node table of its data. http://www.lamost.org/lmusers/

This is data released publicly by a society, so it complies with regulations.

Sunnyiscoming commented 1 year ago

Datacap Request Trigger

Total DataCap requested

5PiB

Expected weekly DataCap usage rate

500TiB

Client address

f1pgjhe5egpian6yjy4oak2s6e2nqdjean663ujxi

large-datacap-requests[bot] commented 1 year ago

DataCap Allocation requested

Multisig Notary address

f02049625

Client address

f1pgjhe5egpian6yjy4oak2s6e2nqdjean663ujxi

DataCap allocation requested

250TiB

Id

a37c3683-5935-41ed-a552-5db799038513

igoovo commented 1 year ago

Request Proposed

Your Datacap Allocation Request has been proposed by the Notary

Message sent to Filecoin Network

bafy2bzacedegcxqx4hnfuaj7ihaonryprt2wet7vepo33yqnn4qsnspgomjsq

Address

f1pgjhe5egpian6yjy4oak2s6e2nqdjean663ujxi

Datacap Allocated

250.00TiB

Signer Address

f1shnsfayxqll77svffaxnjenms7bbbysbqcatrpy

Id

a37c3683-5935-41ed-a552-5db799038513

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacedegcxqx4hnfuaj7ihaonryprt2wet7vepo33yqnn4qsnspgomjsq

igoovo commented 1 year ago

Public dataset, support.

OpenGate01 commented 1 year ago

I'm wondering if there are any confirmed storage providers at the moment?

sxxfuture-official commented 1 year ago

@OpenGate01 Yes, we have found SPs from different regions of the world to join our project, and they are distributed in mainland China, Hong Kong, Singapore, North America, and Japan. . . Most of them will initialize new nodes for storage, currently known as follows: f01986236 f01964215 f0503420 f01986203 More SP will be added later.

OpenGate01 commented 1 year ago

Ok, willing to support in the first round and would like to see CID Checker results that meet the requirements.

OpenGate01 commented 1 year ago

Request Approved

Your Datacap Allocation Request has been approved by the Notary

Message sent to Filecoin Network

bafy2bzaceaa3znnnautdmph7q2kxwv6d5ci7nbyrutrroptmok3eehmpwcijk

Address

f1pgjhe5egpian6yjy4oak2s6e2nqdjean663ujxi

Datacap Allocated

250.00TiB

Signer Address

f1im4hmtbfzqnx7ir74kdaiu4ynjhgqh3sdi2snla

Id

a37c3683-5935-41ed-a552-5db799038513

You can check the status of the message here: https://filfox.info/en/message/bafy2bzaceaa3znnnautdmph7q2kxwv6d5ci7nbyrutrroptmok3eehmpwcijk

large-datacap-requests[bot] commented 1 year ago

DataCap Allocation requested

Request number 2

Multisig Notary address

f02049625

Client address

f1pgjhe5egpian6yjy4oak2s6e2nqdjean663ujxi

DataCap allocation requested

500TiB

Id

1812579b-b7fd-456e-8ca1-1c57295dc3d0

large-datacap-requests[bot] commented 1 year ago

Stats & Info for DataCap Allocation

Multisig Notary address

f02049625

Client address

f1pgjhe5egpian6yjy4oak2s6e2nqdjean663ujxi

Rule to calculate the allocation request amount

100% of weekly dc amount requested

DataCap allocation requested

500TiB

Total DataCap granted for client so far

250TiB

Datacap to be granted to reach the total amount requested by the client (5PiB)

4.75PiB

Stats

Number of deals Number of storage providers Previous DC Allocated Top provider Remaining DC
2243 6 250TiB 20.47 55.34TiB
Bitengine-reeta commented 1 year ago

checker:manualTrigger

filplus-checker-app[bot] commented 1 year ago

DataCap and CID Checker Report Summary[^1]

Storage Provider Distribution

✔️ Storage provider distribution looks healthy.

Deal Data Replication

✔️ Data replication looks healthy.

Deal Data Shared with other Clients[^3]

✔️ No CID sharing has been observed.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

[^2]: Deals from those addresses are combined into this report as they are specified with checker:manualTrigger

[^3]: To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

Full report

Click here to view the full report.

Bitengine-reeta commented 1 year ago

CID Checker Report shows all aspects are very healthy. Check data retrieve : img_v2_aadd8770-ef57-4891-abfc-7bc9498e58fg

will support.

Bitengine-reeta commented 1 year ago

Request Proposed

Your Datacap Allocation Request has been proposed by the Notary

Message sent to Filecoin Network

bafy2bzaceb3vo4p7xxvod2hilmmjmxaz6ogoo6sf77f2zvl5jou476mgthhxy

Address

f1pgjhe5egpian6yjy4oak2s6e2nqdjean663ujxi

Datacap Allocated

500.00TiB

Signer Address

f1jyvhxp4kmwreo22ke4itspraznpudw3uqaink5i

Id

You can check the status of the message here: https://filfox.info/en/message/bafy2bzaceb3vo4p7xxvod2hilmmjmxaz6ogoo6sf77f2zvl5jou476mgthhxy

maxvint commented 1 year ago

checker:manualTrigger

filplus-checker-app[bot] commented 1 year ago

DataCap and CID Checker Report Summary[^1]

Storage Provider Distribution

✔️ Storage provider distribution looks healthy.

Deal Data Replication

✔️ Data replication looks healthy.

Deal Data Shared with other Clients[^3]

✔️ No CID sharing has been observed.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

[^2]: Deals from those addresses are combined into this report as they are specified with checker:manualTrigger

[^3]: To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

Full report

Click here to view the full report.

maxvint commented 1 year ago

Great, the data distribution looks very good and checks all the boxes, retrieval also works very well, willing to support.

image image image
maxvint commented 1 year ago

Request Approved

Your Datacap Allocation Request has been approved by the Notary

Message sent to Filecoin Network

bafy2bzaceam4vwe3ljgu2nffxakyntzz4lfzvfcommfwawyg44twwmqxd2z6w

Address

f1pgjhe5egpian6yjy4oak2s6e2nqdjean663ujxi

Datacap Allocated

500.00TiB

Signer Address

f1ui5iy4mmkxjbw7752omiwp2ols2fzv4thrayagi

Id

You can check the status of the message here: https://filfox.info/en/message/bafy2bzaceam4vwe3ljgu2nffxakyntzz4lfzvfcommfwawyg44twwmqxd2z6w

data-programs commented 1 year ago
KYC

This user’s identity has been verified through filplus.storage

cryptowhizzard commented 1 year ago

checker:manualTrigger

filplus-checker-app[bot] commented 1 year ago

DataCap and CID Checker Report Summary[^1]

Retrieval Statistics

Storage Provider Distribution

✔️ Storage provider distribution looks healthy.

Deal Data Replication

✔️ Data replication looks healthy.

Deal Data Shared with other Clients[^3]

✔️ No CID sharing has been observed.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

[^2]: Deals from those addresses are combined into this report as they are specified with checker:manualTrigger

[^3]: To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

Full report

Click here to view the CID Checker report. Click here to view the Retrieval report.

kevzak commented 1 year ago

Hi @sxxfuture-official retrieval looks good for your specific SP miner IDs.

Who are other SP entities you are storing with? How did you find them?

large-datacap-requests[bot] commented 1 year ago

DataCap Allocation requested

Request number 3

Multisig Notary address

f02049625

Client address

f1pgjhe5egpian6yjy4oak2s6e2nqdjean663ujxi

DataCap allocation requested

1000.0TiB

Id

e36dfe3a-cdb7-4716-ad56-432010c46d69

large-datacap-requests[bot] commented 1 year ago

Stats & Info for DataCap Allocation

Multisig Notary address

f02049625

Client address

f1pgjhe5egpian6yjy4oak2s6e2nqdjean663ujxi

Rule to calculate the allocation request amount

200% of weekly dc amount requested

DataCap allocation requested

1000.0TiB

Total DataCap granted for client so far

454747.4YiB

Datacap to be granted to reach the total amount requested by the client (5PiB)

454747.4YiB

Stats

Number of deals Number of storage providers Previous DC Allocated Top provider Remaining DC
14758 10 500TiB 15.78 124.59TiB
sxxfuture-official commented 1 year ago

@kevzak Through online and offline Orbit activities and FILFI's SP Alliance, we have met many SPs from different regions, and they all have strong technical capabilities to solve retrieval problems.

stcloudlisa commented 1 year ago

checker:manualTrigger

filplus-checker-app[bot] commented 1 year ago

DataCap and CID Checker Report Summary[^1]

Retrieval Statistics

Storage Provider Distribution

✔️ Storage provider distribution looks healthy.

Deal Data Replication

✔️ Data replication looks healthy.

Deal Data Shared with other Clients[^3]

✔️ No CID sharing has been observed.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

[^2]: Deals from those addresses are combined into this report as they are specified with checker:manualTrigger

[^3]: To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

Full report

Click here to view the CID Checker report. Click here to view the Retrieval Dashboard. Click here to view the Retrieval report.

stcloudlisa commented 1 year ago

looks good

stcloudlisa commented 1 year ago

Request Proposed

Your Datacap Allocation Request has been proposed by the Notary

Message sent to Filecoin Network

bafy2bzacecn7hgw273kcqvyrl76an66s3mliktot5kwy4ml43knvxsbeym3ew

Address

f1pgjhe5egpian6yjy4oak2s6e2nqdjean663ujxi

Datacap Allocated

1000.00TiB

Signer Address

f1jvvltduw35u6inn5tr4nfualyd42bh3vjtylgci

Id

e36dfe3a-cdb7-4716-ad56-432010c46d69

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacecn7hgw273kcqvyrl76an66s3mliktot5kwy4ml43knvxsbeym3ew

ipollo00 commented 1 year ago

Request Approved

Your Datacap Allocation Request has been approved by the Notary

Message sent to Filecoin Network

bafy2bzacedd3sowekcufhmjq6dmtttjqx7vwkytso7mxksw6esggiixbjjtra

Address

f1pgjhe5egpian6yjy4oak2s6e2nqdjean663ujxi

Datacap Allocated

1000.00TiB

Signer Address

f1n5wlrrhoxpkgwij25xrtt7w7g2k3fhbthmdn6ri

Id

e36dfe3a-cdb7-4716-ad56-432010c46d69

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacedd3sowekcufhmjq6dmtttjqx7vwkytso7mxksw6esggiixbjjtra

github-actions[bot] commented 1 year ago

This application has not seen any responses in the last 10 days. This issue will be marked with Stale label and will be closed in 4 days. Comment if you want to keep this application open.

-- Commented by Stale Bot.

large-datacap-requests[bot] commented 1 year ago

DataCap Allocation requested

Request number 4

Multisig Notary address

f02049625

Client address

f1pgjhe5egpian6yjy4oak2s6e2nqdjean663ujxi

DataCap allocation requested

1.95PiB

Id

7ca48ef6-f8cc-484a-a402-12337684b3ad

large-datacap-requests[bot] commented 1 year ago

Stats & Info for DataCap Allocation

Multisig Notary address

f02049625

Client address

f1pgjhe5egpian6yjy4oak2s6e2nqdjean663ujxi

Rule to calculate the allocation request amount

400% of weekly dc amount requested

DataCap allocation requested

1.95PiB

Total DataCap granted for client so far

909494701772928712704.0YiB

Datacap to be granted to reach the total amount requested by the client (5PiB)

909494701772928712704.0YiB

Stats

Number of deals Number of storage providers Previous DC Allocated Top provider Remaining DC
28581 17 1000.0TiB 11.24 605.15TiB
AlanGreaterheat commented 1 year ago

checker:manualTrigger

filplus-checker-app[bot] commented 1 year ago

DataCap and CID Checker Report Summary[^1]

Retrieval Statistics

Storage Provider Distribution

⚠️ 1 storage providers have unknown IP location - f02220886

Deal Data Replication

⚠️ 38.27% of deals are for data replicated across less than 4 storage providers.

Deal Data Shared with other Clients[^3]

✔️ No CID sharing has been observed.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

[^2]: Deals from those addresses are combined into this report as they are specified with checker:manualTrigger

[^3]: To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

Full report

Click here to view the CID Checker report. Click here to view the Retrieval Dashboard. Click here to view the Retrieval report.

AlanGreaterheat commented 1 year ago

I would like to know why this f02220886 node does not have a public IP?

sxxfuture-official commented 1 year ago

@AlanGreaterheat Sorry, f02220886 is a new miner. It has just started sealing in the past two days and has yet to configure the public IP. Just contacted SP to solve this problem

f02220886 -> {12D3KooWPdeAuETTcyjMjLkndQM87QqmMDdoGCaRugPyNn7tvAtw: [/ip4/111.48.247.130/tcp/12807]}

AlanGreaterheat commented 1 year ago

Request Proposed

Your Datacap Allocation Request has been proposed by the Notary

Message sent to Filecoin Network

bafy2bzacec2h4i7x2gkohficqgnjtcrzw5xisjswyld4ftaewzl7rccvphhis

Address

f1pgjhe5egpian6yjy4oak2s6e2nqdjean663ujxi

Datacap Allocated

1.95PiB

Signer Address

f1pnmzlxj7cfeo2v6oj5nco46hkg2l46wj7o4xxui

Id

7ca48ef6-f8cc-484a-a402-12337684b3ad

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacec2h4i7x2gkohficqgnjtcrzw5xisjswyld4ftaewzl7rccvphhis

NiwanDao commented 1 year ago

LGTM.

NiwanDao commented 1 year ago

Request Approved

Your Datacap Allocation Request has been approved by the Notary

Message sent to Filecoin Network

bafy2bzaceb7i6jup3boh7l46nungtgwxhyvjoksehxe7qf5bcdgdjurqch4us

Address

f1pgjhe5egpian6yjy4oak2s6e2nqdjean663ujxi

Datacap Allocated

1.95PiB

Signer Address

f1a2lia2cwwekeubwo4nppt4v4vebxs2frozarz3q

Id

7ca48ef6-f8cc-484a-a402-12337684b3ad

You can check the status of the message here: https://filfox.info/en/message/bafy2bzaceb7i6jup3boh7l46nungtgwxhyvjoksehxe7qf5bcdgdjurqch4us

github-actions[bot] commented 1 year ago

This application has not seen any responses in the last 10 days. This issue will be marked with Stale label and will be closed in 4 days. Comment if you want to keep this application open.

-- Commented by Stale Bot.

sxxfuture-official commented 11 months ago

checker:manualTrigger