filecoin-project / filecoin-plus-large-datasets

Hub for client applications for DataCap at a large scale

[DataCap Application] siyuanjiatong #1195

Closed metacodebean closed 1 year ago

metacodebean commented 2 years ago

Large Dataset Notary Application

To apply for DataCap to onboard your dataset to Filecoin, please fill out the following.

Core Information

Please respond to the questions below by replacing the text saying "Please answer here". Include as much detail as you can in your answer.

Project details

Share a brief history of your project and organization.

1. Beijing Siyuan Jiatong Technology Co., Ltd. was established in 2008. It is a gold agent certified by Cisco, focused on providing a full range of Cisco application solution products. It is also a professional network service company that provides computer system network construction and information technology consulting and services for enterprises and industries. The company is headquartered in Beijing and has branches in Wuhan. Existing customers have praised its forward-looking vision, success guarantee, and professional service. Our customers are spread across state-owned enterprises, the transportation industry, the education system, the medical system, real estate companies, and other fields.
2. The business scope of Beijing Siyuan Jiatong Technology Co., Ltd. covers a full range of services, from technical consultation to solution design and implementation. With an in-depth understanding of each customer's industry and a keen grasp of changes in customer demand, we can propose a targeted change plan. Drawing on our professional knowledge and skills, we help clients achieve a competitive advantage and improve overall performance. The company is committed to providing large, medium, and small enterprises with: (1) Network upgrade and transformation services, including IT management consulting solutions such as consulting, design, integration, maintenance, optimization, and upgrade of network infrastructure, as well as service support such as IT operation and maintenance. (2) Applications of new network technologies, including triple play (voice, data, video), network security, video surveillance, unified communications, network storage, network management, and network optimization solutions, with technical service support. Starting from consultation, we provide users with planning, design, sales, logistics, installation and commissioning, operation and maintenance, and maintenance both during and after the warranty period.

What is the primary source of funding for this project?

Company product and service revenue.

What other projects/ecosystem stakeholders is this project associated with?

Not yet.

Use-case details

Describe the data being stored onto Filecoin

1. Product pictures.
2. Network device logs.
3. Monitoring data.

Where was the data in this dataset sourced from?

Company.

Can you share a sample of the data? A link to a file, an image, a table, etc., are good ways to do this.

https://drive.google.com/file/d/1kp1RJDPJL2M3ytQQwFU9y1mbrddbaoty/view?usp=share_link
https://drive.google.com/file/d/1e0R7Ka76_O5yIEIXzGdqYv-yNf79j5oV/view?usp=share_link
https://drive.google.com/file/d/1egsbMSgyAxyqGVStXJTzMkRRHdWCkf_I/view?usp=share_link
https://drive.google.com/file/d/15WdJAxlwiXt7JAwKj0siUymT6MNJA7CB/view?usp=share_link
https://drive.google.com/file/d/1zNyVuQCMsl9Ap2iwYgwuJN_Vj9MEo2N1/view?usp=share_link
https://drive.google.com/file/d/1WWTwCDoXH3_PEzTFmV2IJQhDkUJnBJUv/view?usp=share_link
https://drive.google.com/file/d/1EPLLVEoK2MnS5pnqYp2gJUdJtjQqHTu5/view?usp=share_link

Confirm that this is a public dataset that can be retrieved by anyone on the Network (i.e., no specific permissions or access rights are required to view the data).

Yes

What is the expected retrieval frequency for this data?

Once a year.

For how long do you plan to keep this dataset stored on Filecoin?

More than 540 Days.

DataCap allocation plan

In which geographies (countries, regions) do you plan on making storage deals?

Hong Kong.

How will you be distributing your data to storage providers? Is there an offline data transfer process?

Both offline mail and online download are available.

How do you plan on choosing the storage providers with whom you will be making deals? This should include a plan to ensure the data is retrievable in the future both by you and others.

We prefer storage service providers with higher reliability.

How will you be distributing deals across storage providers?

Store to 5 SPs.

Do you have the resources/funding to start making deals as soon as you receive DataCap? What support from the community would help you onboard onto Filecoin?

We have funds ready for this project.

large-datacap-requests[bot] commented 2 years ago

Thanks for your request!

Heads up, you’re requesting more than the typical weekly onboarding rate of DataCap!

large-datacap-requests[bot] commented 2 years ago

Thanks for your request! Everything looks good. :ok_hand:

A Governance Team member will review the information provided and contact you back pretty soon.

simonkim0515 commented 1 year ago

Datacap Request Trigger

Total DataCap requested

5PiB

Expected weekly DataCap usage rate

400TiB

Client address

f1fs2rynivh2jeywfkkwiasdzmjsc2cdtmu3zuzfi

large-datacap-requests[bot] commented 1 year ago

DataCap Allocation requested

Multisig Notary address

f02049625

Client address

f1fs2rynivh2jeywfkkwiasdzmjsc2cdtmu3zuzfi

DataCap allocation requested

200TiB

Id

bed16d99-59aa-4138-9f5c-5279a0c2875c

metacodebean commented 1 year ago

image

1ane-1 commented 1 year ago

Can you provide more data samples? The data samples you provided are 7 pictures.

metacodebean commented 1 year ago

I have uploaded some videos and log files. The links are:

https://drive.google.com/file/d/1tvTWJ2G68bz3qcCurBHHwKjc_LUo6CLA/view?usp=share_link
https://drive.google.com/file/d/1UYtgIU3qK9zYGBQVzhw_UWPgNfXOZTIN/view?usp=share_link
https://drive.google.com/file/d/1ClSV8M1KsJlObQ8ah0Eyn_tsNr59_ckX/view?usp=share_link
https://drive.google.com/file/d/1m-RxjCHlK_RWhMOffPVSx5OHLEAMWSqj/view?usp=share_link
https://drive.google.com/file/d/1gYRUNzepRJ4rqL47bQ1cBb63RgLDuJ_a/view?usp=share_link
https://drive.google.com/file/d/1rkxysp2pBywk2S6maur8xTrVfQ9U-eBf/view?usp=share_link

1ane-1 commented 1 year ago

Could you provide a screenshot of your total data size? The pictures and videos can't prove that you need 5PiB of DataCap. Thanks a lot~

metacodebean commented 1 year ago

image

This is part of our data.

1ane-1 commented 1 year ago

Request Proposed

Your Datacap Allocation Request has been proposed by the Notary

Message sent to Filecoin Network

bafy2bzaceauju5wtsjtsdpp6dgyxci3shx5ef43gdy5zkuafqlxmpx4wwu6by

Address

f1fs2rynivh2jeywfkkwiasdzmjsc2cdtmu3zuzfi

Datacap Allocated

200.00TiB

Signer Address

f1mdk7s2vntzm6hu35yuo6vjubtrpfnb2awhgvrri

Id

bed16d99-59aa-4138-9f5c-5279a0c2875c

You can check the status of the message here: https://filfox.info/en/message/bafy2bzaceauju5wtsjtsdpp6dgyxci3shx5ef43gdy5zkuafqlxmpx4wwu6by

Tom-OriginStorage commented 1 year ago

It looks good. I'll sign it

Tom-OriginStorage commented 1 year ago

Request Approved

Your Datacap Allocation Request has been approved by the Notary

Message sent to Filecoin Network

bafy2bzaceayi76q3lhbyafxehgq2azidpez6lzyfr3ev4gvfw5ojubdwlcgyu

Address

f1fs2rynivh2jeywfkkwiasdzmjsc2cdtmu3zuzfi

Datacap Allocated

200.00TiB

Signer Address

f1q6bpjlqia6iemqbrdaxr2uehrhpvoju3qh4lpga

Id

bed16d99-59aa-4138-9f5c-5279a0c2875c

You can check the status of the message here: https://filfox.info/en/message/bafy2bzaceayi76q3lhbyafxehgq2azidpez6lzyfr3ev4gvfw5ojubdwlcgyu

large-datacap-requests[bot] commented 1 year ago

DataCap Allocation requested

Request number 2

Multisig Notary address

f02049625

Client address

f1fs2rynivh2jeywfkkwiasdzmjsc2cdtmu3zuzfi

DataCap allocation requested

400TiB

Id

7243b2f2-b6e0-4bcb-a3a4-e275e5a73b15

large-datacap-requests[bot] commented 1 year ago

Stats & Info for DataCap Allocation

Multisig Notary address

f01858410

Client address

f1fs2rynivh2jeywfkkwiasdzmjsc2cdtmu3zuzfi

Last two approvers

llifezou & 1ane-1

Rule to calculate the allocation request amount

100% of weekly dc amount requested

DataCap allocation requested

400TiB

Total DataCap granted for client so far

200TiB

Datacap to be granted to reach the total amount requested by the client (5PiB)

4.80PiB
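
The figures in this allocation comment can be cross-checked with a few lines of arithmetic. The sketch below is only an illustration, assuming binary units (1 PiB = 1024 TiB) and reading the "100% of weekly dc amount requested" rule literally. The function names are hypothetical and are not part of the bot.

```python
TIB_PER_PIB = 1024  # assumption: binary units, 1 PiB = 1024 TiB

# Values copied from the thread.
total_requested_tib = 5 * TIB_PER_PIB   # "Total DataCap requested: 5PiB"
granted_so_far_tib = 200                # "Total DataCap granted for client so far"
weekly_usage_tib = 400                  # "Expected weekly DataCap usage rate"


def remaining_pib(total_tib: float, granted_tib: float) -> float:
    """DataCap still to be granted, expressed in PiB."""
    return (total_tib - granted_tib) / TIB_PER_PIB


def next_allocation_tib(weekly_tib: float, rule_percent: float, remaining_tib: float) -> float:
    """Apply a 'percent of weekly amount' rule, capped by what is left to grant."""
    return min(weekly_tib * rule_percent / 100, remaining_tib)


remaining = remaining_pib(total_requested_tib, granted_so_far_tib)
allocation = next_allocation_tib(
    weekly_usage_tib, 100, total_requested_tib - granted_so_far_tib
)
print(f"remaining: {remaining:.2f} PiB, next allocation: {allocation:.0f} TiB")
```

Under these assumptions the result agrees with the 4.80PiB and 400TiB values reported above.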

Stats

| Number of deals | Number of storage providers | Previous DC Allocated | Top provider | Remaining DC |
| --- | --- | --- | --- | --- |
| 4640 | 9 | 200TiB | 13.79 | 46.15TiB |

filplus-checker-app[bot] commented 1 year ago

DataCap and CID Checker Report[^1]

Storage Provider Distribution

The below table shows the distribution of storage providers that have stored data for this client.

If this is the first time a provider takes a verified deal, it will be marked as new.

For most DataCap applications, the restrictions below should apply.

Since this is the 3rd allocation, the following restrictions have been relaxed:

✔️ Storage provider distribution looks healthy.

| Provider | Location | Total Deals Sealed | Percentage | Unique Data | Duplicate Deals |
| --- | --- | --- | --- | --- | --- |
| f01852325 | Hong Kong, Central and Western, HK<br/>BIH-Global Internet Harbor | 19.97 TiB | 13.86% | 19.97 TiB | 0.00% |
| f01852023 | Busan, Busan, KR<br/>Korea Telecom | 20.00 TiB | 13.88% | 20.00 TiB | 0.00% |
| f01851482 | Busan, Busan, KR<br/>Korea Telecom | 13.16 TiB | 9.13% | 13.16 TiB | 0.00% |
| f01852664 | Singapore, Singapore, SG<br/>StarHub Ltd | 19.66 TiB | 13.64% | 19.66 TiB | 0.00% |
| f01852677 | Morrisville, North Carolina, US<br/>TierPoint, LLC | 19.97 TiB | 13.86% | 19.97 TiB | 0.00% |
| f01966534 | Bangkok, Bangkok, TH<br/>Zenlayer Inc | 19.97 TiB | 13.86% | 19.97 TiB | 0.00% |
| f01965334 | Mumbai, Maharashtra, IN<br/>Zenlayer Inc | 15.38 TiB | 10.67% | 15.38 TiB | 0.00% |
| f01969202 | London, England, GB<br/>Zenlayer Inc | 11.66 TiB | 8.09% | 11.66 TiB | 0.00% |
| f01964073 | Jakarta, Jakarta, ID<br/>Zenlayer Inc | 4.31 TiB | 2.99% | 4.31 TiB | 0.00% |
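
The Percentage column appears to be each provider's share of the total sealed deal size. A minimal sketch of that calculation follows, using the rounded TiB values from the table, so results can differ from the report in the last digit. The `MAX_SHARE_PERCENT` threshold is a hypothetical value for illustration and is not taken from the checker.

```python
# Sealed deal totals in TiB, copied from the table above.
provider_tib = {
    "f01852325": 19.97,
    "f01852023": 20.00,
    "f01851482": 13.16,
    "f01852664": 19.66,
    "f01852677": 19.97,
    "f01966534": 19.97,
    "f01965334": 15.38,
    "f01969202": 11.66,
    "f01964073": 4.31,
}

MAX_SHARE_PERCENT = 25.0  # hypothetical limit, not from the report


def provider_shares(totals: dict[str, float]) -> dict[str, float]:
    """Return each provider's share of the grand total, in percent."""
    grand_total = sum(totals.values())
    return {provider: 100 * tib / grand_total for provider, tib in totals.items()}


shares = provider_shares(provider_tib)
top_id, top_share = max(shares.items(), key=lambda item: item[1])
over_limit = [p for p, share in shares.items() if share > MAX_SHARE_PERCENT]

print(f"total sealed: {sum(provider_tib.values()):.2f} TiB")
print(f"top provider: {top_id} at {top_share:.2f}%")
print(f"providers above the assumed limit: {over_limit}")
```

A share-based check like this says nothing about whether the providers are independent entities, which is the concern raised later in the thread.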

Provider Distribution

Deal Data Replication

The table below shows how many providers each portion of the unique data is replicated across.

Since this is the 3rd allocation, the following restrictions have been relaxed:

✔️ Data replication looks healthy.

| Unique Data Size | Total Deals Made | Number of Providers | Deal Percentage |
| --- | --- | --- | --- |
| 4.53 TiB | 4.53 TiB | 1 | 3.15% |
| 3.78 TiB | 7.56 TiB | 2 | 5.25% |
| 6.88 TiB | 20.63 TiB | 3 | 14.32% |
| 12.56 TiB | 50.25 TiB | 4 | 34.88% |
| 12.22 TiB | 61.09 TiB | 5 | 42.41% |
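
The columns of the replication table are related: Total Deals Made is roughly Unique Data Size times Number of Providers, and Deal Percentage is each row's share of all deals. The sketch below checks this against the rounded values in the table and derives an average replication factor. It is an illustration under those assumptions, not the checker's actual code.

```python
# (unique data TiB, number of providers, total deals TiB, deal percentage)
# copied from the table above.
rows = [
    (4.53, 1, 4.53, 3.15),
    (3.78, 2, 7.56, 5.25),
    (6.88, 3, 20.63, 14.32),
    (12.56, 4, 50.25, 34.88),
    (12.22, 5, 61.09, 42.41),
]

total_deals_tib = sum(total for _, _, total, _ in rows)
total_unique_tib = sum(unique for unique, _, _, _ in rows)

# Largest gap between unique * providers and the reported deal total.
max_total_gap = max(abs(unique * n - total) for unique, n, total, _ in rows)

# Largest gap between the recomputed and the reported deal percentage.
max_percent_gap = max(
    abs(100 * total / total_deals_tib - percent) for _, _, total, percent in rows
)

# Average number of copies per unit of unique data.
average_replicas = total_deals_tib / total_unique_tib

print(f"total deals: {total_deals_tib:.2f} TiB, unique: {total_unique_tib:.2f} TiB")
print(f"largest total gap: {max_total_gap:.2f} TiB")
print(f"largest percentage gap: {max_percent_gap:.2f} points")
print(f"average replicas: {average_replicas:.2f}")
```

The small gaps come from rounding in the published table.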

Replication Distribution

Deal Data Shared with other Clients

The table below shows how much unique data is shared with other clients. Usually, different applications own different data and should not resolve to the same CID.

However, this could be possible if all of the clients below use the same software to prepare the exact same dataset, or if they belong to a series of LDN applications for the same dataset.

✔️ No CID sharing has been observed.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

herrehesse commented 1 year ago

@xiaoyuaiheshui "It looks good. I'll sign it" is not due diligence.

The above miners seem to be all from the same entity and have gotten over 300P worth of datacap. Could be highly fraudulent and should be on hold until more is known.

@metacodebean Dear Applicant,

(Your website is not reachable, please resolve)

Due to the recent increase in erroneous/wrong Filecoin+ data, on behalf of the entire community, we feel compelled to look deeper into DataCap requests. The aim is to ensure that the overall value of the Filecoin network and the Filecoin+ program increases and that the program is not abused.

Please answer the questions below as comprehensively as possible.

Customer data

We expect that onboarding a customer at the scale of an LDN would have been preceded by at least multiple emails and perhaps several chat conversations. A single email with an agreement does not qualify here.

If this is solely for acquiring DataCap, it is of course out of the question. The customer must have a legitimate reason for wanting to use the Filecoin+ program, which is intended to store useful and public datasets on the network.

(As an intermediate solution Filecoin offers the FIL-E program or the glif.io website for business datasets that do not meet the requirements for a Filecoin+ dataset)

Files and Processing

Hopefully you understand the caution the overall community has for onboarding the wrong data. We understand the increased need for Filecoin+, however, we must not allow the program to be misused. Everything depends on a valuable and useful network, let's do our best to make this happen. Together.

cryptowhizzard commented 1 year ago

Hi there,

Retrieval tests failed here for these providers. None of the stored deals are retrievable, and they all come back with the same error:

Example1 -> Deal ID 21623095 retrieve: offer error: retrieval query offer errored: failed to fetch piece to retrieve from: getting pieces for cid QmYXoGqwxvq3xzy8s22Bd5ySvyL85SskjBiP2YZCXzFWCp: getting pieces containing block QmYXoGqwxvq3xzy8s22Bd5ySvyL85SskjBiP2YZCXzFWCp: failed to lookup index for mh 1220976ff1281523b41a3905f17239e5151d01f392cc11ce23ce2c5023a4a64c0d01, err: datastore: key not found

Example2 -> Deal ID 21512169 retrieve: offer error: retrieval query offer errored: failed to fetch piece to retrieve from: getting pieces for cid QmVZNyBweu1iMSzkmshjJDfkNrWP6BX1vrMSpbBsk119Sj: getting pieces containing block QmVZNyBweu1iMSzkmshjJDfkNrWP6BX1vrMSpbBsk119Sj: failed to lookup index for mh 12206b448fd102baba908c3b6685d7c73e0c294dc05179f02c3ac5d5e8c7f7c1ae54, err: datastore: key not found

Example3 -> Deal ID 21528237 retrieve: offer error: retrieval query offer errored: failed to fetch piece to retrieve from: getting pieces for cid QmRppoH7UhWqTEbhkmRKNgZHUmiUtQ8VjsYWpFBXCfxvuf: getting pieces containing block QmRppoH7UhWqTEbhkmRKNgZHUmiUtQ8VjsYWpFBXCfxvuf: failed to lookup index for mh 122033cabb36381846d2620d028dbd2ee1a233a426f04450b07258a589bf4cac25ba, err: datastore: key not found

Example4 -> Deal ID 21551337 retrieve: offer error: retrieval query offer errored: failed to fetch piece to retrieve from: getting pieces for cid QmTwBSiS2Rz9wrewtNFTVcjLBMpLyHv3y1Fn1PDJhkXtUk: getting pieces containing block QmTwBSiS2Rz9wrewtNFTVcjLBMpLyHv3y1Fn1PDJhkXtUk: failed to lookup index for mh 12205322ef13da82bacf5d4a5eb684750c94051f0cc392645c91678918f0beebca15, err: datastore: key not found

Before datacap can be granted this situation must be resolved. The data stored in the FIL+ program needs to be retrievable and none of this stored data is.

I advise @metacodebean to contact their storage providers and provide a solution before continuing this project. @metacodebean, please ask your SPs to switch to Boost so that the data is properly stored and retrievable, and wait until this is done.
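
For anyone triaging similar logs, here is a small sketch that pulls the CID, the `mh` value, and the final error out of one of the lines above. It assumes the `mh` field is the hex multihash of the CID and checks that by base58-decoding the CIDv0 string. The function names and the regular expression are illustrative and are not part of Lotus or Boost.

```python
import re

B58_ALPHABET = "123456789ABCDEFGHJKLMNPQRSTUVWXYZabcdefghijkmnopqrstuvwxyz"


def b58decode(text: str) -> bytes:
    """Decode a base58 (Bitcoin alphabet) string, as used by CIDv0, to bytes."""
    number = 0
    for char in text:
        number = number * 58 + B58_ALPHABET.index(char)
    body = number.to_bytes((number.bit_length() + 7) // 8, "big")
    leading_zeros = len(text) - len(text.lstrip("1"))
    return b"\x00" * leading_zeros + body


# Matches the tail of the error lines quoted in this thread.
LINE_RE = re.compile(
    r"getting pieces for cid (?P<cid>Qm[1-9A-HJ-NP-Za-km-z]{44}).*?"
    r"failed to lookup index for mh (?P<mh>[0-9a-f]+), err: (?P<err>.+)$"
)


def parse_retrieval_error(line: str):
    """Return the CID, multihash, and error text, or None if the line differs."""
    match = LINE_RE.search(line)
    if match is None:
        return None
    cid, mh = match["cid"], match["mh"]
    return {
        "cid": cid,
        "mh": mh,
        "error": match["err"].strip(),
        # For CIDv0, the decoded bytes are the multihash itself.
        "mh_matches_cid": b58decode(cid).hex() == mh,
    }


# Tail of "Example1" above, copied from the thread.
SAMPLE_LINE = (
    "getting pieces for cid QmYXoGqwxvq3xzy8s22Bd5ySvyL85SskjBiP2YZCXzFWCp: "
    "getting pieces containing block QmYXoGqwxvq3xzy8s22Bd5ySvyL85SskjBiP2YZCXzFWCp: "
    "failed to lookup index for mh "
    "1220976ff1281523b41a3905f17239e5151d01f392cc11ce23ce2c5023a4a64c0d01, "
    "err: datastore: key not found"
)

print(parse_retrieval_error(SAMPLE_LINE))
```

Grouping parsed lines by the `error` field makes it easy to confirm that every failed deal reports the same `datastore: key not found` cause.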

cryptowhizzard commented 1 year ago

@FroghubMan can you please cancel your proposal. We must wait until this situation is resolved.

Sunnyiscoming commented 1 year ago

@metacodebean Any update here?

cryptowhizzard commented 1 year ago

Feb 15 23:40:03 proposals dealscanner-f01987325-f01852664: Error: Failed to retrieve content with candidate miner f01852664: data transfer failed: deal errored: getting pieces for cid QmbizRacw3A9E4n7sKFbKWMXV4yV26AsMi3FBc61xxgNiP: getting pieces containing block QmbizRacw3A9E4n7sKFbKWMXV4yV26AsMi3FBc61xxgNiP: failed to lookup index for mh 1220c6e0ffeed9d3c32225e6c46d7c987fd2f03bc7f0201bf38dbe718812bd5860bc, err: datastore: key not found

C00kies77 commented 1 year ago

checker:manualTrigger

filplus-checker-app[bot] commented 1 year ago

DataCap and CID Checker Report Summary[^1]

Storage Provider Distribution

✔️ Storage provider distribution looks healthy.

Deal Data Replication

✔️ Data replication looks healthy.

Deal Data Shared with other Clients[^3]

✔️ No CID sharing has been observed.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

[^2]: Deals from those addresses are combined into this report as they are specified with checker:manualTrigger

[^3]: To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...

Full report

Click here to view the full report.

github-actions[bot] commented 1 year ago

This application has not seen any responses in the last 10 days. This issue will be marked with Stale label and will be closed in 4 days. Comment if you want to keep this application open.

github-actions[bot] commented 1 year ago

This application has not seen any responses in the last 14 days, so for now it is being closed. Please feel free to contact the Fil+ Gov team to re-open the application if it is still being processed. Thank you!

metacodebean commented 1 year ago

Hello, we will look for reputable, technologically mature SPs to continue working with. Thank you!

filplus-checker-app[bot] commented 1 year ago

DataCap and CID Checker Report[^1]

No active deals found for this client.

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger