filecoin-project / filecoin-plus-large-datasets

Hub for client applications for DataCap at a large scale
109 stars 62 forks source link

[DataCap Application] Ganku Co., Ltd. #1028

Closed Ganku112211 closed 1 year ago

Ganku112211 commented 2 years ago

Large Dataset Notary Application

To apply for DataCap to onboard your dataset to Filecoin, please fill out the following.

Core Information

Please respond to the questions below by replacing the text saying "Please answer here". Include as much detail as you can in your answer.

Project details

Share a brief history of your project and organization.

Founded in 2015, Ganku Technology has set up team in Beijing and Chengdu. We are committed to becoimg a comprehensive information security solutions provider. We provide blockchain security related business support, security analysis based on big data and other businesses. At present, we are mainly engaged in two business sectors:
1. Big data-based on-chain behavior tracking and analysis, user portrait, address marking, etc.
2. Data + AI-driven security, extract and analyze massive logs and on-chain information, summarize specific data samples and conduct machine learning training.

What is the primary source of funding for this project?

There are two main sources of funding for us:
1. Money invested by early adopters.
2. Business income of project business line

What other projects/ecosystem stakeholders is this project associated with?

Our current business line is closely related to on-chain data, and our customers also come from the blockchain industry. 
1. Regarding the server supply chain, we mainly purchase aliyun servers. 
2. As for the application party of distributed storage track, we currently have cooperation with NFT to help them review the contract code. As for the storage of metadata, we will suggest them to use decentralized services. Some customers require us to provide relevant solutions. These customers have storage requirements of NFT for music, video and pictures. 
3. As for our customers using secure data service, some on-chain data needs to be stored after processing. 

Use-case details

Describe the data being stored onto Filecoin

The data stored on Filecoin currently consists mostly of application data and publicly available data stored by customers. 
Some of the client's project data will be used for metadata storage of images such as NFT. 
The project stores project data and training related to security AI training, including attack log raw data and processed vector data. 

Where was the data in this dataset sourced from?

Mainly from customers and application data on projects

Can you share a sample of the data? A link to a file, an image, a table, etc., are good ways to do this.

https://github.com/vicious11/ML-for-SQL-Injection/blob/master/ML_for_SQL/data/sql_matrix.csv#/
https://pan.baidu.com/s/14nbEwY2h68Z7GqThcX6efw?pwd=1A4S
https://pan.baidu.com/s/1_paSZX6pkH4sTNw_im1khQ?pwd=1A4S 
https://pan.baidu.com/s/1RAI1fE6lakPAWOmJqjPU2w?pwd=1A4S 
Extraction code:1A4S 

Confirm that this is a public dataset that can be retrieved by anyone on the Network (i.e., no specific permissions or access rights are required to view the data).

Yes, this is a public dataset that can be retrieved by anyone on the Network.

What is the expected retrieval frequency for this data?

There is no fixed frequency for data retrieval, it depends on the need

For how long do you plan to keep this dataset stored on Filecoin?

It also depends on the need.

DataCap allocation plan

In which geographies (countries, regions) do you plan on making storage deals?

USA, Singapore

How will you be distributing your data to storage providers? Is there an offline data transfer process?

We support both online and offline modes.

How do you plan on choosing the storage providers with whom you will be making deals? This should include a plan to ensure the data is retrievable in the future both by you and others.

Our plan is to find miners who have performed well in the past to help us provide stable storage.

How will you be distributing deals across storage providers?

Proportion based on miners' repuation.

Do you have the resources/funding to start making deals as soon as you receive DataCap? What support from the community would help you onboard onto Filecoin?

We have enough funds to start trading.
large-datacap-requests[bot] commented 2 years ago

Thanks for your request! Everything looks good. :ok_hand:

A Governance Team member will review the information provided and contact you back pretty soon.

Sunnyiscoming commented 1 year ago

https://www.ganku.net/ Website are unavailable. Please check it. The data samples you provided are almost from chabug.org. What's the relationship between Ganku and chabug.org? Have you authorized to store the data on the Filecoin network? Can you explain your data composition? How many copies will you store? What's the relationship between you and the organization? Can you provide more detailed information about other storage providers participated in this program, such as you can list SPs you have contacted with at present?

Ganku112211 commented 1 year ago

Hello, @Sunnyiscoming Thank you very much for your feedback

  1. The problem that the website is unavailable may be a network problem. Please try again. Our display is normal.
  2. The data samples we provide are security-related teaching videos, and the data we will store mainly includes the following categories. Video (10%), database files (35%), security log attack files (50%), other information documents (5%)
  3. We will determine the number of backups based on different data. For example, we will save 2-3 copies of logs that are more important to us. Other data may be prepared according to the number of backups of 1-2 copies.
  4. I am the technical director of ganku, and I have sent KYB emails before, please confirm.
  5. We have come into contact with some well-known sps in the industry, for example: f01893023/f01890456/f01773206/f01880364/f01853077
simonkim0515 commented 1 year ago

Datacap Request Trigger

Total DataCap requested

5PiB

Expected weekly DataCap usage rate

100TiB

Client address

f1sffbmjyfnz5jbsej23syerh6ubjcfmk4hd34zpi

large-datacap-requests[bot] commented 1 year ago

DataCap Allocation requested

Multisig Notary address

f02049625

Client address

f1sffbmjyfnz5jbsej23syerh6ubjcfmk4hd34zpi

DataCap allocation requested

50TiB

Id

76e298aa-a20f-4233-b6b7-d12349fb36e3

Defil2022 commented 1 year ago

sample data looks good. website is fine. one concern is does your data can be stored in USA or other country?

Tom-OriginStorage commented 1 year ago

can you send email to [filplus-app-review@fil.org]

Ganku112211 commented 1 year ago

Hi, @Defil2022 @Tom-OriginStorage Yes, we have considered this issue and are planning it. I have sent KYC email, please check it! thanks! b01f337776600c221d4989a867150b7

Tom-OriginStorage commented 1 year ago

Request Proposed

Your Datacap Allocation Request has been proposed by the Notary

Message sent to Filecoin Network

bafy2bzacea3j6k67xuvoewks6vu5a4cd5w65u6b6pgazbt25lyljts6mp3vte

Address

f1sffbmjyfnz5jbsej23syerh6ubjcfmk4hd34zpi

Datacap Allocated

50.00TiB

Signer Address

f1q6bpjlqia6iemqbrdaxr2uehrhpvoju3qh4lpga

Id

76e298aa-a20f-4233-b6b7-d12349fb36e3

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacea3j6k67xuvoewks6vu5a4cd5w65u6b6pgazbt25lyljts6mp3vte

large-datacap-requests[bot] commented 1 year ago

Aborting. Exit Code is Non 0

Defil2022 commented 1 year ago

Request Approved

Your Datacap Allocation Request has been approved by the Notary

Message sent to Filecoin Network

bafy2bzacecxoareuot4vin2pyva5xfunwhvogwer4gl4xux44tz47bkqzhnra

Address

f1sffbmjyfnz5jbsej23syerh6ubjcfmk4hd34zpi

Datacap Allocated

50.00TiB

Signer Address

f1dnb3uz7sylxk6emti3ififcvu3nlufnnsjui6ea

Id

76e298aa-a20f-4233-b6b7-d12349fb36e3

You can check the status of the message here: https://filfox.info/en/message/bafy2bzacecxoareuot4vin2pyva5xfunwhvogwer4gl4xux44tz47bkqzhnra

large-datacap-requests[bot] commented 1 year ago

Aborting. Exit Code is Non 0

large-datacap-requests[bot] commented 1 year ago

DataCap Allocation requested

Request number 2

Multisig Notary address

f02049625

Client address

f1sffbmjyfnz5jbsej23syerh6ubjcfmk4hd34zpi

DataCap allocation requested

100TiB

Id

a21ad1bf-5d45-40a9-892f-cfff918e9c9c

large-datacap-requests[bot] commented 1 year ago

Stats & Info for DataCap Allocation

Multisig Notary address

f01858410

Client address

f1sffbmjyfnz5jbsej23syerh6ubjcfmk4hd34zpi

Last two approvers

DeFIL123 & llifezou

Rule to calculate the allocation request amount

100% of weekly dc amount requested

DataCap allocation requested

100TiB

Total DataCap granted for client so far

50TiB

Datacap to be granted to reach the total amount requested by the client (5 PiB)

4.95PiB

Stats

Number of deals Number of storage providers Previous DC Allocated Top provider Remaining DC
1495 6 50TiB 18.06 3.28TiB
Ganku112211 commented 1 year ago

checker:manualTrigger

filplus-checker commented 1 year ago

DataCap and CID Checker Report[^1]

Storage Provider Distribution

The below table shows the distribution of storage providers that have stored data for this client.

If this is the first time a provider takes verified deal, it will be marked as new.

For most of the datacap application, below restrictions should apply.

⚠️ f01985611 has unknown IP location.

⚠️ 43.56% of total deal sealed by f01834291 are duplicate data.

Provider Location Total Deals Sealed Percentage Unique Data Duplicate Deals
f01854080 Los Angeles, California, US
Zenlayer Inc
8.44 TiB 18.33% 8.44 TiB 0.00%
f01853104 Los Angeles, California, US
Zenlayer Inc
8.25 TiB 17.92% 8.25 TiB 0.00%
f01985611 Unknown
Unknown
8.22 TiB 17.85% 8.22 TiB 0.00%
f01890456 Los Angeles, California, US
Zenlayer Inc
8.13 TiB 17.65% 8.13 TiB 0.00%
f01834253 Los Angeles, California, US
Zenlayer Inc
6.69 TiB 14.53% 6.69 TiB 0.00%
f01834291 Los Angeles, California, US
Zenlayer Inc
6.31 TiB 13.71% 3.56 TiB 43.56%

Provider Distribution

Deal Data Replication

The below table shows how each many unique data are replicated across storage providers.

⚠️ 100.00% of deals are for data replicated across less than 4 storage providers.

Unique Data Size Total Deals Made Number of Providers Deal Percentage
42.78 TiB 45.53 TiB 1 98.91%
160.00 GiB 320.00 GiB 2 0.68%
64.00 GiB 192.00 GiB 3 0.41%

Replication Distribution

Deal Data Shared with other Clients

The below table shows how many unique data are shared with other clients. Usually different applications owns different data and should not resolve to the same CID.

However, this could be possible if all below clients use same software to prepare for the exact same dataset or they belong to a series of LDN applications for the same dataset.

⚠️ CID sharing has been observed.

Other Client Application Total Deals Affected Unique CIDs Approvers
f1ji2xty4dtg2a4pfnslltkw7pbhy76kvihizkfmq Pow Power 143.59 TiB 552 Unknown
f1gmiepn73zoa5gz2oiqyugjmrsecwj5qxd42vmyi `` 82.72 TiB 295 Unknown
f1qntyafmuq5h3moldezdkpgiiycbilrsvgfvyzti Zhonghe Mineral Processing Technology 28.72 TiB 102 11ane-1
1Defil2022
2fireflyHZ
2liyunzhi-666
1NDLABS-OFFICE
1newwebgroup
1psh0691
1stcouldlisa
1szalongdzyxgs
2Tom-OriginStorage
f1ng6g57r4u62q67u6lm33ftijfsggyzzjzb2l4cy NFTSTAR 27.56 TiB 165 11ane-1
1Defil2022
2newwebgroup
1psh0691
1stcouldlisa
2Tom-OriginStorage
f1o2nxreefzqzc2mvaqneaolpdat5yk3275r6rofa iPolloverse SG PTE. LTD 21.53 TiB 98 11ane-1
2Defil2022
1fireflyHZ
1ipfscn
1kernelogic
1liyunzhi-666
1newwebgroup
2psh0691
4stcouldlisa
5Tom-OriginStorage
1YuanHeHK
f1y7xmlneep3qtxcumqlmqrja3fhqdxdaxwou6mdq ND LABS 20.78 TiB 135 1cryptowhizzard
1Fenbushi-Filecoin
1IreneYoung
1MegTei
1psh0691
1rayshitou
1Reiers
f1vajbbihmcqkcjlnicmjto6fflklvhoifips7uzq rctAI 19.94 TiB 185 Unknown
f1u3rzf5re6nvnyjjixcmbk5tderq55vjhudmapny Nswap 18.09 TiB 74 1Defil2022
1psh0691
1Tom-OriginStorage
1YuanHeHK
f1guplg5wyjdn6bv4forsb5eu2lohexvdlttkavpq Ipollo 14.72 TiB 113 1MegTei
1rayshitou
1Reiers
1swatchliu
f13fjnwckkgnkbpapcmnpupbdf4ohduomdso4eqga Asia Blockchain Gaming Alliance 8.44 TiB 85 3Defil2022
1GaryGJG
2newwebgroup
1psh0691
2stcouldlisa
1Tom-OriginStorage
f1dfkdmjhuvvol6okun57mix447wdpolwcd323ktq BITRISE CAPITAL FOUNDATION PTE.LTD 8.34 TiB 69 1dannyob
1MegTei
1rayshitou
1swatchliu
f1jkf2e4ljhym3yaxdlq5cvqctpfsvecrklnsubjy Mikaelafashion 8.09 TiB 119 1Defil2022
1Tom-OriginStorage
f1k7y7jd3ly42cg5ysty2ngc5smxmaery5mglbynq MikaeLa 6.50 TiB 75 Unknown
f1df64fi3a4pmo5mxvdrz2zs7n2bfd4wnixomudcq UZERO 6.19 TiB 58 1fireflyHZ
1liyunzhi-666
f17d2n363326qlln4uva7ylxteamtdj3lq6wuaewi Metadata Labs Inc 5.59 TiB 55 1Fenbushi-Filecoin
1IreneYoung
1rayshitou
f1raeideaex2fwvoemftky6x5jzbqdawrgeuz2mry BBNews 5.03 TiB 40 1KodaRobotDog
1rayshitou
f1zxmwo3kwapc6axatwuia4lsowvpasco3znfbuvi Unknown 4.78 TiB 48 Unknown
f1e2emop3dpataocjw3jh4cfvqrqbp7albnip7wdq Ganku Technology 4.41 TiB 29 1IreneYoung
f1366c5an6uegsc7zhsy4csax3e5msrhie5gef7mi Jinchan Curtains Co.,ltd 4.16 TiB 27 1fireflyHZ
1liyunzhi-666
f1pnsnrfp5dqfqwospzj6vis6jxhni765i37tne5q TIMAGE 4.16 TiB 45 1fireflyHZ
1PluskitOfficial
f1nkh5as6inqcvcoiliumdmvai2luahw5imdqug5a `` 3.25 TiB 16 Unknown
f1jlkknv6psnw4m3wxok25sy7ngzdfjjk6rucbfwy Nonabyte 2.88 TiB 22 1liyunzhi-666
1newwebgroup
f1mhyxd4unemmhrw4dbhjcovivayrj3tyactezmzq GOLDEN SECURITY 2.31 TiB 12 Unknown
f1hkvuvlwgzsgxnyet3wb4w2s3fyujvtkgeizdaey Blue Storm Information Technology 2.22 TiB 16 11475Notary
1Defil2022
1liyunzhi-666
1stcouldlisa
1swatchliu
1Tom-OriginStorage
1YuanHeHK
f16w5d2u57ndts6wcqupwwvrl7ewicrl5xnymfxui Maoyoo Network 1.91 TiB 16 Unknown
f13wvg4xve7nz3nev4bi2c7imod7lelstxfwwzjgi We Doctor 1.38 TiB 42 1Alex11801
1Defil2022
3kernelogic
1liyunzhi-666
1NDLABS-OFFICE
1stcouldlisa
f1ewwoms3aiairedun4h2uayqhlmtjsulnqy2xnrq AsiaBlockchain Gaming Alliance 544.00 GiB 7 1dannyob
1IreneYoung
1MegTei
1MRJAVAZHAO
1rayshitou
f14kei3mbjobgsfszu5cndhosuobkbbtoaikc52pq Proya 480.00 GiB 8
f1fn4y2mtu7ooeuqxzczxeienm6zbnsgpf6b5ng2a Wel Vape 384.00 GiB 3 1fireflyHZ
f1fttsmb4vjost7zod5gf4eqpd6qpzylx3i4d6s3i 奕甲智能技术(上海)有限公司 352.00 GiB 11
f1crhsa6czawwislz35lcwt3zke2cr2vy5rmdmt3y `` 320.00 GiB 3 Unknown
f1dob4zdjy6b3iinf6evbqzxf6nwgnftcksewabxq Dr.ji 288.00 GiB 9 1newwebgroup
1psh0691
1stcouldlisa
1YuanHeHK
f1mgvvj45ce5i3ikpyixtubqzcytcgmdlwgjldqri Blockchain Media 160.00 GiB 2 1Defil2022
1Joss-Hua
2kernelogic
1liyunzhi-666
1NDLABS-OFFICE
f1fsoyq7oiwdaapagpnlncbohkrkzolxxzkex6hxq SemiDrive Technology 128.00 GiB 4 1Alex11801
1Defil2022
1kernelogic
2liyunzhi-666
1NDLABS-OFFICE

[^1]: To manually trigger this report, add a comment with text checker:manualTrigger

Sunnyiscoming commented 1 year ago

Hi, please explain the abnormal information.

github-actions[bot] commented 1 year ago

This application has not seen any responses in the last 10 days. This issue will be marked with Stale label and will be closed in 4 days. Comment if you want to keep this application open.

github-actions[bot] commented 1 year ago

This application has not seen any responses in the last 14 days, so for now it is being closed. Please feel free to contact the Fil+ Gov team to re-open the application if it is still being processed. Thank you!

large-datacap-requests[bot] commented 10 months ago

Thanks for your request! :exclamation: We have found some problems in the information provided. We could not find Website \/ Social Media field in the information provided We could not find Total amount of DataCap being requested (between 500 TiB and 5 PiB) field in the information provided We could not find Weekly allocation of DataCap requested (usually between 1-100TiB) field in the information provided We could not find On-chain address for first allocation field in the information provided We could not find Data Type of Application field in the information provided

Please, take a look at the request and edit the body of the issue providing all the required information.
large-datacap-requests[bot] commented 8 months ago

RootKeyHolders have approved multisig account. You can now request first datacap release

large-datacap-requests[bot] commented 4 months ago

RootKeyHolders have approved multisig account. You can now request first datacap release