Closed Sunkistn closed 1 year ago
Thanks for your request! Everything looks good. :ok_hand:
A Governance Team member will review the information provided and contact you back pretty soon.
Please provide relevant samples and state exactly what you will store with 5 PiB of DataCap. All information must be provided upfront and cannot be provided "later".
http://millionsongdataset.com/ https://github.com/mdeff/fma https://freemusicarchive.org/ https://www.kaggle.com/imsparsh/musicnet-dataset
"Later" means that new data will be added if the business grows in the future. So far, the sources listed above are all of our data sources.
@raghavrmadya The previous application passed review and had reached the third round of DataCap allocation. It was resubmitted from a new account only because the original GitHub account was flagged.
@galen-mcandrew @Kevin-FF-USA @raghavrmadya This is the previous application information:
Why are you requesting 5 PiB when you have already received 200 TiB?
"'Later' means that new data will be added if the business grows in the future." This is not acceptable. You can only apply for the amount of DataCap you need today, not for future projections.
Who are the SPs you are working with currently? Please share their SP IDs and request them to confirm by commenting on this application
@raghavrmadya I have updated the application form. SP information can be queried here: https://filplus.info/allocation_record?client_address=f1ju5oabz45ceog6e7k5omdj56uspv2pzgghiyzdy&obj=eyJuYW1lIjoiU1lOQyBMSVZFIEpBUEFOIElOQy4iLCJpc3N1ZV9udW1iZXIiOiIxMjMifQ%3D%3D I have asked the SPs to comment here, but not all of them have GitHub accounts.
- http://millionsongdataset.com/: the entire dataset is 280 GB.
- https://github.com/mdeff/fma: 917 GB.
- https://freemusicarchive.org/: All songs are free, but not all of them can be used in videos, podcasts, short films, or commercial projects (https://freemusicarchive.org/royalty-free-music/). Can you explain which part of the music you will store, provide proof of that, and state how much data it is?
- https://www.kaggle.com/imsparsh/musicnet-dataset: 33.46 GB.

If you source your data only from the above four sites, it is difficult to show that you need 5 PB of storage. Please explain further.

Any update here?
@Sunnyiscoming We also have the following data sources:
- https://registry.opendata.aws/pacific-sound/, 140 TiB
- https://registry.opendata.aws/elp-nouabale-landscape/, 60 TiB

We plan to store 10 copies. Sorry for the late reply.
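The sizes quoted so far can be totalled to see what 10 copies would occupy. This is only a rough sketch: it assumes that "GB" means decimal gigabytes and "TiB" means binary tebibytes, and it leaves out freemusicarchive.org because no size was given for it in the thread.

```python
# Sizes as quoted in the thread; unit interpretation is an assumption.
GB = 10**9    # decimal gigabyte (assumed)
TIB = 2**40   # binary tebibyte
PIB = 2**50   # binary pebibyte

sources_bytes = {
    "millionsongdataset.com": 280 * GB,
    "github.com/mdeff/fma": 917 * GB,
    "kaggle musicnet-dataset": 33.46 * GB,
    "pacific-sound": 140 * TIB,
    "elp-nouabale-landscape": 60 * TIB,
    # freemusicarchive.org is omitted: no size was stated for it.
}
COPIES = 10  # "we plan to store 10 copies"

unique_bytes = sum(sources_bytes.values())
stored_pib = unique_bytes * COPIES / PIB

print(f"unique data: {unique_bytes / TIB:.2f} TiB")
print(f"with {COPIES} copies: {stored_pib:.2f} PiB")
```

Under these assumptions the listed sources come to just under 2 PiB once replicated 10 times.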
- Total DataCap requested: 4.7PiB
- Expected weekly DataCap usage rate: 100TiB
- Client address: f1ju5oabz45ceog6e7k5omdj56uspv2pzgghiyzdy
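As a quick sanity check on these two figures, the sketch below estimates how long the requested DataCap would last at the stated weekly rate. It assumes a constant usage rate and 1 PiB = 1024 TiB; the variable names are illustrative only.

```python
TIB_PER_PIB = 1024

total_requested_tib = 4.7 * TIB_PER_PIB  # "Total DataCap requested: 4.7PiB"
weekly_usage_tib = 100                   # "Expected weekly DataCap usage rate: 100TiB"

# Assumes the weekly rate stays constant for the whole allocation.
weeks_to_consume = total_requested_tib / weekly_usage_tib
print(f"about {weeks_to_consume:.1f} weeks at a constant rate")
```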
f02049625
f1ju5oabz45ceog6e7k5omdj56uspv2pzgghiyzdy
50TiB
6c2ffb09-cdae-4488-908a-30ad4eb8ee00
SYNC LIVE JAPAN INC.
f1ju5oabz45ceog6e7k5omdj56uspv2pzgghiyzdy
The below table shows the distribution of storage providers that have stored data for this client.
If this is the first time a provider takes a verified deal, it will be marked as new.
For most DataCap applications, the restrictions below should apply.
⚠️ 46.17% of total deal sealed by f01114587 are duplicate data.
⚠️ 37.50% of total deal sealed by f0867300 are duplicate data.
⚠️ 37.50% of total deal sealed by f0522948 are duplicate data.
⚠️ 37.50% of total deal sealed by f01227975 are duplicate data.
⚠️ 37.50% of total deal sealed by f01228000 are duplicate data.
⚠️ 37.50% of total deal sealed by f01228008 are duplicate data.
⚠️ 45.04% of total deal sealed by f0694908 are duplicate data.
⚠️ f0694908 has unknown IP location.
⚠️ f01075159 has unknown IP location.
⚠️ f0867429 has unknown IP location.
⚠️ f01016239 has unknown IP location.
Provider | Location | Total Deals Sealed | Percentage | Unique Data | Duplicate Deals |
---|---|---|---|---|---|
f01114587 (new) | Tokyo, Tokyo, JP | 57.97 TiB | 16.56% | 31.21 TiB | 46.17% |
f0867300 | Tokyo, Tokyo, JP | 40.00 TiB | 11.43% | 25.00 TiB | 37.50% |
f0522948 | Singapore, Singapore, SG | 40.00 TiB | 11.43% | 25.00 TiB | 37.50% |
f01227975 | Hong Kong, Central and Western, HK | 40.00 TiB | 11.43% | 25.00 TiB | 37.50% |
f01228000 | Seoul, Seoul, KR | 40.00 TiB | 11.43% | 25.00 TiB | 37.50% |
f01228008 | Sydney, New South Wales, AU | 40.00 TiB | 11.43% | 25.00 TiB | 37.50% |
f0694908 (new) | Unknown | 20.47 TiB | 5.85% | 11.25 TiB | 45.04% |
f01075159 (new) | Unknown | 11.34 TiB | 3.24% | 11.34 TiB | 0.00% |
f0867429 (new) | Unknown | 10.52 TiB | 3.01% | 9.90 TiB | 5.94% |
f01016255 (new) | Grimstad, Agder, NO | 10.49 TiB | 3.00% | 9.90 TiB | 5.66% |
f01228009 | Hong Kong, Central and Western, HK | 10.00 TiB | 2.86% | 10.00 TiB | 0.00% |
f01228065 | Singapore, Singapore, SG | 10.00 TiB | 2.86% | 10.00 TiB | 0.00% |
f0867298 (new) | Dehiwala-Mount Lavinia, Western, LK | 9.90 TiB | 2.83% | 9.90 TiB | 0.00% |
f01016239 (new) | Unknown | 9.27 TiB | 2.65% | 9.27 TiB | 0.00% |
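The "Duplicate Deals" column appears consistent with the share of sealed data that is not unique. The sketch below recomputes it from the rounded table values for a few rows. The formula is an assumption inferred from the numbers, not the checker's documented method, and `duplicate_pct` is a hypothetical helper name.

```python
def duplicate_pct(total_tib: float, unique_tib: float) -> float:
    """Share of sealed data that is not unique, in percent (assumed formula)."""
    return (total_tib - unique_tib) / total_tib * 100

# (total deals sealed, unique data) in TiB, copied from the table above.
rows = {
    "f01114587": (57.97, 31.21),
    "f0867300": (40.00, 25.00),
    "f0694908": (20.47, 11.25),
    "f01075159": (11.34, 11.34),
}

for provider, (total, unique) in rows.items():
    print(f"{provider}: {duplicate_pct(total, unique):.2f}% duplicate")
```

Because the inputs are already rounded to two decimals, the recomputed values can differ from the reported ones in the last digit.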
The table below shows how much unique data is replicated across storage providers.
✔️ Data replication looks healthy.
Unique Data Size | Total Deals Made | Number of Providers | Deal Percentage |
---|---|---|---|
31.30 TiB | 46.82 TiB | 1 | 13.38% |
1.25 TiB | 3.75 TiB | 2 | 1.07% |
640.00 GiB | 2.44 TiB | 3 | 0.70% |
19.27 TiB | 96.97 TiB | 4 | 27.71% |
25.00 TiB | 200.00 TiB | 5 | 57.15% |
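The "Deal Percentage" column looks like each row's share of all deals made. A minimal sketch that recomputes it from the rounded "Total Deals Made" values follows. This reading of the column is an assumption based on the figures, not on the checker's documentation.

```python
# Number of providers -> total deals made (TiB), copied from the table above.
total_deals_tib = {1: 46.82, 2: 3.75, 3: 2.44, 4: 96.97, 5: 200.00}

grand_total = sum(total_deals_tib.values())

# Each row's share of all deals, in percent (assumed meaning of the column).
deal_pct = {
    providers: tib / grand_total * 100
    for providers, tib in total_deals_tib.items()
}

for providers, pct in deal_pct.items():
    print(f"{providers} provider(s): {pct:.2f}%")
```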
The table below shows how much unique data is shared with other clients. Different applications usually own different data, so their data should not resolve to the same CID.
⚠️ CID sharing has been observed.
Other Client | Application | Total Deals Affected | Unique CIDs | Verifier |
---|---|---|---|---|
f3wgfwtrs5p6jrkwfl2mksqa2ivgbgdjjrhjbefy3n7qzvotc3y6sazmp5gfyj7um6jlgdvlbiepzawnc6wxtq | FileDrive Labs | 166.92 TiB | 700 | LDN v3 multisig |
f1pkrmygbvweykpjcut36lf7ewgqdfhjklbhvepda | Protocol Labs ( project: Slingshot Evergreen ) | 400.00 GiB | 13 | LDN # 293 |
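To illustrate what "CID sharing" means, the sketch below groups deals by CID and reports any CID that appears under more than one client. It is not the checker's implementation. The client names, the CIDs, and the `shared_cids` helper are all made up for the example.

```python
from collections import defaultdict

def shared_cids(deals):
    """Map each CID seen under more than one client to the set of those clients.

    `deals` is an iterable of (client, cid) pairs (hypothetical input shape).
    """
    clients_by_cid = defaultdict(set)
    for client, cid in deals:
        clients_by_cid[cid].add(client)
    return {
        cid: clients
        for cid, clients in clients_by_cid.items()
        if len(clients) > 1
    }

# Made-up sample data: "cid-2" is stored under two different clients.
deals = [
    ("client-a", "cid-1"),
    ("client-a", "cid-2"),
    ("client-b", "cid-2"),
    ("client-b", "cid-3"),
]

print(shared_cids(deals))
```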
[^1]: To manually trigger this report, add a comment with text checker:manualTrigger
Hi, please explain the abnormal information. Some SPs have unknown IP locations.
These are SPs that I collaborated with around January 2022. However, my GitHub account was banned for a long time afterwards, which prevented me from obtaining datacap, resulting in the termination of our collaboration. There is no plan to continue working with them in the future.
checker:manualTrigger
⚠️ 7 storage providers sealed too much duplicate data - f01114587: 46.17%, f01227975: 37.50%, f01228000: 37.50%, f01228008: 37.50%, f0522948: 37.50%, f0867300: 37.50%, f0694908: 45.04%
⚠️ 2 storage providers have unknown IP location - f0694908, f01075159
✔️ Data replication looks healthy.
⚠️ CID sharing has been observed. (Top 3)
[^1]: To manually trigger this report, add a comment with text checker:manualTrigger
[^2]: Deals from those addresses are combined into this report as they are specified with checker:manualTrigger
[^3]: To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...
Click here to view the full report.
@Sunkistn
The page cannot be found.
@Alex11801 This link appears to expire periodically, please check for the latest link: https://filplus.info/allocation_record?client_address=f1ju5oabz45ceog6e7k5omdj56uspv2pzgghiyzdy&obj=%7B%22name%22%3A%22SYNC%20LIVE%20JAPAN%20INC.%22,%22issue_number%22%3A%22123%22%7D
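The two filplus.info links posted in this thread differ only in how the `obj` query parameter is encoded: the earlier one carries base64 text, the later one percent-encoded JSON. The sketch below decodes both with the Python standard library to show that they carry the same payload. `decode_obj` is a hypothetical helper, and the fallback from plain JSON to base64 is an assumption about how the site encodes the parameter.

```python
import base64
import json
from urllib.parse import parse_qs, urlsplit

def decode_obj(url: str) -> dict:
    """Return the JSON carried in the `obj` query parameter of a link."""
    # parse_qs already undoes percent-encoding (%7B -> {, %3D -> =).
    raw = parse_qs(urlsplit(url).query)["obj"][0]
    try:
        return json.loads(raw)                    # percent-encoded JSON form
    except json.JSONDecodeError:
        return json.loads(base64.b64decode(raw))  # base64 form (assumed)

BASE = "https://filplus.info/allocation_record?client_address=f1ju5oabz45ceog6e7k5omdj56uspv2pzgghiyzdy&obj="
old_link = BASE + "eyJuYW1lIjoiU1lOQyBMSVZFIEpBUEFOIElOQy4iLCJpc3N1ZV9udW1iZXIiOiIxMjMifQ%3D%3D"
new_link = BASE + "%7B%22name%22%3A%22SYNC%20LIVE%20JAPAN%20INC.%22,%22issue_number%22%3A%22123%22%7D"

print(decode_obj(old_link))
print(decode_obj(new_link))
```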
checker:manualTrigger
⚠️ All retrieval success ratios are below 1%.
⚠️ 7 storage providers sealed too much duplicate data - f01114587: 46.17%, f01227975: 37.50%, f01228000: 37.50%, f01228008: 37.50%, f0522948: 37.50%, f0867300: 37.50%, f0694908: 45.04%
⚠️ 2 storage providers have unknown IP location - f0694908, f01075159
✔️ Data replication looks healthy.
⚠️ CID sharing has been observed. (Top 3)
[^1]: To manually trigger this report, add a comment with text checker:manualTrigger
[^2]: Deals from those addresses are combined into this report as they are specified with checker:manualTrigger
[^3]: To manually trigger this report with deals from other related addresses, add a comment with text checker:manualTrigger <other_address_1> <other_address_2> ...
Click here to view the CID Checker report. Click here to view the Retrieval report.
This application has not seen any responses in the last 10 days. This issue will be marked with Stale label and will be closed in 4 days. Comment if you want to keep this application open.
This application has not seen any responses in the last 14 days, so for now it is being closed. Please feel free to contact the Fil+ Gov team to re-open the application if it is still being processed. Thank you!
Thanks for your request! :exclamation: We have found some problems in the information provided.
- We could not find Website / Social Media field in the information provided
- We could not find Total amount of DataCap being requested (between 500 TiB and 5 PiB) field in the information provided
- We could not find Weekly allocation of DataCap requested (usually between 1-100TiB) field in the information provided
- We could not find On-chain address for first allocation field in the information provided
- We could not find Data Type of Application field in the information provided
Please, take a look at the request and edit the body of the issue providing all the required information.
RootKeyHolders have approved multisig account. You can now request first datacap release
Large Dataset Notary Application
To apply for DataCap to onboard your dataset to Filecoin, please fill out the following.
Core Information
Please respond to the questions below by replacing the text saying "Please answer here". Include as much detail as you can in your answer.
Project details
Share a brief history of your project and organization.
What is the primary source of funding for this project?
What other projects/ecosystem stakeholders is this project associated with?
Use-case details
Describe the data being stored onto Filecoin
Where was the data in this dataset sourced from?
Can you share a sample of the data? A link to a file, an image, a table, etc., are good ways to do this.
Confirm that this is a public dataset that can be retrieved by anyone on the Network (i.e., no specific permissions or access rights are required to view the data).
What is the expected retrieval frequency for this data?
For how long do you plan to keep this dataset stored on Filecoin?
DataCap allocation plan
In which geographies (countries, regions) do you plan on making storage deals?
How will you be distributing your data to storage providers? Is there an offline data transfer process?
How do you plan on choosing the storage providers with whom you will be making deals? This should include a plan to ensure the data is retrievable in the future both by you and others.
How will you be distributing deals across storage providers?
Do you have the resources/funding to start making deals as soon as you receive DataCap? What support from the community would help you onboard onto Filecoin?