filecoin-project / filecoin-plus-large-datasets

Hub for client applications for DataCap at a large scale
110 stars 62 forks source link

[DataCap Application] Hello Decentralized #2081

Closed alexanderbkl closed 1 year ago

alexanderbkl commented 1 year ago

Data Owner Name

Hello Decentralized

What is your role related to the dataset

Other

Data Owner Country/Region

Spain

Data Owner Industry

Web3 / Crypto

Website

https://joinhello.app

Social Media

https://es.linkedin.com/company/hellostorage (Linkedin)

Total amount of DataCap being requested

10 PiB

Expected size of single dataset (one copy)

100 GiB

Number of replicas to store

4

Weekly allocation of DataCap requested

200 TiB

On-chain address for first allocation

f15r52r73xyl6bgvds55oibbeacm3qv2ve2ofr23a

Data Type of Application

Private Commercial/Enterprise

Custom multisig

No

Share a brief history of your project and organization

Hello Drive is a small startup that has a prototype and got pre-seed investments, it is an open-source, encrypted, user-controlled decentralized storage software designed for both web3 and traditional users. Data is securely stored across the decentralized infrastructure. Hello Drive is built on the principles of efficiency, sustainability, and decentralization, and that is why our infrastructure leverages unused storage space from current devices and constantly fine-tunes its efficiency, resulting in unmatched scalability. Sustainability is a key aspect, as our network doesn't require the creation of new servers, making us a carbon-neutral project.

Is this project associated with other projects/ecosystem stakeholders?

Yes

If answered yes, what are the other projects/ecosystem stakeholders

IPFS, Sia, Metamask

Describe the data being stored onto Filecoin

Clients will store their data on hot and cold storage. Hot storage will be linked to IPFS and/or Sia and cold storage is backed up in Filecoin SPs. When a user uploads its data, it is encrypted with AES-CBC end to end and has the option to share its data publicly or with specific terms (to a specific person, with a time range...).

Where was the data currently stored in this dataset sourced from

My own storage infra

If you answered "Other" in the previous question, enter the details here

n/a

How do you plan to prepare the dataset

lotus

If you answered "other/custom tool" in the previous question, enter the details here

n/a

Please share a sample of the data

User's personal data (images, videos, documents...).

Confirm that this is a public dataset that can be retrieved by anyone on the Network

No

If you chose not to confirm, what was the reason

n/a

What is the expected retrieval frequency for this data

Daily

For how long do you plan to keep this dataset stored on Filecoin

2 to 3 years

In which geographies do you plan on making storage deals

Europe

How will you be distributing your data to storage providers

I don't know yet

How do you plan to choose SP

Slack

If you answered "Others" in the previous question, what is the tool or platform you plan to use

n/a

If you already have a list of storage providers to work with, fill out their names and provider IDs below

n/a

How do you plan to make deals to your storage providers

Others/custom tool

If you answered "Others/custom tool" in the previous question, enter the details here

n/a

Can you confirm that you will follow the Fil+ guideline

Yes

Application created via filplus.storage

data-programs commented 1 year ago

This application requests a total of 10 PiB, so it’s labeled very large application

large-datacap-requests[bot] commented 1 year ago

Thanks for your request!

Heads up, you’re requesting more than the typical weekly onboarding rate of DataCap!
large-datacap-requests[bot] commented 1 year ago

Thanks for your request! Everything looks good. :ok_hand:

A Governance Team member will review the information provided and contact you back pretty soon.

Sunnyiscoming commented 1 year ago

When a user uploads its data, it is encrypted with AES-CBC end to end and has the option to share its data publicly or with specific terms (to a specific person, with a time range...).

  1. Whether the user data you will store is totally public? If it is encrypted, please apply for FIL+. If it is fully public, please provide enough data samples.

  2. Total amount of DataCap being requested≠Expected size of single dataset (one copy)Number of replicas to store 10 PiB≠100 GiB4

  3. Could you send an email to filplus-app-review@fil.org with your official domain in order to confirm your identity? Email name should includes the issue id #2081.

alexanderbkl commented 1 year ago

Thank you for your query Sunny!

To clarify, although our users' data is encrypted with AES-CBC end-to-end, they have the option to share their data publicly or with specific conditions (to a specific person, within a certain time range, etc.). This shared data can be accessed via a special link that our system generates. Therefore, while the data is encrypted, it can become publicly accessible if the user chooses to share it.

Considering this, our data falls in a category that is both private and public to a certain extent, depending on the individual user's choice. I believe applying for FIL+ is suitable for our case as it is partially encrypted. Due to the private nature of our user data, providing data samples might be a challenge but we can arrange some anonymized samples or synthetic data if required.

Regarding the DataCap, we request a total of 10 PiB. Initially, I mentioned that the size of a single dataset (one copy) would be around 100 GiB. Upon further analysis and considering the nature of the data we handle, it appears that a more accurate estimation would be 250 GiB per dataset. I apologize for the earlier misunderstanding. This refined estimation further justifies our DataCap request.

As for the question about sending an email to confirm identity, we will be proceeding with that step promptly.

I hope this clarifies your questions regarding our data and our DataCap request. Please let us know if there are any other details we can provide.

alexanderbkl commented 1 year ago

Hi!

I've just sent the email from our domain to proof our identity from team@joinhello.app email. Even though we bought and migrating to http://hello.storage, we still have https://joinhello.app.

Thanks in advance! @Sunnyiscoming

Sunnyiscoming commented 1 year ago

@kevzak

kevzak commented 1 year ago

Hi @alexanderbkl - can you explain the dataset a bit more. What type of data will be stored by your clients?

Also, can you prove the dataset size? As I'm reading, 250GiB x 4 =1 Terabyte not 10PiB. Can you re-explain the math behind the DataCap request?

kevzak commented 1 year ago

Because this is regarding private user data, the guidelines call for use of E-Fil+ pathway which includes additional upfront checks of the Applicant, Business and Data. See details here: https://efilplus.super.site/

kevzak commented 1 year ago

Please let me know if you have any questions about application process @alexanderbkl

github-actions[bot] commented 1 year ago

This application has not seen any responses in the last 10 days. This issue will be marked with Stale label and will be closed in 4 days. Comment if you want to keep this application open.

github-actions[bot] commented 1 year ago

This application has not seen any responses in the last 14 days, so for now it is being closed. Please feel free to contact the Fil+ Gov team to re-open the application if it is still being processed. Thank you!