[DataCap Application] BitsAndBytes - NOAA Water columns archive #9

Open bitsandbytes03 opened 4 months ago

DataCap Applicant


Project ID


Data Owner Name

NOAA Water-Column Sonar Data Archive

Data Owner Country/Region

United States

Data Owner Industry

Life Science / Healthcare


What is your role related to the dataset

Data Preparer

Total amount of DataCap being requested


Expected size of single dataset (one copy)


Number of replicas to store


Weekly allocation of DataCap requested


On-chain address for first allocation


Data Type of Application

Public, Open Dataset (Research/Non-Profit)

Share a brief history of your project and organization

Bits&Bytes is a European organization that aims to make a significant impact on the decentralized storage space in the near future. We have access to our own computer rooms in Amsterdam and Belgium, capable of processing large volumes of data. Our team is well recognized in the Benelux internet scene and in other industries.

Describe the data being stored onto Filecoin

We will store the open dataset of water-column sonar data archived at the NOAA National Centers for Environmental Information, sourced from AWS. This is a NOAA-funded project to store water-column sonar data. These data, the acoustic backscatter from the near surface to the seafloor, are used to assess physical and biological characteristics of the ocean, including the spatial distribution of plankton, fish, methane seeps, and underwater oil plumes.

Where was the data currently stored in this dataset sourced from

AWS Cloud

If you are a data preparer. What is your location (Country/Region)


If you are a data preparer, how will the data be prepared? Please include tooling used and technical details?

We will use Singularity.

Has this dataset been stored on the Filecoin network before? If so, please explain and make the case why you would like to store this dataset again to the network. Provide details on preparation and/or SP distribution.

The dataset has not yet been stored in full on Filecoin and is not fully retrievable. We aim to store it in full and make it retrievable.

Please share a sample of the data

This will be the AWS bucket.

Confirm that this is a public dataset that can be retrieved by anyone on the Network

What is the expected retrieval frequency for this data


For how long do you plan to keep this dataset stored on Filecoin

More than 3 years

In which geographies do you plan on making storage deals

Greater China, Asia other than Greater China, North America, Europe, Australia (continent)

How will you be distributing your data to storage providers

HTTP or FTP server

How did you find your storage providers


Please list the provider IDs and location of the storage providers you will be working with.

RMM Global
Holon Australia, Melbourne 2 replicas
DSS Australis, Sydney 1 replica
Dcent, The Netherlands 1 replica

How do you plan to make deals to your storage providers

Boost client

Can you confirm that you will follow the Fil+ guideline


datacap-bot[bot] commented 4 months ago

Application is waiting for allocator review

Total DataCap requested


Expected weekly DataCap usage rate


DataCap Amount - First Tranche


Client address


Multisig Notary address

Client address


DataCap allocation requested




Datacap Allocated


Signer Address




You can check the status of the message here:

cryptowhizzard commented 4 months ago

Additional comment:

KYC has already been performed.

Bits & Bytes is a split-off of Dcent and will focus on future data distribution. They are known irl. Holon Australia is known irl and per contract. DSS Australis is known irl and per contract. For RMM Global, China, KYC was performed per video and contract.

Until now, the focus has been on the lead SP (Bits & Bytes) storing the unsealed and sealed copies of the first replica and keeping everything retrievable. RMM is only storing the sealed replica for security and backup.

gimims commented 4 months ago


DataCap and CID Checker Report Summary

Storage Provider Distribution

⚠️ All storage providers are located in the same region.

Deal Data Replication

✔️ Data replication looks healthy.

Deal Data Shared with other Clients

✔️ No CID sharing has been observed.

Full report

Click here to view the CID Checker report.

cryptowhizzard commented 4 months ago

Good morning,

Per request of the governance team:

f02366527, f02982293, f03064136 belong to Dcent. f03063130 and f03079759 belong to RMM.

As soon as DSS / Holon start sealing, I will update the miner IDs here.

