Lind111 / EF

2 stars 3 forks source link

[DataCap Application] <Hami Melon> - <Enterprise Data Backup> #59

Closed mangobabyme closed 1 week ago

mangobabyme commented 1 month ago

Data Owner Name

Hami Melon

Data Owner Country/Region

Hong Kong

Data Owner Industry

IT & Technology Services

Website

-

Social Media Handle

@mangobabyme

Social Media Type

Other

What is your role related to the dataset

Dataset Owner

Total amount of DataCap being requested

5PiB

Expected size of single dataset (one copy)

650TiB

Number of replicas to store

8

Weekly allocation of DataCap requested

1PiB

On-chain address for first allocation

f14w5nqjqdvxjz7a7hhqlbwnyt6jwnv6cs6zelauy

Data Type of Application

Public, Open Commercial/Enterprise

Custom multisig

Identifier

No response

Share a brief history of your project and organization

Company Profile

Hami Melon Technology Ltd is a technology innovation company specializing in smart hardware and IoT solutions, headquartered in Hong Kong. The company is dedicated to the deep integration of cutting-edge artificial intelligence algorithms with practical hardware systems, delivering efficient, reliable, and cost-effective technological products that serve a wide range of sectors including smart agriculture, urban management, and security surveillance.

The name “Hami Melon” symbolizes the harmonious coexistence of nature and technology. It represents our mission to build a flexible bridge between complex data and real-world scenarios — like a sweet and intelligent “fruit” that gathers, transmits, and analyzes the most valuable data for users.

Our core products include:
    •   Drone-based Data Collection Systems (supporting high-precision mapping and crop monitoring)
    •   Edge Computing Gateways and Sensor Networks (suitable for agricultural and industrial applications)
    •   Blockchain-based Data Archiving and Verification Solutions (leveraging decentralized protocols such as Filecoin)
    •   AI-powered Image Recognition and Intelligent Decision-Making Systems

At Hami Melon Technology, we firmly believe that “Technology should not be cold.” We are committed to creating solutions that address real-world needs and deliver long-term value for our partners. Currently, the company has established partnerships with government agencies and agricultural enterprises across mainland China and Southeast Asia, progressively building a region-wide intelligent sensing network.

Is this project associated with other projects/ecosystem stakeholders?

No

If answered yes, what are the other projects/ecosystem stakeholders

Describe the data being stored onto Filecoin

1.  Raw Image and Video Data Collected by Drones
Used in scenarios such as agricultural monitoring, high-precision map creation, and disaster assessment. Distributed storage ensures the long-term preservation and secure access of large-volume, multi-source data.
    2.  Environmental and Crop Data Collected by Edge Sensors
Includes temperature, humidity, light intensity, soil nutrients, pest and disease detection, etc. The data is generated continuously and requires high temporal accuracy. Filecoin is used for archival storage, enabling traceability and verification of the data.
    3.  Intermediate and Output Data from AI Training and Inference Processes
Covers image annotation datasets, training samples, and inference results, facilitating model optimization and performance traceability.
    4.  Enterprise Operations and Regulatory Compliance Data Backup
Includes device operation logs, client reports, and project progress documents, which are shared with partners or submitted for regulatory review.
    5.  Original Data and Hash Files for On-chain Storage and Proof
Supports the construction of a “trusted data system” based on Filecoin, ensuring that key business data is tamper-proof and verifiable for integrity.

Where was the data currently stored in this dataset sourced from

My Own Storage Infra

If you answered "Other" in the previous question, enter the details here

If you are a data preparer. What is your location (Country/Region)

Hong Kong

If you are a data preparer, how will the data be prepared? Please include tooling used and technical details?

1.  Image and Video Preprocessing: Mainly using OpenCV for image cropping, rotation, and format conversion.
    2.  Data Annotation: Primarily using Label Studio or CVAT for labeling and annotation.
    3.  File Compression and Packaging: Utilizing tar + zstd (or gzip) to package data, improving upload efficiency.
    4.  Data Validation and Pre-Sealing Preparation: Relying on CID generation tools such as Lotus or Boost.

If you are not preparing the data, who will prepare the data? (Provide name and business)

Has this dataset been stored on the Filecoin network before? If so, please explain and make the case why you would like to store this dataset again to the network. Provide details on preparation and/or SP distribution.

No, because we are an early-stage startup team, we still hope to reduce costs by using decentralized storage.

Please share a sample of the data

https://8.152.212.134:39405/down/5LBnwMvZMBLl.rar

Confirm that this is a public dataset that can be retrieved by anyone on the Network

If you chose not to confirm, what was the reason

What is the expected retrieval frequency for this data

Never

For how long do you plan to keep this dataset stored on Filecoin

Permanently

In which geographies do you plan on making storage deals

Greater China

How will you be distributing your data to storage providers

HTTP or FTP server, IPFS

How did you find your storage providers

Slack

If you answered "Others" in the previous question, what is the tool or platform you used

Please list the provider IDs and location of the storage providers you will be working with.

f03540144 - Philippines
f03540147 - Philippines
f03540153 - Singapore
f03336022 - United Kingdom
f03253411 - Shenzhen, Guangdong, CN
f03321215 - Shenzhen, Guangdong, CN

How do you plan to make deals to your storage providers

Boost client, Lotus client

If you answered "Others/custom tool" in the previous question, enter the details here

Can you confirm that you will follow the Fil+ guideline

Yes

datacap-bot[bot] commented 1 month ago

Application is waiting for allocator review

datacap-bot[bot] commented 1 month ago

Similarity Report

No similar applications found for the issue

Lind111 commented 1 month ago

Enterprise Data Backup

Are you sure this is publicly available data?

Lind111 commented 1 month ago

Could you send an email with business license to qq248245620@gmail.com in order to confirm your identity?The email should include the miner ID, the business entity, a list of sps locations you will be working with

Lind111 commented 1 month ago

In which geographies do you plan on making storage deals Greater China

Is there another place where SP stores the data?

mangobabyme commented 1 month ago

@Lind111 Yes, we are an early-stage startup company. Currently, our data is mainly used for public backups.

mangobabyme commented 1 month ago

In which geographies do you plan on making storage deals Greater China

Is there another place where SP stores the data?

Regarding this, we hope to perform backups across multiple regions, as this will provide greater security for our data.

We aim to have backups not only in the Greater China region but also in North America and other locations.

mangobabyme commented 1 month ago

qq248245620@gmail.com

Hello, I have already sent the email.

Lind111 commented 1 month ago

Expected size of single dataset (one copy) 400TiB

How to prove it

Lind111 commented 1 month ago

Image Confirmed company information

Lind111 commented 1 month ago

Please keep the SP table list up to date

mangobabyme commented 1 month ago

Please keep the SP table list up to date

@Lind111 Hello, I have already made the update in the form.

mangobabyme commented 1 month ago

Expected size of single dataset (one copy) 400TiB

How to prove it

We reviewed and reorganized the data and found that the volume is larger than what was originally submitted, so I have updated it accordingly.

At the moment, we are still in the process of consolidating the data, but I can share some screenshots of the data with you.

mangobabyme commented 1 month ago
Image

Hello, this is about half of our data. We are currently preparing it for sealing.

Lind111 commented 1 month ago

f03540153 - Singapore

Image

f03336022 - United Kingdom

Image

Suspected VPN use, can you explain?And send proof of geolocation file to qq248245620@gmail.com

mangobabyme commented 1 month ago

@Lind111 Hello, to ensure the security of the nodes’ outbound IP addresses, the SPs we work with do use VPNs to hide their actual outbound IPs.

However, we do have data center cooperation documents to verify the actual locations of the SPs.

mangobabyme commented 1 month ago

@Lind111

The email has been sent.

Lind111 commented 1 month ago

@Lind111 Hello, to ensure the security of the nodes’ outbound IP addresses, the SPs we work with do use VPNs to hide their actual outbound IPs.

However, we do have data center cooperation documents to verify the actual locations of the SPs.

Image

Confirmed geolocation information

Lind111 commented 1 month ago

Can you satisfy spark retrieval and store unseal files? Can you do it https://github.com/filecoin-project/Allocator-Governance/issues/125 . can you create index information? Of If you need help, please contact us.

mangobabyme commented 1 month ago

@Lind111

Yes, but I’ve just learned via Slack that Spark’s IPNI service is currently experiencing issues, so Spark is unable to provide accurate retrieval rates for our partnered nodes at the moment. However, we can provide indexing information — this will be available after the project has started and the data has been stored.

datacap-bot[bot] commented 1 month ago

Datacap Request Trigger

Total DataCap requested

5PiB

Expected weekly DataCap usage rate

1PiB

DataCap Amount - First Tranche

281474976710656 B

Client address

f14w5nqjqdvxjz7a7hhqlbwnyt6jwnv6cs6zelauy

datacap-bot[bot] commented 1 month ago

DataCap Allocation requested

Multisig Notary address

Client address

f14w5nqjqdvxjz7a7hhqlbwnyt6jwnv6cs6zelauy

DataCap allocation requested

256 TiB

Id

e0bed34e-a8fe-4564-9928-ad1422b0b380

datacap-bot[bot] commented 1 month ago

Application is ready to sign

datacap-bot[bot] commented 1 month ago

Request Approved

Your Datacap Allocation Request has been approved by the Notary

Message sent to Filecoin Network

bafy2bzacebiont2r5nklqno5tclk6yap47ihx6coayt4kdnkk7b42il2aoy56

Address

f14w5nqjqdvxjz7a7hhqlbwnyt6jwnv6cs6zelauy

Datacap Allocated

256 TiB

Signer Address

f1edi46m4ry7smqnk3p6nb4azmcythf4bl57lw3uy

Id

e0bed34e-a8fe-4564-9928-ad1422b0b380

You can check the status here https://filfox.info/en/message/bafy2bzacebiont2r5nklqno5tclk6yap47ihx6coayt4kdnkk7b42il2aoy56

datacap-bot[bot] commented 1 month ago

Application is Granted

Lind111 commented 1 month ago

checker:manualTrigger

datacap-bot[bot] commented 1 month ago

DataCap Client Report Summary [^1]

Client address: f14w5nqjqdvxjz7a7hhqlbwnyt6jwnv6cs6zelauy Client ID: f03604159 Report ID: 48048 Generated at: Tue, 27 May 2025 03:03:21 GMT (3 hours ago) [^2]

Report checks

⚠️All storage providers are located in the same region

✔️ Storage provider distribution looks healthy

✔️ Storage provider duplication looks healthy

✔️ Storage provider locations looks healthy

✔️ Storage provider zero retrievability looks healthy

✔️ Storage provider retrievability looks healthy

✔️ Low replica percentage is 0.00%

✔️ No CID sharing has been observed

✔️ Storage providers IPNI reporting looks healthy (1/2)

✔️ Storage providers IPNI reporting looks healthy (2/2)

✔️ Client receiving datacap from one allocator

✔️ 0.00% of deals have less than allocator-defined 4+ replicas

Full report

Click here to view the full report [^1]: To manually trigger this report, add a comment with text checker:manualTrigger [^2]: New report will be generated only if the latest one is older than 30 hours

datacap-bot[bot] commented 1 month ago

Client used 75% of the allocated DataCap. Consider allocating next tranche.

Lind111 commented 1 month ago

checker:manualTrigger

datacap-bot[bot] commented 1 month ago

DataCap Client Report Summary [^1]

Client address: f14w5nqjqdvxjz7a7hhqlbwnyt6jwnv6cs6zelauy Client ID: f03604159 Report ID: 48529 Generated at: Wed, 28 May 2025 03:05:54 GMT (4 hours ago) [^2]

Report checks

✔️ Storage providers are located in different regions

⚠️1 storage providers sealed more than 25% of total datacap

✔️ Storage provider duplication looks healthy

✔️ Storage provider locations looks healthy

✔️ Storage provider zero retrievability looks healthy

⚠️100.00% of storage providers have retrieval success rate less than 75%

⚠️Low replica percentage is 99.98%

✔️ No CID sharing has been observed

✔️ Storage providers IPNI reporting looks healthy (1/2)

✔️ Storage providers IPNI reporting looks healthy (2/2)

✔️ Client receiving datacap from one allocator

⚠️99.98% of deals have less than allocator-defined 4+ replicas

Full report

Click here to view the full report [^1]: To manually trigger this report, add a comment with text checker:manualTrigger [^2]: New report will be generated only if the latest one is older than 30 hours

Lind111 commented 1 month ago

Considering that the first round of data hasn't been completely tallied so far, could you please tell me what your plans are for the next round?

mangobabyme commented 1 month ago

@Lind111 f03540144 - Philippines f03540147 - Philippines f03540153 - Singapore f03336022 - United Kingdom

Hello, for now we will continue working with these four nodes. When more allocation is granted, we plan to bring in additional SPs, and we are currently in discussions regarding those partnerships.

By the time the third round begins, we will add at least two more nodes for data backup and replication.

datacap-bot[bot] commented 1 month ago

Application is in Refill

datacap-bot[bot] commented 1 month ago

Request Approved

Your Datacap Allocation Request has been approved by the Notary

Message sent to Filecoin Network

bafy2bzaceaqmjs3ezn6uchgk4yl2etjpklxo457r5umozs4y6ufsfrh2xocuq

Address

f14w5nqjqdvxjz7a7hhqlbwnyt6jwnv6cs6zelauy

Datacap Allocated

512 TiB

Signer Address

f1edi46m4ry7smqnk3p6nb4azmcythf4bl57lw3uy

Id

2b8b9297-a4dc-4279-80b9-6a8220daf219

You can check the status here https://filfox.info/en/message/bafy2bzaceaqmjs3ezn6uchgk4yl2etjpklxo457r5umozs4y6ufsfrh2xocuq

datacap-bot[bot] commented 1 month ago

Application is Granted

Lind111 commented 1 month ago

checker:manualTrigger

datacap-bot[bot] commented 1 month ago

DataCap Client Report Summary [^1]

Client address: f14w5nqjqdvxjz7a7hhqlbwnyt6jwnv6cs6zelauy Client ID: f03604159 Report ID: 48529 Generated at: Wed, 28 May 2025 03:05:54 GMT (23 hours ago) [^2]

Report checks

✔️ Storage providers are located in different regions

⚠️1 storage providers sealed more than 25% of total datacap

✔️ Storage provider duplication looks healthy

✔️ Storage provider locations looks healthy

✔️ Storage provider zero retrievability looks healthy

⚠️100.00% of storage providers have retrieval success rate less than 75%

⚠️Low replica percentage is 99.98%

✔️ No CID sharing has been observed

✔️ Storage providers IPNI reporting looks healthy (1/2)

✔️ Storage providers IPNI reporting looks healthy (2/2)

✔️ Client receiving datacap from one allocator

⚠️99.98% of deals have less than allocator-defined 4+ replicas

Full report

Click here to view the full report [^1]: To manually trigger this report, add a comment with text checker:manualTrigger [^2]: New report will be generated only if the latest one is older than 30 hours

datacap-bot[bot] commented 1 month ago

Client used 75% of the allocated DataCap. Consider allocating next tranche.

Lind111 commented 1 month ago

checker:manualTrigger

datacap-bot[bot] commented 1 month ago

DataCap Client Report Summary [^1]

Client address: f14w5nqjqdvxjz7a7hhqlbwnyt6jwnv6cs6zelauy Client ID: f03604159 Report ID: 49296 Generated at: Fri, 30 May 2025 03:06:04 GMT (14 hours ago) [^2]

Report checks

✔️ Storage providers are located in different regions

⚠️3 storage providers sealed more than 25% of total datacap

✔️ Storage provider duplication looks healthy

✔️ Storage provider locations looks healthy

✔️ Storage provider zero retrievability looks healthy

⚠️100.00% of storage providers have retrieval success rate less than 75%

⚠️Low replica percentage is 62.87%

✔️ No CID sharing has been observed

✔️ Storage providers IPNI reporting looks healthy (1/2)

✔️ Storage providers IPNI reporting looks healthy (2/2)

✔️ Client receiving datacap from one allocator

⚠️62.87% of deals have less than allocator-defined 4+ replicas

Full report

Click here to view the full report [^1]: To manually trigger this report, add a comment with text checker:manualTrigger [^2]: New report will be generated only if the latest one is older than 30 hours

Lind111 commented 1 month ago

checker:manualTrigger

datacap-bot[bot] commented 1 month ago

DataCap Client Report Summary [^1]

Client address: f14w5nqjqdvxjz7a7hhqlbwnyt6jwnv6cs6zelauy Client ID: f03604159 Report ID: 50101 Generated at: Sun, 01 Jun 2025 03:07:36 GMT (9 hours ago) [^2]

Report checks

✔️ Storage providers are located in different regions

✔️ Storage provider distribution looks healthy

✔️ Storage provider duplication looks healthy

✔️ Storage provider locations looks healthy

✔️ Storage provider zero retrievability looks healthy

⚠️100.00% of storage providers have retrieval success rate less than 75%

⚠️Low replica percentage is 43.83%

✔️ No CID sharing has been observed

✔️ Storage providers IPNI reporting looks healthy (1/2)

✔️ Storage providers IPNI reporting looks healthy (2/2)

✔️ Client receiving datacap from one allocator

✔️ 43.83% of deals have less than allocator-defined 4+ replicas

Full report

Click here to view the full report [^1]: To manually trigger this report, add a comment with text checker:manualTrigger [^2]: New report will be generated only if the latest one is older than 30 hours

Lind111 commented 1 month ago

There was an undisclosed SP Can you explain this?

Lind111 commented 1 month ago

Tell me more about your next round.

mangobabyme commented 1 month ago

There was an undisclosed SP Can you explain this?

Sorry, due to the recent holiday, we didn’t manage to disclose the information in time. I will make sure to disclose it more promptly next time.

Below are the newly added nodes.

They all have good retrieval rates on Spark.

f03253411 Shenzhen, Guangdong, CN

f03321215 Shenzhen, Guangdong, CN

datacap-bot[bot] commented 1 month ago

Issue has been modified. Changes below:

(NEW vs OLD)

Please list the provider IDs and location of the storage providers you will be working with: f03540144 - Philippines f03540147 - Philippines f03540153 - Singapore f03336022 - United Kingdom f03253411 - Shenzhen, Guangdong, CN f03321215 - Shenzhen, Guangdong, CN vs f03540144 - Philippines f03540147 - Philippines f03540153 - Singapore f03336022 - United Kingdom State: ChangesRequested vs Granted

mangobabyme commented 1 month ago

Tell me more about your next round.

@Lind111 Our application includes a plan for 8 data backups. We are currently in discussions with two additional nodes. If the timing works out, we may be able to add these two backups in the next round to achieve our goal of 8 backups. I will make the disclosure in advance at that time. Thank you.

datacap-bot[bot] commented 1 month ago

Issue information change request has been approved.

datacap-bot[bot] commented 1 month ago

Application is in Refill

datacap-bot[bot] commented 1 month ago

Request Approved

Your Datacap Allocation Request has been approved by the Notary

Message sent to Filecoin Network

bafy2bzacedp2vgzl73xc5kls4e7wrrtgk6mozb7mn5ib5uuhm7sto6vlihass

Address

f14w5nqjqdvxjz7a7hhqlbwnyt6jwnv6cs6zelauy

Datacap Allocated

1.00 PiB

Signer Address

f1edi46m4ry7smqnk3p6nb4azmcythf4bl57lw3uy

Id

81e6ff5a-d84c-4694-9c4c-37a564490f14

You can check the status here https://filfox.info/en/message/bafy2bzacedp2vgzl73xc5kls4e7wrrtgk6mozb7mn5ib5uuhm7sto6vlihass