Project-MONAI / MONAI

AI Toolkit for Healthcare Imaging
https://monai.io/
Apache License 2.0

Add cloud data loading support #8173

Open KumoLiu opened 2 weeks ago

KumoLiu commented 2 weeks ago

This feature would enable the application to pull, process, and store data directly from cloud-based services, such as AWS, Google Cloud, and Azure. By integrating cloud data support, we aim to provide users with more flexibility and scalability in managing their data, especially when dealing with large datasets or distributed systems.

Implementation Considerations:
- API Integrations: We would need to support the storage APIs of the major cloud providers (e.g., Amazon S3, Azure Blob Storage, and Google Cloud Storage).
- Authentication and Security: Secure access management is critical, so we may need to integrate with cloud authentication mechanisms such as IAM.
- Data Formats and Compatibility: We should support multiple data formats (e.g., CSV, JSON, DICOM) to ensure compatibility with the various data types stored in the cloud.
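To make the API-integration point more concrete, here is a minimal sketch of one possible shape for this: a registry that maps a URI scheme (`s3`, `gs`, `file`, ...) to a function that opens the object for reading. All names here (`register_opener`, `open_remote`, `_open_local`) are hypothetical and not part of MONAI. Only a local `file` backend is implemented, using the standard library and assuming POSIX-style paths; cloud backends would be registered the same way.

```python
from typing import BinaryIO, Callable, Dict
from urllib.parse import urlparse

# Hypothetical registry: URI scheme -> function that opens the object for reading.
_OPENERS: Dict[str, Callable[[str], BinaryIO]] = {}


def register_opener(scheme: str, opener: Callable[[str], BinaryIO]) -> None:
    """Register a backend for a URI scheme such as "s3", "gs" or "file"."""
    _OPENERS[scheme] = opener


def open_remote(uri: str) -> BinaryIO:
    """Open `uri` with the backend registered for its scheme.

    A plain path without a scheme is treated as a local file.
    """
    scheme = urlparse(uri).scheme or "file"
    try:
        opener = _OPENERS[scheme]
    except KeyError:
        raise ValueError(f"no opener registered for scheme {scheme!r}") from None
    return opener(uri)


def _open_local(uri: str) -> BinaryIO:
    """Local backend: accepts both "file:///path" URIs and plain POSIX paths."""
    parsed = urlparse(uri)
    path = parsed.path if parsed.scheme == "file" else uri
    return open(path, "rb")


# Only the local backend is registered in this sketch.
register_opener("file", _open_local)
```

With this layout, a provider-specific backend (for example, one wrapping an S3 client) would be a single `register_opener("s3", ...)` call, and authentication could stay inside that backend rather than leaking into the data loading code. Whether a registry like this or an existing abstraction layer is the better fit is an open design question.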

ref:
https://github.com/webdataset/webdataset
https://github.com/mosaicml/streaming
https://aws.amazon.com/healthimaging/

Nic-Ma commented 2 days ago

Hi @ericspod, @KumoLiu,

Do you think it would be valuable to add support for AWS HealthImaging (AHI)? https://aws.amazon.com/healthimaging/?nc1=h_ls

Thanks.