epam / cloud-pipeline

Cloud agnostic genomics analysis, scientific computation and storage platform
https://cloud-pipeline.com
Apache License 2.0
144 stars 58 forks source link

[S2] AWS Omics: Initiate Omics Import Job for the omics storage from UI #3474

Open SilinPavel opened 3 months ago

SilinPavel commented 3 months ago

AWS Omics store have capability to import files into it from s3 bucket, we also have such functionality on the server side, let's implement it on the UI as well.

Corresponding server API: GET /omcisstore/{id}/import Body:

{
  "serviceRoleArn": "string",
  "sources": [
    {
      "sourceFiles": {
        "source1": "string",
        "source2": "string"
      },
      "sourceFileType": "REFERENCE",
      "subjectId": "string",
      "sampleId": "string",
      "referencePath": "string",
      "name": "string",
      "description": "string",
      "generatedFrom": "string"
    }
  ]
}

name - required, string serviceRoleArn - optional, parameter if not present, server will get it from the region settings. Shouldn't be propagated on UI. sourceFileType - required, for Ref Store - REFERENCE for Seq Store - one of [FASTQ, BAM, UBAM, CRAM] subjectId - required, string sampleId - required, string referencePath - only for Seq Store, optional for FASTQ and UBAM, required for BAM CRAM, should be valid path from Ref Store registered in the system description - optional, string generatedFrom - optional, string sourceFiles - required, object with 2 fields. For Ref Store only source1 should be provided, source2 can be provided for FASTQ files

By navigating to the storage view and clicking a button Import in the right upper corner user should be able to start import procedure by filling out a form:

For Ref Store form should provide the next field to fill: name - string subjectId - string sampleId - string description - string generatedFrom - string sourceFiles- interactive control, allows user to navigate to any s3 bucket registered in the system and choose a file

For Seq Store form should provide the next field to fill: name - string subjectId - string sampleId - string sourceFileType - predefined list of FASTQ, BAM, UBAM, CRAM referencePath - interactive control, allows user to navigate to any s3 bucket registered in the system and choose a file description - string generatedFrom - string sourceFiles- interactive control, allows user to navigate to any s3 bucket registered in the system and choose a file, for FASTQ it should allow to choose 2 files