Closed dankolbman closed 5 years ago
Moved to https://github.com/kids-first/kf-api-dataservice/issues/484
Expanding on this, as I see it currently...
(nb. I'm using access_url instead of gen3_url to preserve generality)
Short term: s3:// file urls should be substituted with https://
Medium term:
https://data.kidsfirst.yadayada/3b82fad9-55da-402f-a446-c86029720ff3 or https://api.gdc.cancer.gov/data/3b82fad9-55da-402f-a446-c86029720ff3 but not s3://kf-study-buckets-lol/3b82fad9-55da-402f-a446-c86029720ff3.bam
Yes? No? Maybe?
@fiendish lets move the medium term to a new issue and include some example requests/responses? Maybe a design document, if that feels more natural.
We also may consider instead storing the data
url in the urls
array for external files. That is, use the urls routing to /data/<uuid>
. This is the true url that one would go to in order to download the actual file, as well as the url that the portal will route to.
urls
:["s3://kf-study-buckets-lol/3b82fad9-55da-402f-a446-c86029720ff3.bam"]
access_urls
:["https://gen3.kidsfirst.com/data/3b82fad9-55da-402f-a446-c86029720ff3"]
urls
:["s3://kf-study-buckets-lol/3b82fad9-55da-402f-a446-c86029720ff3.bam"]
access_urls
:["https://gen3.kidsfirst.com/index/index/3b82fad9-55da-402f-a446-c86029720ff3"]
Currently, the genomic file
urls
field returns the list of file locations from gen3. These are raw file locations in the case that we own the file, however, there are some files that live within another gen3 deployment.We should unify these to all be urls pointing to gen3 deployments and return them under
access_urls
. In the current situation, this means that all urls to files in ours3
buckets should be replaced with urls.