In preparation for #50, we need to have a standard place to store cite-seq and barcode tag files, and reference them in the scpca-library-metadata.tsv file.
Since many of these are the same across multiple libraries, I am going to propose that we create a new barcodes directory within s3://ccdl-scpca-data to store each of the barcode index files. These could be named by the submitter with a suffix to differentiate the case where there might be more than one barcode library from a submitter. So something like smith-adt1.tsv for CITE-seq and smith-hash1.tsv for cell hashing. (Note that I am proposing .tsv files for compatibility with the Alevin-fry workflow and the --features flag: https://combine-lab.github.io/alevin-fry-tutorials/2021/af-feature-bc/)
In preparation for #50, we need to have a standard place to store cite-seq and barcode tag files, and reference them in the scpca-library-metadata.tsv file.
Since many of these are the same across multiple libraries, I am going to propose that we create a new
barcodes
directory withins3://ccdl-scpca-data
to store each of the barcode index files. These could be named by the submitter with a suffix to differentiate the case where there might be more than one barcode library from a submitter. So something likesmith-adt1.tsv
for CITE-seq andsmith-hash1.tsv
for cell hashing. (Note that I am proposing.tsv
files for compatibility with the Alevin-fry workflow and the--features
flag: https://combine-lab.github.io/alevin-fry-tutorials/2021/af-feature-bc/)