Open nevrome opened 4 months ago
The biosamples FAQ states
What pattern do BioSamples accessions follow?
BioSample accessions always begin with SAM. The next letter is either E or N or D depending if the sample information was originally submitted to EMBL-EBI or NCBI or DDBJ respectively. After that, there may be an A or a G to denote an Assay sample or a Group of samples. Finally there is a numeric component that may or may not be zero-padded.
This seems to match to the sample_accession
field in the .ssf file, which identifies sequencing entities, not "samples" in the Poseidon sense. Is this correct? If we already have this covered in the .ssf file then maybe we should not add it to the .janno file as well.
Yes, I think so too. Plus, we have Genetic_Source_Accession_IDs
in the janno, which allows to specify the ENA sample ID as well.
This recommendation was raised in the review of the Poseidon paper.