hputnam / Becker_E5

3 stars 0 forks source link

Include accession numbers for PocHistone Reference Sequences. #9

Closed hputnam closed 4 months ago

hputnam commented 4 months ago

What are the Poc histone reference sequence NCBI accession numbers? They just say P53 and J001 in the tree.

daniellembecker commented 4 months ago

In supplemental table S3: https://docs.google.com/document/d/1YIEJETPUZyTXIqTthznRCbIT24bkYh1AcbmZiCqIvE0/edit

Screen Shot 2024-05-30 at 9 36 13 PM
hputnam commented 4 months ago

These are BioSample and BioProject numbers. How do you find the exact PocHistone fasta you need from here?

daniellembecker commented 4 months ago

they are located in this directory: https://github.com/hputnam/Becker_E5/tree/master/Bioinformatics/Data/PocHistone

hputnam commented 4 months ago

but we still need to show where we got them from. They are the refernece we are basing the genetic ID on and it needs to be just as clear as your mtORF reference nucleotide accession numbers

daniellembecker commented 4 months ago

edited supplemental table S2 to include the run SRR number that brings us to a page where the fasta files can be downloaded:

SRR5567678

SRR5567679

is this a better way to reference them?

hputnam commented 4 months ago

Yes, you need an NCBI nucleotide accession number that brings up the exact sequence.

Example mtORF = HQ378758

https://www.ncbi.nlm.nih.gov/nuccore/HQ378758

daniellembecker commented 4 months ago

Found NCBI nucleotide accession numbers for POC histone

Screen Shot 2024-05-31 at 12 04 45 PM