change label for CDS coordinates

mah11 commented 8 years ago

To avoid confusion about the original meaning (and any other interpretations, correct or incorrect) of "CDS", change the label to "start and end coordinates". Even if "CDS" would be technically correct, enough people don't realize it so changing to something ploddingly unambiguous won't go wrong.

kimrutherford commented 7 years ago

I guess this is from V1?

Currently in V2 we just have "II, 1500197-1502095 (1899nt)" with no CDS bit.

Would "start and end translation coordinates" be clearer?

ValWood commented 7 years ago

Hmm, I still think CDS is the correct way to refer to the coding sequence genome coordinates.

The problem is that people (incorrectly IMHO) are expecting the number 1899nt to represent the translation lenght. However it does't if it's the start and end of a spliced gene in the genome.

The CDS length (or the number we report) is the entire length of the coding sequence with introns in the genome, and I think that is correct.

The problem is how to explain this.....

ValWood commented 7 years ago

Maybe it should be the translated length in nucleotides. I think the meaning has changed over time. it used to be "from coding DNA sequence", which to me would make the CDS length the start and end of the CDS in the DNA not the start and end of the edited sequence.

The edited sequence seems to be what people expect. So we could report the nucleotide length of the translation in this case....

ValWood commented 7 years ago

Genomic location   II, 1500197-1502095 (1899nt) coding start to stop
                       1500197-1502095 (1899nt) including UTRs

mah11 commented 6 years ago

If we just stick with the current single set of coordinates, its current label ("genomic location") is fine. If you want to show with and without UTRs, the version from Jun 1 above (https://github.com/pombase/website/issues/59#issuecomment-305459606) would do.

kimrutherford commented 6 years ago

What text should we have for RNA genes and pseudogenes?

III, 2111204-2116520 (5317nt)

kimrutherford commented 6 years ago

III, 2111204-2116520 (5317nt)

Sorry, hit submit too soon. For now I've implemented it like "III, 2111204-2116520 (5317nt)" for genes without a translation, which is what we have at the moment. Is that enough in that case?

kimrutherford commented 6 years ago

For now I've implemented it like ...

It's on the main site now. Is it OK?