PoDP users fill the RefSeq_accession section with Assembly accession, e.g. the project shown below, the GCF_000515175.1 is actually an Assembly accession but not a RefSeq accession
RefSeq and Assembly accessions have different prefixes:
The assembly accession starts with a three letter prefix, GCA for GenBank assemblies and GCF for RefSeq assemblies. This is followed by an underscore and 9 digits.
Some ideas:
PoDP should validate the prefix to make sure the filled data is valid
PoDP should add Assembly section for Assembly data
PoDP schema shows that
RefSeq_accession
is will search on endpoint https://www.ncbi.nlm.nih.gov/nuccore/ (this is the endpoint of GenBank), but actually RefSeq has its own endpoint https://www.ncbi.nlm.nih.gov/refseq, so the RefSeq endpoint in the schema should be updated.PoDP users fill the
RefSeq_accession
section with Assembly accession, e.g. the project shown below, theGCF_000515175.1
is actually an Assembly accession but not a RefSeq accessionRefSeq and Assembly accessions have different prefixes:
GCF
andGCA
Some ideas:
Assembly
section for Assembly data