naturalis / wgs2ncbi

Toolkit for preparing genomes for submission to NCBI
BSD 3-Clause "New" or "Revised" License
29 stars 5 forks source link

How to respond to validation issues #18

Open emmannaemeka opened 4 years ago

emmannaemeka commented 4 years ago

Hello, Working on using a genome for submission to NCBI contamination errors were sent back. Please how can some of the validation and contamination errors be solved using this tool?

These are the error groups:

  1. Genes completely overlapped by other genes
  2. FEATURE_LOCATION_CONFLICT: 6626 features have inconsistent gene locations.
  3. FEATURE_LOCATION_CONFLICT: RNA feature location does not match gene location
  4. FATAL: SHOW_HYPOTHETICAL_CDS_HAVING_GENE_NAME: 2 hypothetical coding regions have a gene name
  5. ERROR: valid [SEQ_INST.BadProteinStart] gap symbol at start of protein sequence
  6. ERROR: valid [SEQ_INST.StopInProtein] [27] termination symbols in protein sequence
mcmurtrs commented 2 years ago

Were you able to resolve these issues? I am having similar issues with my submission to NCBI. Thank you!