the input protein recovers the correct source nucleotide sequence at the Elink step
the input protein recovers additional, incorrect sequences at the Elink step
where the incorrect recovered sequence is shorter than the correct sequence, it is preferrred over the correct sequence (for reasons of preserving bandwidth)
the incorrect sequence is recovered only when the /[start]-[end] Stockholm format domain is present in the input sequence accession/ID
Summary:
Input protein sequences deriving from a known organism (e.g. human) are retrieiving nucleotide sequences from a different organism (e.g. bos taurus).
Description:
The input sequence
does not give an output nucleotide sequence, as the wrong originating sequence is identified in the Elink linker step.
Reproducible Steps:
With the above sequence as input, run
ncfp
as normal:Current Output:
Expected Output:
A nucleotide coding sequence corresponding to the input protein, in the output directory.
ncfp
Version:commit 694d806
Python Version:
3.9
Operating System:
macOS