When including sequences of length 1 in the pHMM training, Aphid crashes with this error:
Error in checkForRemoteErrors(val) :
3 nodes produced errors; first error: Invalid model for AA, residue alphabet does not correspond to
20-letter amino acid alphabet
Perhaps it has to do with lacking dimensions for the sequence length 1:
> str(x_AAbin)
List of 25
$ E : 'AAbin' raw E
$ IA : 'AAbin' raw [1:2] I A
$ KQL : 'AAbin' raw [1:3] K Q L
$ ICHW : 'AAbin' raw [1:4] I C H W
$ QPCRH : 'AAbin' raw [1:5] Q P C R ...
$ GVHWFV : 'AAbin' raw [1:6] G V H W ...
$ LNRSNHF : 'AAbin' raw [1:7] L N R S ...
$ TTARHMHY : 'AAbin' raw [1:8] T T A R ...
$ GGEMLKSMS : 'AAbin' raw [1:9] G G E M ...
$ PYYYKCVNGH : 'AAbin' raw [1:10] P Y Y Y ...
$ KQMNMMSLLKG : 'AAbin' raw [1:11] K Q M N ...
$ IHVQMAQWQRLP : 'AAbin' raw [1:12] I H V Q ...
$ SWTGEYIGIAELI : 'AAbin' raw [1:13] S W T G ...
$ AWWNTWAWWTRKVY : 'AAbin' raw [1:14] A W W N ...
$ HEKPAMPYFWRNYMA : 'AAbin' raw [1:15] H E K P ...
$ HKMEEPFTNTNVPSMV : 'AAbin' raw [1:16] H K M E ...
$ GNKRWGCQLWCQEWHGK : 'AAbin' raw [1:17] G N K R ...
$ RWKPWQDELKMCYQKTDH : 'AAbin' raw [1:18] R W K P ...
$ GFVDRYQIANAAVINIDTQ : 'AAbin' raw [1:19] G F V D ...
$ VHIRTPRNYHSCFSDMYHHF : 'AAbin' raw [1:20] V H I R ...
$ ECTVFWPKEWTTPSHYDPCCL : 'AAbin' raw [1:21] E C T V ...
$ DCCRVNMFYPCGWEMGVDKKCI : 'AAbin' raw [1:22] D C C R ...
$ IIMWWEMGPNFLAISLMWKLWWN : 'AAbin' raw [1:23] I I M W ...
$ HPQYPLKSCPNGIHLRILNDGGNG : 'AAbin' raw [1:24] H P Q Y ...
$ FTSSSSWTSNPWCCILTKWTWCNVA: 'AAbin' raw [1:25] F T S S ...
- attr(*, "class")= chr "AAbin"
Hi @shaunpwilkinson,
When including sequences of length 1 in the pHMM training, Aphid crashes with this error:
Perhaps it has to do with lacking dimensions for the sequence length 1:
Reproducible example:
If I change
to
The error disappears