Closed wkipandula closed 9 months ago
@CunliangGeng - are you aware of any recent changes in the expected file format? @wkipandula, did you find an example mapping file?
Hi @wkipandula,
If you did not upload your data to PODP, then you will also have to add MS files to the strain_mapping.csv
. For example,
if "10a.mzXML" and "10b.mzXML" have the strain label "MIAP_547A_A", then you need to append them to the first line (MIAP_547A_A) of your strain_mapping.csv
.
I am using the webapp version.See the unknown_strains_gen.csv and unknown strains_met.csv.I am thinking that the problem is in parsing the gene/.bgk files labels. unknown_strains_gen.csv unknown_strains_met.csv common_strains.csv strain_mappings.csv
OK, I assume that you're using version 1.2.0.
The unknown_strains_gen
shows that BGC names from BigScape clustering file are not recognised. You could debug in two ways:
make sure your .gbk files are organised properly like the structure below:
antismash
├── GCF_000016425.1
│ ├── NC_009380.1.region001.gbk
│ ├── NC_009380.1.region002.gbk
│ └── NC_009380.1.region003.gbk
├── GCF_000018265.1
│ ├── NC_009953.1.region001.gbk
│ ├── NC_009953.1.region002.gbk
│ ├── NC_009953.1.region003.gbk
...
if the first solution does not work, the name of .gbk files might also matter. For example MIAP_547A_C_00023_NODE_23...region001
should be changed to MIAP_547A_C_00023_NODE_23.region001
by keeping only one single dot.
Hi there, Can someone please take a look at my strain_mappings file attached if I am missing something?I keep getting the "No strains" found or the "Failed to find ANY strains, missing strain_mappings.csv" errors. Please help.
strain_mappings.csv