Open fanninpm opened 2 years ago
Hi! I also wanted to use this pipeline for Influenza B and got the same error. Has anyone been able to get this to work?
Hi @kdl480, we currently don't do any flu B sequencing or analysis in our lab, only influenza A. But I don't think it would be that difficult to add better support for flu B to the workflow.
I know that this workflow is advertised to run on Influenza A specimens, but IRMA can also run on Influenza B specimens. However, when I try to run the workflow on an Influenza B control, `parse_influenza_blast_results.py` crashes with an `IndexError`.

`.command.err`:
```text
WARNING: Your kernel does not support swap limit capabilities or the cgroup is not mounted. Memory limited without swap.
2021-12-03 18:50:38,273 INFO: Parsing Influenza metadata file "genomeset.dat.gz" [in parse_influenza_blast_results.py:354]
2021-12-03 18:50:40,004 INFO: Parsed Influenza metadata file into DataFrame with n=536691 rows and n=11 columns. There are 169 unique subtypes. [in parse_influenza_blast_results.py:376]
2021-12-03 18:50:40,005 INFO: Parsing BLAST results from BPC.blastn.txt [in parse_influenza_blast_results.py:183]
2021-12-03 18:50:40,463 INFO: Parsed 112085 BLAST results from BPC.blastn.txt [in parse_influenza_blast_results.py:197]
2021-12-03 18:50:40,463 INFO: BPC | n=112085 | Filtering for hits above 0.85% identity. [in parse_influenza_blast_results.py:198]
2021-12-03 18:50:40,477 INFO: BPC | n=111369 | Filtered for hits above 0.85% identity. [in parse_influenza_blast_results.py:204]
/sample/tmp-tf-ton01/workspace/nf-iav-illumina/bin/parse_influenza_blast_results.py:207: SettingWithCopyWarning:
A value is trying to be set on a copy of a slice from a DataFrame.
Try using .loc[row_indexer,col_indexer] = value instead

See the caveats in the documentation: https://pandas.pydata.org/pandas-docs/stable/user_guide/indexing.html#returning-a-view-versus-a-copy
  df_filtered["accession"] = df_filtered.saccver.str.extract(
/sample/tmp-tf-ton01/workspace/nf-iav-illumina/bin/parse_influenza_blast_results.py:210: SettingWithCopyWarning:
A value is trying to be set on a copy of a slice from a DataFrame.
Try using .loc[row_indexer,col_indexer] = value instead

See the caveats in the documentation: https://pandas.pydata.org/pandas-docs/stable/user_guide/indexing.html#returning-a-view-versus-a-copy
  df_filtered["sample"] = sample_name
/sample/tmp-tf-ton01/workspace/nf-iav-illumina/bin/parse_influenza_blast_results.py:211: SettingWithCopyWarning:
A value is trying to be set on a copy of a slice from a DataFrame.
Try using .loc[row_indexer,col_indexer] = value instead

See the caveats in the documentation: https://pandas.pydata.org/pandas-docs/stable/user_guide/indexing.html#returning-a-view-versus-a-copy
  df_filtered["sample"] = pd.Categorical(df_filtered["sample"])
/sample/tmp-tf-ton01/workspace/nf-iav-illumina/bin/parse_influenza_blast_results.py:212: SettingWithCopyWarning:
A value is trying to be set on a copy of a slice from a DataFrame.
Try using .loc[row_indexer,col_indexer] = value instead

See the caveats in the documentation: https://pandas.pydata.org/pandas-docs/stable/user_guide/indexing.html#returning-a-view-versus-a-copy
  df_filtered["sample_segment"] = df_filtered.qaccver.str.extract(r".+_(\d)$").astype(
/sample/tmp-tf-ton01/workspace/nf-iav-illumina/bin/parse_influenza_blast_results.py:215: SettingWithCopyWarning:
A value is trying to be set on a copy of a slice from a DataFrame.
Try using .loc[row_indexer,col_indexer] = value instead

See the caveats in the documentation: https://pandas.pydata.org/pandas-docs/stable/user_guide/indexing.html#returning-a-view-versus-a-copy
  df_filtered["sample_segment"] = pd.Categorical(df_filtered["sample_segment"])
/sample/tmp-tf-ton01/workspace/nf-iav-illumina/bin/parse_influenza_blast_results.py:217: SettingWithCopyWarning:
A value is trying to be set on a copy of a slice from a DataFrame.
Try using .loc[row_indexer,col_indexer] = value instead

See the caveats in the documentation: https://pandas.pydata.org/pandas-docs/stable/user_guide/indexing.html#returning-a-view-versus-a-copy
  df_filtered["subtype_from_match_title"] = (
/sample/tmp-tf-ton01/workspace/nf-iav-illumina/bin/parse_influenza_blast_results.py:220: SettingWithCopyWarning:
A value is trying to be set on a copy of a slice from a DataFrame.
Try using .loc[row_indexer,col_indexer] = value instead

See the caveats in the documentation: https://pandas.pydata.org/pandas-docs/stable/user_guide/indexing.html#returning-a-view-versus-a-copy
  df_filtered["subtype_from_match_title"] = df_filtered["subtype_from_match_title"]
2021-12-03 18:50:41,288 INFO: BPC | Merging NCBI Influenza DB genome metadata with BLAST results on accession. [in parse_influenza_blast_results.py:221]
╭───────────────────── Traceback (most recent call last) ──────────────────────╮
│ /sample/tmp-tf-ton01/workspace/nf-iav-illumina/bin/parse_influenza_blast_res │
│ ults.py:495 in
```

BPC.blastn.txt
I hope the attached information is helpful.
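As a side note, the repeated `SettingWithCopyWarning` messages in the log are probably unrelated to the `IndexError`, but they do add a lot of noise. A minimal sketch of one common way to avoid them, assuming `df_filtered` is produced by a boolean-mask filter on the BLAST table: take an explicit `.copy()` before adding columns. The rows, the accessions, the `pident` column name and the threshold value below are made up for illustration and are not taken from the script; only the `qaccver`/`saccver` names and the segment regex come from the log.

```python
import pandas as pd

# Made-up BLAST-like rows. `pident` and the accessions are assumptions
# for illustration; `qaccver` and `saccver` appear in the log.
df = pd.DataFrame(
    {
        "qaccver": ["BPC_1", "BPC_2", "BPC_3"],
        "saccver": ["XX000001", "XX000002", "XX000003"],
        "pident": [99.1, 80.0, 97.5],
    }
)

# An explicit copy of the filtered rows makes the later column
# assignments unambiguous, so pandas has no reason to warn.
df_filtered = df[df["pident"] >= 85.0].copy()

df_filtered["sample"] = "BPC"
# Same regex as in the log; expand=False returns a Series rather than
# a one-column DataFrame, so .astype(int) acts on a single column.
df_filtered["sample_segment"] = (
    df_filtered["qaccver"].str.extract(r".+_(\d)$", expand=False).astype(int)
)

print(df_filtered)
```

Because `df_filtered` is an independent copy, the original `df` is left untouched by the new columns.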