Open Rosiekstein opened 1 year ago
I worked it out! If anyone else has this problem, it seems to be an issue with adding the tax_table and otu_table at the same time. So I worked around it by:
# try making a phyloseq object with just otu_table and sam_data
ps.test2 <- phyloseq(otu_table(seqs.test, taxa_are_rows = FALSE), sam_data(sams.test))
all(sample_sums(ps.test2) == rowSums(seqtab.nochim)) # OK, so this works
# add in tax_table to ps.test2
ps.test2@tax_table <- tax_table(tax.test)
all(sample_sums(ps.test2) == rowSums(seqtab.nochim)) # still TRUE, so this worked!
Hello, I'm using dada2 and phyloseq on a supercomputing cluster, with the following Sys.info:
"#1 SMP Mon Jan 9 11:39:52 AEDT 2023" "gadi-cpu-bdw-0090.gadi.nci.org.au" machine login "x86_64" "unknown" user effective_user "rs2207" "rs2207"
and r info:
R version 4.2.2 (2022-10-31) Platform: x86_64-pc-linux-gnu (64-bit) Running under: Ubuntu 22.04.1 LTS
I am trying to create a phyloseq object, but when I do I lose a ton of reads and have several samples that go to 0 reads when they originally had data. I found another post about this issue from 2019 (https://github.com/joey711/phyloseq/issues/1208) but none of the fixes there worked (removing "-" from sample names, etc.). This is defenitely happening in the phyloseq() step, which I tested as follows:
when I look at sample_sums() I get the following before and after the phyloseq() step (this is a subset because I have a lot of samples):
Does anyone have any suggestions on how to deal with this issue? Thank you.