phyloseq is a set of classes, wrappers, and tools (in R) to make it easier to import, store, and analyze phylogenetic sequencing data; and to reproducibly share that data and analysis with others. See the phyloseq front page:
Hope my message is in the right place, it's not directly a phyloseq problem but I think this may be of interest for other phyloseq users, and that at least, someone reading these posts can help. I want to import a .biom file in R that I've dowloaded from the ftp server - the Earth Microbiome Project (release 1).
I tried to open it with both the phyloseq::import_biome and the bioformat::read_biom functions :
emp<-read_biom(biom.file)
I got the following Error message:
E:/Path/to/my/data/EMP_data/emp_cr_silva_16S_123.qc_filtered.biom
either as JSON (BIOM-v1) or HDF5 (BIOM-v2).
Check file path, file name, file itself, then try again.
I then checked the file path and name with the is_file function of the 'fs' package. The file path and names are OK.
So I checked the .biom file I want to import by openning it with Notepad, and it shows strange characters (really sorry but I am not familiar with develloping) such as:
ƒOB§ß;}]0v(ÿQ<ãï8 #OÅ+q8‚´'Ž;º‹ë®Ü-¯§-‡ùP
I had a look at other .biom files I have but none of them look like this. I try to open these other files with the same function and it works.
The problem most likely comes from the file format but I don't know how to deal with this. Has anyone been subjected to a similar problem? I am familiar with the dada2 pipeline but not with qiime/biom format. I have been through the different vignettes and websites but have'n seen similar problems. Hope someone can help with this.
Hi @joey711 and @carolinebrunel . Did you ever sort this? I have worked out that some files do work (i.e. emp_cr_silva_16S_123.subset_2k.rare_10000.biom, but I am having exactly the same problem as you.
Hi Joey711,
Hope my message is in the right place, it's not directly a phyloseq problem but I think this may be of interest for other phyloseq users, and that at least, someone reading these posts can help. I want to import a .biom file in R that I've dowloaded from the ftp server - the Earth Microbiome Project (release 1).
The file comes from the following link: ftp://ftp.microbio.me/emp/release1/otu_tables/closed_ref_silva/ I tried with several of these files but the one I want to import in R (studio) is the first one: 'emp_cr_silva_16S_123.qc_filtered.biom' (293MB)
I've tried several things:
I tried to open it with both the phyloseq::import_biome and the bioformat::read_biom functions :
I got the following Error message:
I then checked the file path and name with the is_file function of the 'fs' package. The file path and names are OK.
So I checked the .biom file I want to import by openning it with Notepad, and it shows strange characters (really sorry but I am not familiar with develloping) such as:
ƒOB§ß;}]0v(ÿQ<ãï8 #OÅ+q8‚´'Ž;º‹ë®Ü-¯§-‡ùP
I had a look at other .biom files I have but none of them look like this. I try to open these other files with the same function and it works.
I tried to obtain this file from other repositories (https://zenodo.org/record/890000) but have a similar problem.
The problem most likely comes from the file format but I don't know how to deal with this. Has anyone been subjected to a similar problem? I am familiar with the dada2 pipeline but not with qiime/biom format. I have been through the different vignettes and websites but have'n seen similar problems. Hope someone can help with this.
Thanks in advanced