bugphyzz #3367

sdgamboa opened 3 months ago

sdgamboa commented 3 months ago

lshep commented 3 months ago

We don't allow data to be stored on GitHub. Please store data on Zenodo or some other trusted server. If a trusted server is not available, please consider using ExperimentHub.

sdgamboa commented 2 months ago

@lshep, we have published our data on Zenodo and use it as the default option in our main function.

A couple of functions still use data from external sites. One of them is not exported. The other is exported, but we wrote in the documentation that it should only be used by curators/developers. Neither is called by any function meant for end users.

Finally, a dataset that was originally imported from a GitHub repo is now included in the inst/extdata directory, with a description in inst/scripts.

Please let me know if I need to make more changes.

Thank you!

lshep commented 1 month ago

Please also include a section in the vignette, similar to an abstract, describing why you are submitting the package to Bioconductor and comparing it to existing Bioconductor packages.

lshep commented 1 month ago

We have added the package to but I am having trouble kicking off an initial build of the package. I'm currently investigating.

lshep commented 1 month ago

For now, this should be true:

Your package has been added to to continue the pre-review process. A build report will be posted shortly. Please fix any ERROR and WARNING in the build report before a reviewer is assigned or provide a justification on why you feel the ERROR or WARNING should be granted an exception.

IMPORTANT: Please read this documentation for setting up remotes to push to All changes should be pushed to moving forward. It is required to push a version bump to to trigger a new build report.

Bioconductor utilized your GitHub ssh-keys for access. To manage keys and future access you may want to activate your Bioconductor Git Credentials Account

lshep commented 1 month ago

Figured out the issue. If you have difficulty triggering builds with pushes to let me know and I can try to reprocess the package but I think it should be okay.

sdgamboa commented 1 month ago

@lshep, I added an abstract to the vignette and expanded the introduction to describe the package in more detail. I explain how the package should fit within Bioconductor workflows and related packages. Please let me know if that fulfills the request.

I got an error in the build, but I think this is a problem with one of the dependencies of another package (mia):




* checking for file bugphyzz/DESCRIPTION ... OK
* preparing bugphyzz:
* checking DESCRIPTION meta-information ... OK
* installing the package to build vignettes
* creating vignettes ... ERROR
--- re-building bugphyzz.Rmd using rmarkdown

Quitting from lines 291-294 [unnamed-chunk-9] (bugphyzz.Rmd)
Error: processing vignette 'bugphyzz.Rmd' failed with diagnostics:
package or namespace load failed for 'mia' in loadNamespace(i, c(lib.loc, .libPaths()), versionCheck = vI[[i]]):
 there is no package called 'scuttle'
--- failed re-building bugphyzz.Rmd

SUMMARY: processing the following file failed:

Error: Vignette re-building failed.
Execution halted

DarioS commented 1 week ago

bugphyzz is a package that stores annotations regarding microbiomes and enables enrichment analysis to be done. It will be a useful addition for Bioconductor microbiome researchers. However, there are some modifications required for it to conform to the Bioconductor Developer's Guide.

  output <- vector("list", length(files))
  for (i in seq_along(output)) {
    output[[i]] <- utils::read.csv(files[i], header = TRUE, skip = 1) |>
    dplyr::mutate(Attribute = tolower(.data$Attribute))

Please refer for Vectorize and change all such instances to vapply or lapply.
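
As an illustration of the requested change, here is a minimal sketch of the loop above rewritten with lapply. It assumes that `files` is a character vector of CSV paths; the two temporary files are made-up stand-ins for the package's real inputs, and base R assignment is used in place of dplyr::mutate to keep the example self-contained.

```r
# Hypothetical input: two small CSV files, each with one preamble line to skip.
# These stand in for the package's real `files` vector.
files <- vapply(1:2, function(i) {
  path <- tempfile(fileext = ".csv")
  writeLines(c("preamble line", "Attribute,Value", sprintf("Aerobic,%d", i)), path)
  path
}, character(1))

# lapply() replaces both the pre-allocated list and the explicit for loop:
# each file is read, its Attribute column lower-cased, and the result returned.
output <- lapply(files, function(f) {
  x <- utils::read.csv(f, header = TRUE, skip = 1)
  x$Attribute <- tolower(x$Attribute)
  x
})
```

lapply fits here because each iteration returns a data frame; vapply is the better choice when each iteration returns a value of a known type and length, as in the construction of `files` above.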

## Function for importing fatty acid compositions
## TODO This dataset needs more curation.
## TODO Names of the Fatty Acids should be more "user-friendly"
## TODO Maybe a threshold should be decided to consider a FA as present or not.


## TODO correct plant pathogenicity name earlier in the workflow or
## better yet, directly in the curation

See Comments section.

TODO comments should be avoided in published package code.

Please add the missing functionality described in the TODO sections.

> bp[["aerophilicity"]] <- as(bp[["aerophilicity"]], "DataFrame")
> makeSignatures(dat = bp[["aerophilicity"]], tax_id_type = "Taxon_name", tax_level = "genus")
Error in UseMethod("filter") : 
  no applicable method for 'filter' applied to an object of class "c('DFrame', 'DataFrame', 'SimpleList', 'RectangularData', 'List', 'DataFrame_OR_NULL', 'Vector', 'list_OR_List', 'Annotated', 'vector_OR_Vector')"
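
The error above arises because dplyr's filter has no method for an S4Vectors DataFrame. One possible way to accept both input types is to coerce at the top of the function. The sketch below assumes S4Vectors is installed; `as_plain_df` is a hypothetical helper, not part of bugphyzz, and the two-row table is made-up example data.

```r
library(S4Vectors)

# Hypothetical helper (not part of bugphyzz): accept either a base data.frame
# or an S4Vectors DataFrame and always return a base data.frame.
as_plain_df <- function(dat) {
  if (inherits(dat, "DataFrame")) {
    dat <- as.data.frame(dat)
  }
  stopifnot(is.data.frame(dat))
  dat
}

# Made-up example data with the column name used in the call above.
dat <- DataFrame(
  Taxon_name = c("Streptococcus", "Neisseria"),
  Rank = c("genus", "genus")
)
plain <- as_plain_df(dat)
```

A function such as makeSignatures could call a coercion like this on its `dat` argument before any dplyr verbs run; whether that suits the package's design is for the authors to decide.
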

limma::voom(counts = assay(se), design = design, plot = FALSE)$E
> assay(se)[1:5, 1:5] # Not count data.
               700103497 700106940 700097304 700099015 700097644
Streptococcus   17.18097  18.50818  16.03412  15.40698 17.148602
Neisseria       16.82849  16.12832  15.12273  15.20496 13.653478
Porphyromonas   12.80149  12.73703  16.37312  15.05930  4.935801
Capnocytophaga  16.79096  15.31840  16.58074  17.05846 17.598692
Actinomyces     17.66332  17.65465  11.65284  16.26541 13.892903

counts: A numeric matrix containing raw counts, or an ExpressionSet containing raw counts, or a DGEList object. Counts must be non-negative and NAs are not permitted.
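
A minimal sketch of a guard that could run before the voom call, based on the requirements quoted above (non-negative, no NAs). `looks_like_counts` is a hypothetical helper, not part of limma or bugphyzz, and the whole-number test is an added assumption about what "raw counts" means here.

```r
# Hypothetical check (not part of limma or bugphyzz): TRUE only for numeric
# data with no NAs, no negative values, and only whole numbers.
# The whole-number requirement is an assumption, not a documented voom rule.
looks_like_counts <- function(x) {
  is.numeric(x) && !anyNA(x) && all(x >= 0) && all(x == round(x))
}

# Four values copied from the assay output shown above: not whole numbers.
assay_like <- matrix(c(17.18097, 16.82849, 18.50818, 16.12832), nrow = 2)

# Made-up raw counts for comparison.
raw_counts <- matrix(c(10, 0, 3, 25), nrow = 2)

looks_like_counts(assay_like)  # FALSE
looks_like_counts(raw_counts)  # TRUE
```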