Submission: colocr - Githubissues

MahShaaban commented 6 years ago

Summary

What does this package do? (explain in 50 words or less): Conduct Co-localization Analysis of Fluorescence Microscopy Images
Paste the full DESCRIPTION file inside a code block below:

Package: colocr
Type: Package
Title: Conduct Co-localization Analysis of Fluorescence Microscopy Images
Version: 0.1.0
License: GPL-3
Authors@R: person("Mahmoud", "Ahmed",
    email = "mahmoud.s.fahmy@students.kasralainy.edu.eg",
    role = c("aut", "cre"))
URL: https://github.com/MahShaaban/colocr
BugReports: https://github.com/MahShaaban/colocr/issues
Description: Automate the co-localization analysis of fluoresence microscopy 
  images. Selecting regions of interest, extract pixel intensities from 
  the image channels and calculate different co-localization statistics.
Encoding: UTF-8
LazyData: true
Suggests: testthat,
    covr,
    knitr,
    rmarkdown,
    devtools
RoxygenNote: 6.0.1
Imports: imager,
  shiny,
  scales
VignetteBuilder: knitr

URL for the package (the development repository, not a stylized html page): https://github.com/MahShaaban/colocr
Please indicate which category or categories from our package fit policies this package falls under *and why(? (e.g., data retrieval, reproducibility. If you are unsure, we suggest you make a pre-submission inquiry.):

[e.g., "data extraction, because the package parses a scientific data file format"] data extraction

Who is the target audience and what are scientific applications of this package? Biologists (no advanced R required)
Are there other R packages that accomplish the same thing? If so, how does yours differ or meet our criteria for best-in-category? The EBImage and the imager packages contain the algorithms for dealing with images. This package uses imager but is focused only on the functionality needed for conducting co-localization of multichannel images. In addition, the package provide a shiny app to inteactively perform the same analysis.
If you made a pre-submission enquiry, please paste the link to the corresponding issue, forum post, or other discussion, or @tag the editor you contacted. A pre-submission, https://github.com/ropensci/onboarding/issues/241

Requirements

Confirm each of the following by checking the box. This package:

[x] does not violate the Terms of Service of any service it interacts with.
[x] has a CRAN and OSI accepted license.
[x] contains a README with instructions for installing the development version.
[x] includes documentation with examples for all functions.
[x] contains a vignette with examples of its essential functions and uses.
[x] has a test suite.
[x] has continuous integration, including reporting of test coverage, using services such as Travis CI, Coveralls and/or CodeCov.
[x] I agree to abide by ROpenSci's Code of Conduct during the review process and in maintaining my package should it be accepted.

Publication options

[x] Do you intend for this package to go on CRAN?
[ ] Do you wish to automatically submit to the Journal of Open Source Software? If so:
- [ ] The package has an obvious research application according to JOSS's definition.
- [ ] The package contains a paper.md matching JOSS's requirements with a high-level description in the package root or in inst/.
- [ ] The package is deposited in a long-term repository with the DOI:
- (Do not submit your package separately to JOSS)
[ ] Do you wish to submit an Applications Article about your package to Methods in Ecology and Evolution? If so:
- [ ] The package is novel and will be of interest to the broad readership of the journal.
- [ ] The manuscript describing the package is no longer than 3000 words.
- [ ] You intend to archive the code for the package in a long-term repository which meets the requirements of the journal (see MEE's Policy on Publishing Code)
- (Scope: Do consider MEE's Aims and Scope for your manuscript. We make no gaurantee that your manuscript willl be within MEE scope.)
- (Although not required, we strongly recommend having a full manuscript prepared when you submit here.)
- (Please do not submit your package separately to Methods in Ecology and Evolution)

Detail

[x] Does R CMD check (or devtools::check()) succeed? Paste and describe any errors or warnings:
[x] Does the package conform to rOpenSci packaging guidelines? Please describe any exceptions:
If this is a resubmission following rejection, please explain the change in circumstances:
If possible, please provide recommendations of reviewers - those with experience with similar packages and/or likely users of your package - and their GitHub user names:

maelle commented 6 years ago

I got a difference again... What I got was "cor": "Average PCC: 0.76 and Average MOC: 0.9" 🤔

I'll wait for your having fixed the Appveyor build before trying again.

MahShaaban commented 5 years ago

Hey @maelle Apologies for the late reply. I was away from my workstation since the beginning of the week. Now, Now that I installed phantomjs on appveyor, the app tests can run. The tests actually failed on appveyor in a way similar to the one you mentioned in this last comment. I can't replicate the error locally though! Could you please give me some information about your setup environment?

maelle commented 5 years ago

:wave: @MahShaaban!

My session info was in https://github.com/ropensci/onboarding/issues/243#issuecomment-421452708

Do you have access to a Windows machine?

MahShaaban commented 5 years ago

Thanks @maelle I checked the package versions, they are comparable to the ones I have and still can't reproduce the error. Assuming that the tests fail only on windows, what would be the way forward? I don't have an easy access to a windows machine, and I never really used R on one.

maelle commented 5 years ago

will have a look myself in the next days. we'll solve this sooner or later 😁

MahShaaban commented 5 years ago

Thanks a lot @maelle

maelle commented 5 years ago

I've just had a look, can't do much more at the moment.

Does the output in artifacts https://ci.appveyor.com/project/MahShaaban/colocr/build/artifacts help at all?
Could you write lines of code that I could run to give you the output of, if that helps?
I was hoping the tests of shinytest on Appveyor would help but I can't access them https://ci.appveyor.com/project/rstudio/shinytest - maybe you could request access?

maelle commented 5 years ago

I'm totally unexperimented with your app, hence my poor debugging. I was thinking that maybe when you set one of the input using the shinytest commands, the cursors move more or less than what you get on mac/Linux?

Btw on Travis do you test on on Linux? It'd be worth adding a mac build, just to see whether you only get the issue for Windows.

maelle commented 5 years ago

Or maybe it's due to a waiting time that's not enough on Windows? Could you try adding waiting time before the snapshot? Or even adding time between all commands?

End of my suggestions for today, sorry about that.

maelle commented 5 years ago

Maybe useful https://rstudio.github.io/shinytest/articles/in-depth.html#getting-input-output-and-export-values (to check the inputs have been set)

MahShaaban commented 5 years ago

Hey @maelle Here is what I tried recently.

I already checked the artifacts and found nothing useful. Only the error message and the difference between the output and the expected.
I am not sure how the tests of shiny test can be useful. I clicked the link and I couldn't access them either.
On Travis I only test on Linux. I agree with your suggestion, maybe testing Mac can give a clue. However, I run the test on my mac and they pass with no changes!
I increased the wait time for the steps that take long to update the output and this had no effects. I still get the same error and differences on appveyor. Check this last commit
I checked the link in your last comment and it has no mention of Linux/windows discrepancies that could be of help.

Here is an idea. Since you can reproduce the errors on your local windows machine. I think it would be useful to see if the tests pass if you run them for the first time. To do that you need to remove the expected output folders from the app tests folder inst/colocr/tests/*-expected/. Once those two folders are removed, you can run the tests from the app directory shinytest::testApp(). This will give a message saying the tests are running for the first time and the expected images are being created. We can then compare the logs.

maelle commented 5 years ago

Doing it now... after removing the timeout_ arguments 👼

maelle commented 5 years ago

This is still quite slow and I get no message yet 🤔

MahShaaban commented 5 years ago

I wrote two tests to check whether the output of the app is reproduced using the same input parameters. Onc test used the colocr functions and another without. The tests passed locally and on Travis, while the two of them failed on Appveyor. In addition to the fact that, these are the only tests that actually check the numeric outputs (the co-localization stats), I think that neither the app nor the colocr functions are the root of the problem.

The tests are in a file called tests/test-reproduce_app.R in this last commit

PS. @seaaan noticed before that the part of the vignette where I check the reproduction of the app output from the same input returns FALSE. The code chunk is check_equal at line 313 of the vignette. Was this happening on a windows machine?

maelle commented 5 years ago

Interesting. Can you add a more minimal example using imager and data that's not in colocr so that I might run it and we can post in imager repo?

MahShaaban commented 5 years ago

I am not sure how to make a minimal example in this case. So far, I've been checking the final outputs of either colocr or imager and the tests passes locally and on travis but not appveyor. I am using multiple imager functions, and this difference could be due to any of them. So,

I am not sure how to identify the particular function that produces the difference.
imager builds on travis, only to an earlier commit, but it doesn't have appveyor integration.
I made the the same test in colocr/test-reproduce_app.R to run independently from colocr if that helps. Basically, I replaced the system.file() calls to download.file() from GitHub first and reading them.

My current thought is to build imager on travis and appveyor and run this test. Meanwhile I am trying to figure out a way to identify the one or more functions that is causing the issue. I am not sure this is the smarts way to do it, but I think I can save all intermediary objects from the test run in an R object and compare them to a test run on windows/appveyor.

maelle commented 5 years ago

@MahShaaban I don't see the test script with download.files()? Could you please write share a gist without testthat? I'll then run it. I was thinking seeing imager:: would help, and at each point where you use an imager function if possible use () to show the output, this way it'll be easier to compare?

MahShaaban commented 5 years ago

Sorry, I forgot to link to the test script in the imager fork. Here, is a gist of the script. I updated the second revision to remove testthat calls.

MahShaaban commented 5 years ago

I traced the difference between the calculated correlations on ubuntu and windows to very first step in loading the images. I think this (dahtah/imager#41) is related, although neither the maintainer nor the user followed up on the issue yet!

I found that the pixel values of the images in the colocr package are loaded differently on the two platforms. This was at least in part for jpeg::readJPEG() which is used in imager::load.image() and colocr::image_load(). I noticed the same when tried with other images. Here is one from the jpeg package itself.

On ubuntu

> version
               _                           
platform       x86_64-pc-linux-gnu         
arch           x86_64                      
os             linux-gnu                   
system         x86_64, linux-gnu           
status                                     
major          3                           
minor          5.1                         
year           2018                        
month          07                          
day            02                          
svn rev        74947                       
language       R                           
version.string R version 3.5.1 (2018-07-02)
nickname       Feather Spray
> packageVersion('jpeg')
[1] ‘0.1.8’               
> fl <- system.file('img', 'Rlogo.jpg', package = 'jpeg')
> img <- jpeg::readJPEG(fl)
> mean(img)
[1] 0.7046421

On windows

> version
               _                           
platform       x86_64-w64-mingw32          
arch           x86_64                      
os             mingw32                     
system         x86_64, mingw32             
status                                     
major          3                           
minor          5.1                         
year           2018                        
month          07                          
day            02                          
svn rev        74947                       
language       R                           
version.string R version 3.5.1 (2018-07-02)
nickname       Feather Spray
> packageVersion('jpeg')
[1] ‘0.1.8’               
> fl <- system.file('img', 'Rlogo.jpg', package = 'jpeg')
> img <- jpeg::readJPEG(fl)
> mean(img)
[1] 0.7047047

Notice the difference starting at the 4th decimal point. I am showing the mean here, but I visually inspected the values themselves and they look different. Could that be due to instability/inaccuracies at the very small decimal points?

I couldn't go beyond that as jpeg::readJPEG itself is a call to a source file, part of CImg C++ library as far as I can tell.

> jpeg::readJPEG
function (source, native = FALSE) 
.Call("read_jpeg", if (is.raw(source)) source else path.expand(source), 
    native, PACKAGE = "jpeg")
<bytecode: 0x2d11910>
<environment: namespace:jpeg>

MahShaaban commented 5 years ago

I used image_read from the magick package instead of load.image from imager and this seems to solve the issue of ubuntu/windows differences. Here, MahShaaban/colocr#3

maelle commented 5 years ago

Cool! In that case why not switch the whole package to magick? 😉

jeroen commented 5 years ago

I would second maelle's suggestion to try and switch to magick. It is a much more comprehensive and reliable image toolkit. Is there particular functionality in imager that you are missing from magick?

MahShaaban commented 5 years ago

Thanks, @maelle @jeroen for the suggestion. I certainly don't mind looking into that.

Although the package currently relies heavily on imager, I don't mind switching to magick. I went through the magick vignette and I think the classes and the basic image transformations that I'd need are already there. However, I don't see the equivalent/alternatives to the Morphological Operations in imager, namely shrink(), grow(), fill() and clean(). Or am I missing something in magick that could replace these?

Here is the relevant part from the NAMESPACE

importFrom(imager,clean)
importFrom(imager,fill)
importFrom(imager,grow)
importFrom(imager,shrink)
importFrom(imager,threshold)

jeroen commented 5 years ago

Thanks, see also this issue: https://github.com/ropensci/magick/issues/136

In the latest dev version of magick you can find the morphology methods with morphology_types():

> morphology_types()
 [1] "Undefined"         "Correlate"         "Convolve"          "Dilate"           
 [5] "Erode"             "Close"             "Open"              "DilateIntensity"  
 [9] "ErodeIntensity"    "CloseIntensity"    "OpenIntensity"     "DilateI"          
[13] "ErodeI"            "CloseI"            "OpenI"             "Smooth"           
[17] "EdgeOut"           "EdgeIn"            "Edge"              "TopHat"           
[21] "BottomHat"         "Hmt"               "HitNMiss"          "HitAndMiss"       
[25] "Thinning"          "Thicken"           "Distance"          "IterativeDistance"
[29] "Voronoi"

I think the main features you use are:

shrink: magick::image_morphology(method = 'Erode', ...)
grow: magick::image_morphology(method = 'Dilate', ...)

I'm not sure what exactly imager::clean does under the hood, but the imagemagick morphology manual explains several morphology methods that can be used for cleaning. We also have a function magick::image_despeckle().

For thresholding you can try image_threshold() or you can try some of the morphology methods.

MahShaaban commented 5 years ago

Sounds great. I will be looking into that. Thanks @jeroen

maelle commented 5 years ago

Approved! Thanks @MahShaaban for submitting and @seaaan @haozhu233 for your reviews! 😸

To-dos:

[x] Open an issue planning to switch everything to magick over time https://github.com/MahShaaban/colocr/issues/4 (@jeroen you can chime in there, thanks again for your comments here)
[ ] Transfer the repo to rOpenSci's "ropensci" GitHub organization under "Settings" in your repo. I have invited you to a team that should allow you to do so. You'll be made admin once you do.
[ ] Add the rOpenSci footer to the bottom of your README " [![ropensci_footer](https://ropensci.org/public_images/ropensci_footer.png)](https://ropensci.org)"
[ ] Fix any links in badges for CI and coverage to point to the ropensci URL. We no longer transfer Appveyor projects to ropensci Appveyor account so after transfer of your repo to rOpenSci's "ropensci" GitHub organization the badge should be [![AppVeyor Build Status](https://ci.appveyor.com/api/projects/status/github/ropensci/pkgname?branch=master&svg=true)](https://ci.appveyor.com/project/individualaccount/pkgname).
[ ] We're starting to roll out software metadata files to all ropensci packages via the Codemeta initiative, see https://github.com/ropensci/codemetar/#codemetar for how to include it in your package, after installing the package - should be easy as running codemetar::write_codemeta() in the root of your package.

Should you want to awknowledge your reviewers in your package DESCRIPTION, you can do so by making them "rev"-type contributors in the Authors@R field (with their consent). More info on this here.

Welcome aboard! We'd also love a blog post about your package, either a short-form intro to it (https://ropensci.org/tech-notes/) or long-form post with more narrative about its development. (https://ropensci.org/blog/). If you are interested, @stefaniebutland will be in touch about content and timing.

We've started putting together a gitbook with our best practice and tips, this chapter starts the 3d section that's about guidance for after onboarding. Please tell us what could be improved, the corresponding repo is here.

MahShaaban commented 5 years ago

Thank you, everyone. I transferred the repo, fixed the ci links and will look into the other suggestions.

I'd like to acknowledge your contributions @maelle, @haozhu233, @seaaan, and @jeroen if you don't mind.

maelle commented 5 years ago

Awesome!

Don't acknowledge my contributions, as mentioned here "Please do not list editors as contributors. Your participation in and contribution to rOpenSci is thanks enough!" :wink: (we mean it!)

stefaniebutland commented 5 years ago

Hello @MahShaaban. Are you interested in writing a post about your package for the rOpenSci blog, either a short-form intro to it (https://ropensci.org/tech-notes/) or long-form post with more narrative about its development (https://ropensci.org/blog/)?

This link will give you many examples of blog posts by authors of onboarded packages so you can get an idea of the style and length you prefer: https://ropensci.org/tags/onboarding/.

Here are some technical and editorial guidelines for contributing a post: https://github.com/ropensci/roweb2#contributing-a-blog-post.

Please let me know what you think.

MahShaaban commented 5 years ago

Thanks, @stefaniebutland for this opportunity. I'd like to write a blog post about colocr. I will read the guides first and get back to you to discuss it.

stefaniebutland commented 5 years ago

@MahShaaban What do you think about setting a deadline to submit a draft post? I'm happy to answer any questions you might have.

MahShaaban commented 5 years ago

Hey @stefaniebutland. I certainly don't mind that. I read the guides you referred to earlier and I think I will go with a short post intro. The idea is to adapt parts of the vignette that explains the goal of the package and how it with examples. If this is okay, I will start right away.

stefaniebutland commented 5 years ago

If you're referring to a tech note (https://ropensci.org/technotes/), they don't require scheduling on a certain day of the week so please submit your draft when ready and I'll review it soon after.

adapt parts of the vignette that explains the goal of the package and how it with examples.

Sounds good. Make sure it's different enough from the vignette. Good if you can lay out one cool example of what you can do with the package, rather than giving several examples.

MahShaaban commented 5 years ago

Okay. Thanks @stefaniebutland.

stefaniebutland commented 5 years ago

@MahShaaban, @maelle just reminded me that this is your second package onboarding! I'm quite curious about package authors' motivations to submit multiple packages e.g. are there diminishing returns on author's effort on subsequent submissions?

I know you indicated you prefer to write a tech note about colocr, but if it interests you and you see value in it for yourself, I'd love to read a blog post that features colocr as you described, but also reflects on your experiences and motivation for onboarding multiple packages.

Zero obligation to do more than you suggested! 😄

MahShaaban commented 5 years ago

The truth is, I intended to write a blog post about this recent submission. The same happened the first time I submitted a package to ropensci. The reason I shied away from it is that I don't see how a detailed description of the package and the features could be different from the vignette! I am definitely willing to be educated on this. There might be different ways of writing or different aspects of the package that I should focus on while writing a blog post vs a package vignette. I think being familiar with the submission and review process helped a lot the second time around. So the second submission was easier in this sense. In both cases, I had a very positive experience. I think the reviews and the suggestions I received improved the packages.

stefaniebutland commented 5 years ago

Sorry @MahShaaban I think I misunderstood when I thought you wanted to write a tech note. Yes your idea for a blog post: " to adapt parts of the vignette that explains the goal of the package and how it with examples" sounds good.

I don't see how a detailed description of the package and the features could be different from the vignette!

I think the blog post differs from the vignette in that the post should tell a bit of a story. Unlike a vignette, it's an opportunity to give your personal perspective on the package, like something you learned, or some really strong motivation for creating it. Was it your first Shiny app? Do you have any general tips for packages with Shiny apps? This might make the post interesting for people outside your field. (Thanks to @maelle for suggesting this to me when I asked her for advice.) Do you know of other users of your package? And how do they use it? Any of those things could go in the post.

One of the big benefits of contributing a blog post is that it can get more eyes on your work. Once published, we tweet from rOpenSci to >20,000 followers and it gets picked up by R-weekly live and R-bloggers.

With that, would you like to set a deadline for submitting a draft via pull request? Technical and editorial guidelines: https://github.com/ropensci/roweb2#contributing-a-blog-post.

MahShaaban commented 5 years ago

Hey @stefaniebutland, I just submitted a PR with a first draft of the blog post, ropensci/roweb2#329. Please, let me know what you think.

ropensci / software-review

Submission: colocr #243

Summary

Requirements

Publication options

Detail