Closed tanpuekai closed 5 years ago
My guess is that you attitude and tone won't get you any answer from the developers. Your whole post could be re-written to be more respectful and less demanding. This is, after all, free and open-source. The whole point of putting it up here is for you (with 10 years of R experience) to help out and improve it (or make it more robust), not to bash it into oblivion.
By the way, I ran the whole pipeline with a few issues that were resolved easily within two days. While running the dataset, did you have a look at how much RAM your system was using? I noticed that my swap file was getting full. All the issue I had were simply, too little RAM. Using 50Gb of RAM solved all my issues. Why don't you try that?
My guess is that you attitude and tone won't get you any answer from the developers. Your whole post could be re-written to be more respectful and less demanding. This is, after all, free and open-source. The whole point of putting it up here is for you (with 10 years of R experience) to help out and improve it (or make it more robust), not to bash it into oblivion.
By the way, I ran the whole pipeline with a few issues that were resolved easily within two days. While running the dataset, did you have a look at how much RAM your system was using? I noticed that my swap file was getting full. All the issue I had were simply, too little RAM. Using 50Gb of RAM solved all my issues. Why don't you try that?
i already listed over 10 reasons why it is a horrible BI software, with solid support. It should not have been accepted in the first place. there are many good BI software, most of which stick to the basic principles of good software engineering, like ease of use (maybe one-line cmd), well-defined input and well-explained output, and minimal dependencies.
it is not true that just because it is free and open-sourced, therefore it needs not be sticking to basic principles of good software engineering.
Dear @tanpuekai ,
I understand your frustration with the versions and dependencies... as developers of this package we have also had to suffer them. That's why we have also created Docker and Singularity images. They reduce dependencies issues to the minimum, so you might want to check them:
If you prefer to stick to R: It seems that most of your trouble was caused by following a tutorial that does not match your SCENIC version. To see the tutorial matching your local version you can use any of these commands (if the vignettes are installed locally):
vignette("SCENIC_Running")
or
vignetteFile <- file.path(system.file('doc', package='SCENIC'), "SCENIC_Running.Rmd"); file.copy(vignetteFile, "SCENIC_myRun.Rmd")
Also, be aware that the tutorial in vignette/SCENIC_Running.Rmd
mostly uses the "wrapper functions" for running the pipeline "as is". If you prefer to run the workflow "manually" (command by command) you can always take the detailed vignettes as template (file name starting by detailedStep_...
), and modify according to your needs... It requires some time and understanding of the workflow... but it is more flexible, and you can decide which dependencies to install...
@tanpuekai
Despite your attitude, I would like to share that SCENIC is also avaiable as a one-line command via Nextflow. Check here.
I also recommend changing your mindset about 'bioinformatics packages' having the ease of use as a top priority. Of couse the user interface is important, but we'll have to face that nowadays Systems Biology tends to lead us towards increasingly more complex algorithms (and, as such, with more dependencies and more prone to little glitches from the user).
Packages will eventually return errors even on vignettes, and whining about it won't make it different. What can make it different is writing a good, non-aggressive issue, addressing the problem you're facing and further documenting it. Sometimes it may be your own mistake at game (for example -> dbs = DBS, it obviously won't recognize it as the variable since R is case-sensitive).
I install according to this link: https://rawcdn.githack.com/aertslab/SCENIC/a0a00644b2f3589a3e2bc65486fc5f6cc00f48e1/inst/doc/SCENIC_Setup.html#sample-dataset-download-format and run according to this link: https://rawcdn.githack.com/aertslab/SCENIC/a0a00644b2f3589a3e2bc65486fc5f6cc00f48e1/inst/doc/SCENIC_Running.html#exploringinterpreting_the_results
I have several PCs with 32G RAM each. My PCs are good enough. Also got a cluster account. R3.6 is used.
on the first link, it says it needs three R packages: "GENIE3", "AUCell", "RcisTarget", but as I run thru the steps, it ended up installing over 100 packages.
After many version conflicts, like Rngtools, some just need older version to get by; I did manage to load "SCENIC" into R.
It says it needs those "databases", so I downloaded them all. I found no matter how I download them (using PCs, and clusters), the Sha256 code is always different from what you provided. For example, these are the SHA256 on my side:
and these are what you said they are in your web:
OK, let us getting into running the first line of SCENIC:
loomPath <- system.file(package="SCENIC", "examples/mouseBrain_toy.loom")
what I found is that loomPath is an empty variable. There is nothing in it after running above line. And of coz, because of that the following lines are all errors:
Since I know that this software is apparently not well engineered, and keeps distracting the end users for useless sidetracks; For example, here apparently what you want is a gene-by-cell matrix called exprMat, I went to your website to download that "mouseBrain.RData", and generated a exprMat to keep it going.
Then your web says:
Can I ask why you use
=
and<-
interchangeably? that just makes your software look so crappy.BTW, what is that
dbs=dbs
for? i have installed the latest version of your package, but it does not accept this argument:I mean you listed all kinds of boundaries (from 500bp, to 5k, to 10k) here, and you have 7 species, 10 species and all kinds of weird but unexplained versions (like mc8/9nr, what the hell is that?), but it seems you only accept the default?
what the hell ... you developed a software that needs to load >100 packages full of conflicts just for this equation? And this is the foundation for all your regulatory network construction?
in another document, you said use
runCorrelation(exprMat_filtered, scenicOptions)
to compute pairwise correlations, the thing is the functionrunCorrelation()
does not exist at all:In your tutorial page, which is where it reaches after I clicked on a link in the main page of this github project, you said:
Seems pretty recently updated, but wait a min, where the hell is the
version 1.1.1.9
?OK, so since
cor(t(exprMat_filtered), method="spearman")
is a built-in command, and is guaranteed to work, so however crappy the formula might seem, I go ahead with this step, and continue into the next stepsrunGenie3(exprMat_L, scenicOptions)
. Thank goodness, it works for this step.So I then proceeded into
runSCENIC_1/2/3
. Luckily,runSCENIC_1_coexNetwork2modules(scenicOptions)
seems to be working, but then I encountered serious setbacks atrunSCENIC_2
:OK, might be the
feather
package is too old, so i re-install the latest version. But then came this: