lpantano / seqcluster

small RNA analysis from NGS data
http://seqcluster.readthedocs.io
MIT License
35 stars 17 forks source link

Issue with IsomirDataSeqFromFiles #23

Closed biowizz closed 7 years ago

biowizz commented 7 years ago

Hello,

i am trying to use the package on smallRNA seq data. I have used miraligner and got *.mirna output. Now I want to use the isoPlot( ) on our data. I tried with the example files given and used the commands as given below it works fine.

path <- system.file("extra", package="isomiRs")
fn_list <- list.files(path, full.names = TRUE)
de <- data.frame(row.names=c("f1" , "f2"), condition = c("newborn", "newborn"))
ids <- IsomirDataSeqFromFiles(fn_list, design=de)

Now, I am trying the same for our data and it is showing the error as follows.

Error in FUN(X[[i]], ...) : 
  assay colnames() must be NULL or equal colData rownames()
In addition: Warning message:
In IsoCountsFromMatrix(listSamples, design) : No miRNA found.

Could you please suggest any changes I need to make. One observation, The frequency column in our data has only 0. Rest all the column are same as the example data. I also deleted the one addtional precursor column as it was not present in example data.

lpantano commented 7 years ago

Hi,

Sorry for this issue.

I think the problem is the column for frequency to be 0.

How did you analyze your data from the beginning? Normally, there is a collapsing step where you end up with a fasta/fastq file with this format:

seq_ID_x100 SEQUENCE

where x100 is the count data. This step is mandatory if you want to quantify miRNAs and use isomiRs. If you tell me each step you did from the raw FASTQ files I can help you to fix the problem.

Thanks!

On Apr 23, 2017, at 7:13 AM, biowizz notifications@github.com wrote:

Hello,

i am trying to use the package on smallRNA seq data. I have used miraligner and got *.mirna output. Now I want to use the isoPlot( ) on our data. I tried with the example files given and used the commands as given below it works fine.

path <- system.file("extra", package="isomiRs") fn_list <- list.files(path, full.names = TRUE) de <- data.frame(row.names=c("f1" , "f2"), condition = c("newborn", "newborn")) ids <- IsomirDataSeqFromFiles(fn_list, design=de) Now, I am trying the same for our data and it is showing the error as follows.

Error in FUN(X[[i]], ...) : assay colnames() must be NULL or equal colData rownames() In addition: Warning message: In IsoCountsFromMatrix(listSamples, design) : No miRNA found. �������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������������Could you please suggest any changes I need to make. One observation, The frequency column in our data has only 0. Rest all the column are same as the example data. I also deleted the one addtional precursor column as it was not present in example data.

— You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub https://github.com/lpantano/seqcluster/issues/23, or mute the thread https://github.com/notifications/unsubscribe-auth/ABi_HO8HTGihMjPcvyO6SZjGoRimUZyFks5ryzJlgaJpZM4NFYC5.

biowizz commented 7 years ago

Hi Ipantano, Thank you for the quick assistance. The reads that I have are single end from ion Torrent. I have used the manufacturer recommended methodology to do the QC on our data and have the fastq files from them. I used these fastq files in java based miraligner to obtain *.mirna files. I have not used the seqcluster for the entire analysis rather only miraligner on fastq files. Is that a problem?

I completely missed out on the collapsing step. I will try the same and get back to you.

Thanks

biowizz commented 7 years ago

Collapsing the reads did the trick. Thanks a ton for pointing out in the right direction.

lpantano commented 7 years ago

I am glad you got it working!

On Apr 24, 2017, at 1:42 AM, biowizz notifications@github.com wrote:

Collapsing the reads did the trick. Thanks a ton for pointing out in the right direction.

— You are receiving this because you commented. Reply to this email directly, view it on GitHub https://github.com/lpantano/seqcluster/issues/23#issuecomment-296529323, or mute the thread https://github.com/notifications/unsubscribe-auth/ABi_HL04SyBf0HchSA2aXEuzUw-n9YGUks5rzDZDgaJpZM4NFYC5.