LooseLab / readfish

CLI tool for flexible and fast adaptive sampling on ONT sequencers
https://looselab.github.io/readfish/
GNU General Public License v3.0
163 stars 31 forks source link

readfish stats #325

Closed alexomics closed 3 months ago

alexomics commented 5 months ago
          Dear Team,

I do have another issue. I tried to analyze the results using readfish stats with the --toml and --fastq-directory arguments. However, I see an error

(Channel 72 not found in Channel Map or', ' Barcode barcode48 not found in toml human_chr_selection.toml')

My toml file has the same content as the original human_chr_selection.toml file except for the file path to the reference mmi file. I suspect that I have to create a TOML table named barcodes and specify each of the barcodes within the toml file. Is this the way to go? Alternatively, could it be that the file path to the --fastq-directory is wrong? In this case, I specified the filepath to the run directory (e.g., /data/AFO_V3) or the file path to the fastq_pass directory (e.g., /data/AFO_V3/sample1/20240125_1411_MS00000_id_XXXXX/fastq_pass). I also tried to specify the file path to a specific barcode directory hosting fastq files (E.g., /data/AFO_V3/sample1/20240125_1411_MS00000_id_XXXXX/fastq_pass/barcode01). However, I still encountered the same error.

Originally posted by @ayoraind in https://github.com/LooseLab/readfish/issues/221#issuecomment-1918765115

mattloose commented 5 months ago

Hi,

You say you used the human_chr_selection.toml file - is this the one shown here:

https://github.com/LooseLab/readfish/blob/main/docs/_static/example_tomls/human_chr_selection.toml

This toml file is not necessarily compatible with barcoded runs for analysis with readfish stats - did you use this toml in the generation of the data? The data you are looking at are obviosuly barcoded and readfish stats will be assuming that (and inferring it from the reads).

If you did run adaptive sampling on a barcoded run with tthe human_chr_select.toml then we might need to tweak the readfish stats program to better handle this.

The adaptive sampling experiment should have worked regardless.

ayoraind commented 5 months ago

Yes, I used the human_chr_selection.toml file shown in the link you posted. I ran adaptive sampling using the human_chr_select.toml file.

The (stop-gap) solution was to

Seems to be working so far, although it is taking quite a while to align the FASTQ (currently at 2hr 33 minutes).

mattloose commented 5 months ago

OK - that's a plausible workaround. Let us know what the results look like - I'm not quite sure how it will behave.

ayoraind commented 5 months ago

Seems like it worked. I got the stats after 3hrs 54 minutes.

image

bjaysheel commented 4 months ago

Hi, What fastq-directory was used for stats, the fastq_pass directory or the root of output folder as you indicated in your first post?

ayoraind commented 4 months ago

Hi @bjaysheel,

I specified the file path to the run directory (e.g., /data/AFO_V3).

github-actions[bot] commented 3 months ago

This issue is stale because it has been open 30 days with no activity. Remove stale label or comment or this will be closed in 5 days.

github-actions[bot] commented 3 months ago

This issue was closed because there has been no response for 5 days after becoming stale.