Clinical-Genomics / BALSAMIC

Bioinformatic Analysis pipeLine for SomAtic Mutations In Cancer
https://balsamic.readthedocs.io/
MIT License
44 stars 16 forks source link

Log file for BALSAMIC #862

Closed moahaegglund closed 1 year ago

moahaegglund commented 2 years ago

Is your feature request related to a problem? Please describe. Today we don't keep track of the input files in BALSAMIC. We need to have a log file where the input fastq files are stated.

Describe the solution you'd like The goal is to have a log file for each case that is saved to housekeeper bundles. This would make it a lot easier to follow up analyses.

Describe alternatives you've considered An alternative solution, while waiting for the case specific log file, could be to have a common log file for all BALSAMIC cases that are started.

Current BALSAMIC version 8.2.5

khurrammaqbool commented 2 years ago

balsamic generates a log file BALSAMIC_run/.snakemake/log/*.snakemake.log, but do we need to store it in housekeeper everytime balsamic run on the case?

vwirta commented 2 years ago

Let's discuss this on next meeting. It would be nice to have a log file of each analysis that keeps track of what has been analysed and how.

pbiology commented 1 year ago

Let's also make sure that these logs contain versioning of all tools used.

ivadym commented 1 year ago

Closing!

Changes in develop are actually reflecting the request here since we've started to add a more detailed sample dictionary to the config.json file, including the linked fastq files ({lane}_{date_str}_{flowcell}_{sample}_{index}_R_{read}.fastq.gz). The config.json file is also saved in HK, so it would be possible to trace which files have been used for past analyses.

"samples": [
        {
            "type": "tumor",
            "name": "ACC5821A7",
            "fastq_info": {
                "2_171015_HKWN2DSXX_ACC5821A7_XXXXXX_R": {
                    "fwd": "/home/proj/stage/cancer/cases/fleetjay/fastq/2_171015_HKWN2DSXX_ACC5821A7_XXXXXX_R_1.fastq.gz",
                    "rev": "/home/proj/stage/cancer/cases/fleetjay/fastq/2_171015_HKWN2DSXX_ACC5821A7_XXXXXX_R_2.fastq.gz"
                },
                "4_171015_HKWN2DSXX_ACC5821A7_XXXXXX_R": {
                    "fwd": "/home/proj/stage/cancer/cases/fleetjay/fastq/4_171015_HKWN2DSXX_ACC5821A7_XXXXXX_R_1.fastq.gz",
                    "rev": "/home/proj/stage/cancer/cases/fleetjay/fastq/4_171015_HKWN2DSXX_ACC5821A7_XXXXXX_R_2.fastq.gz"
                },
                "3_171015_HKWN2DSXX_ACC5821A7_XXXXXX_R": {
                    "fwd": "/home/proj/stage/cancer/cases/fleetjay/fastq/3_171015_HKWN2DSXX_ACC5821A7_XXXXXX_R_1.fastq.gz",
                    "rev": "/home/proj/stage/cancer/cases/fleetjay/fastq/3_171015_HKWN2DSXX_ACC5821A7_XXXXXX_R_2.fastq.gz"
                },
                "2_171015_HKL7FDMXX_ACC5821A7_XXXXXX_R": {
                    "fwd": "/home/proj/stage/cancer/cases/fleetjay/fastq/2_171015_HKL7FDMXX_ACC5821A7_XXXXXX_R_1.fastq.gz",
                    "rev": "/home/proj/stage/cancer/cases/fleetjay/fastq/2_171015_HKL7FDMXX_ACC5821A7_XXXXXX_R_2.fastq.gz"
                },
                "1_171015_HKL7FDMXX_ACC5821A7_XXXXXX_R": {
                    "fwd": "/home/proj/stage/cancer/cases/fleetjay/fastq/1_171015_HKL7FDMXX_ACC5821A7_XXXXXX_R_1.fastq.gz",
                    "rev": "/home/proj/stage/cancer/cases/fleetjay/fastq/1_171015_HKL7FDMXX_ACC5821A7_XXXXXX_R_2.fastq.gz"
                },
                "1_171015_HKWN2DSXX_ACC5821A7_XXXXXX_R": {
                    "fwd": "/home/proj/stage/cancer/cases/fleetjay/fastq/1_171015_HKWN2DSXX_ACC5821A7_XXXXXX_R_1.fastq.gz",
                    "rev": "/home/proj/stage/cancer/cases/fleetjay/fastq/1_171015_HKWN2DSXX_ACC5821A7_XXXXXX_R_2.fastq.gz"
                }
            }
        },

And for version tracking, we already have an open feature request: