vscentrum / vsc-software-stack

Central repository of easyconfigs used in the software installations on VSC clusters.
2 stars 6 forks source link

Medaka #352

Closed laraPPr closed 3 months ago

laraPPr commented 5 months ago
PetrKralCZ commented 5 months ago
PetrKralCZ commented 5 months ago
boegel commented 5 months ago

@PetrKralCZ Let's stick to foss/2023a for this, since we don't have TensorFlow with 2023b yet.

This was requested with GPU support though, so we'll need TensorFlow with foss/2023a + CUDA`

@laraPPr Can you dig up more details on what wasn't working with previous versions, and drop that info in here so @PetrKralCZ can check on that?

laraPPr commented 5 months ago

Ok just ran it quickly on their account and it might be something on their site

because they are doing this in the script

model='/kyukon/scratch/gent/vo/000/gvo00011/vsc40014/rerio/dorado_models/res_dna_r10.4.1_e8.2_400bps_sup@v4.0.1

Cannot import pyabpoa, some features may not be available.

Cannot import pyabpoa, some features may not be available.

Cannot import pyabpoa, some features may not be available.

Checking program versions

This is medaka 1.11.3

Cannot import pyabpoa, some features may not be available.

Program    Version    Required   Pass     

bcftools   1.18       1.11       True     

bgzip      1.18       1.11       True     

minimap2   2.26       2.11       True     

samtools   1.18       1.11       True     

tabix      1.18       1.11       True     

Cannot import pyabpoa, some features may not be available.

Traceback (most recent call last):

  File "/apps/gent/RHEL8/cascadelake-ib/software/medaka/1.11.3-foss-2023a/bin/medaka", line 8, in <module>

    sys.exit(main())

             ^^^^^^

  File "/apps/gent/RHEL8/cascadelake-ib/software/medaka/1.11.3-foss-2023a/lib/python3.11/site-packages/medaka/medaka.py", line 814, in main

    args.func(args)

  File "/apps/gent/RHEL8/cascadelake-ib/software/medaka/1.11.3-foss-2023a/lib/python3.11/site-packages/medaka/medaka.py", line 316, in is_rle_model

    print(is_rle_encoder(args.model))

          ^^^^^^^^^^^^^^^^^^^^^^^^^^

  File "/apps/gent/RHEL8/cascadelake-ib/software/medaka/1.11.3-foss-2023a/lib/python3.11/site-packages/medaka/medaka.py", line 322, in is_rle_encoder

    modelstore = medaka.models.open_model(model_name)

                 ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^

  File "/apps/gent/RHEL8/cascadelake-ib/software/medaka/1.11.3-foss-2023a/lib/python3.11/site-packages/medaka/models.py", line 222, in open_model

    raise ValueError(

ValueError: Model /kyukon/scratch/gent/vo/000/gvo00011/vsc40014/rerio/dorado_models/res_dna_r10.4.1_e8.2_400bps_sup@v4.0.1 does not have .hdf5 or .gz extension.
boegel commented 5 months ago

@laraPPr What did you run to get this output?

laraPPr commented 5 months ago

medaka_consensus -t 24 -o $out -m $model -d $genome -i $reads

laraPPr commented 5 months ago

The model is from here https://github.com/nanoporetech/rerio

PetrKralCZ commented 5 months ago
boegel commented 5 months ago

Looks like pyabpoa is a missing dependency? If that can be added, we'll install medaka v1.12.0 and see if that works

boegel commented 4 months ago

GPU variant requires:

PetrKralCZ commented 3 months ago

GPU variant:

boegel commented 3 months ago

overlaps with #361, all done, ready for cleanup & closing