galaxyproject / usegalaxy-playbook

Ansible Playbook for usegalaxy.org
Academic Free License v3.0
30 stars 24 forks source link

Extract Genomic DNA versions 3.x+ do not link with built-in genome indexes #286

Open jennaj opened 4 years ago

jennaj commented 4 years ago

Workaround for end-users:

Please use bedtools GetFastaBed instead. It performs the same basic function plus has more options.

Tool: Extract Genomic DNA using coordinates from assembled/unassembled genomes (Galaxy Version 3.0.3)

History:

cc @natefoo @mvdbeek

Related tickets:

mvdbeek commented 4 years ago
  • Version 3.x (iuc update) does not link-up with the proper data tables as structured at usegalaxy.org. We adjusted this for version 2.x a few years ago. Maybe the same can be done again?

It just needs the twobit loc entries, are we producing them ? AFAIK we also need them for trackster

jennaj commented 4 years ago

Three fixed problems, thank you!

  1. Extract Genomic DNA finds indexes at the versions available: 3.0.3 and 3.0.3+galaxy2.
  2. Versions 2.x no longer are listed in the tool panel.
  3. If a prior run using 2.x is "rerun", there is a warning and the newest version of the tool is loaded.
Screen Shot 2020-03-02 at 8 13 52 AM

However, there are two (minor) configuration problems

  1. The older version 3.0.3 is what is loaded when clicking on the tool from the tool panel.
  2. There is no warning that the newer version 3.0.3+galaxy2 is available on top of the 3.0.3 tool form.
Screen Shot 2020-03-02 at 8 05 43 AM

Update: Correct/most current tool version now loads from the tool panel

jennaj commented 3 years ago

Update: Extract Genomic DNA version 3.0.3+galaxy2 is also now unlinked from indexes.

ping @davebx @natefoo

jennaj commented 3 years ago

Retest history: https://usegalaxy.org/u/jen-galaxyproject/h/test-extract-genomic-dna

Status:

20201216_test1-extractgenomicdna-mm9-not-found
mvdbeek commented 3 years ago

There is no mm9 twobit. You can go to the admin panel -> Local Data -> towbit (https://usegalaxy.org/admin/data_manager/table/twobit) to check the available indexes.

jennaj commented 3 years ago

@mvdbeek thank you and agree. The larger problem is that existing twoBit data were not migrated out of the byhand dbkeys. Most if not all have the twoBit in the seq directory.

Probably also the root issues for this (Trackster). Punch line: cannot recognize dataset to add into an existing Trackster session, even when the dbkeys/database metadata match up. A single dataset Trackster is the only way it works for so many of the current/major genomes since they were indexed before DMs existed. https://github.com/galaxyproject/usegalaxy-playbook/issues/276

jennaj commented 3 years ago

Update: More genomes with existing twoBit data in CVMFS byhand: ../dbkey/seq/dbkey.2bit are being populated into the proper loc by @davebx

hg19 and mm9 were both already specifically done. hg19 issues were more complex -- but mm9 is a "how-to-fix-it" model for the others.

jennaj commented 3 years ago

Fixed, thanks all

jennaj commented 2 years ago

Update 2022-01-04

Some indexes are missing again at .org.

Example: hg38 missing, mm9 found

Test histories

ping @mvdbeek @davebx @natefoo

jennaj commented 2 years ago

@davebx -- also checked the test server, and that particular 2bit is missing (weird). I created that file when first indexing hg38, and we still have it in CVMFS. Thought that was migrated from the old > new loc already, but maybe was missed. Don't see any others that are obvious/missing.

So, for now, just this line needs to be added to the byhand twobit.loc. I should still be able to do that, will SOS if get stuck: /cvmfs/data.galaxyproject.org/managed/seq/hg38.2bit

(Reminder for Jen: refgenie isn't configured to create 2bit files -- only lastz and extract still use it -- maybe we deprecate to save space (ask Dan) http://datacache.galaxyproject.org/refgenomes-databio/)

jennaj commented 2 years ago

The tool indexes are missing again. Not sure if related to resent server updates.

Should we decide to deprecated this tool? Do any others besides lastz use .2bit indexes?