FlyBase / GO-curation

For projects related to GO curation in FlyBase
MIT License
0 stars 0 forks source link

Test PANGEA 1.1 beta #59

Closed hattrill closed 1 year ago

hattrill commented 1 year ago

@gantonazzo I am just making a ticket to list the issues that we come across. Perhaps we can reciprocally test the bugs to see if we see them

https://www.flyrnai.org/tools/pangea/web/home/7227

hattrill commented 1 year ago

Issue List (edit to update): Adding significant issues to https://docs.google.com/spreadsheets/d/1CwWgdn4DZlNryiu8vzj0e6e6js3PWwUy2CXQ4Fm3roI/edit#gid=0

INPUT

  1. Export gene list from FB not working. Re-openned https://flybase.atlassian.net/browse/WEB-2028?page=com.atlassian.jira.plugin.system.issuetabpanels%3Aall-tabpanel [FIXED]

GRAPH NETWORK

  1. Can't seem to generate a graph network [FIXED]

  2. edges need to be different colours to see overlaps

    Screenshot 2023-01-11 at 09 53 51
  3. With large gene sets would be good if nodes were more seperate e.g. here the abnormal cell number hides the DO overlaps - can we use colour to help here?

    Screenshot 2023-02-01 at 18 07 23
  4. The selection buttons aren't super-obvious as being alternative choices

    Screenshot 2023-02-01 at 17 53 43

SETS

  1. Generic GO consortium Subsets (GO slim) looks like old FB SLIM1. Update with the GOC labelled as goslim_generic in go-basic.obo

  2. Should we include the AGR and fly ribbon subsets? If we do want to have this limited set, should just use the AGR subset across the organsims for consistency.

  3. Looks like only "hierarchy" set uses hierarchy. All except direct should use hierarchy - I think we should get rid off 'direct' as it's not correct to use it like this.

  4. Slimming over regulation in BP would be a nice addition? should we add this set?

  5. Remove root terms from sets as either used with ND evidence code to show that nothing is known or, if using graph then everything in tht aspect will map up: GO:0008150 biological_process GO:0003674 molecular_function GO:0005575 cellular_component Screenshot 2023-02-01 at 13 04 02

  6. COMPLEAT sets don't seem to make much sense: think that we should restict these to geniune complexes or interpretation will be confusing with other sets.

    Screenshot 2023-01-11 at 20 31 41
  7. Look at the way that the expression sets have been generated. Preferred tissue (modEncode RNA_seq) - I don't think that this works very well for enrichment - how can we improve this?

    Screenshot 2023-01-11 at 20 52 08
  8. Expression annotation AGR - looks like just direct to terms - would be better if used ontology structure - perhaps we could use the AGR slim for this?

BAR VISUALIZATION

  1. Can't see Menu ("Use the ... menu to download svg/pngs")
  2. Would be nice to be able to select GO terms that seem interesting - can't even copy-paste them from axis
  3. [set number] is next to GO name for every point on chart. There doesn't appear to be a way to map this to a set name. Screenshot 2023-01-11 at 10 14 52

RESULTS TABLE

  1. Selecting sets for visualization is a bit clunky - have to use alt/shift to select multiple, when none are selected all are analysed. Could be better - select all as default; clear button, unselect by specifically clicking on box not clearing when clicking another
  2. Table on first page and "Mapped Gene Graph Visualization " page should be the same
  3. How can we show hierarchy of GO in results?
  4. Show top 100 in list, as top 10 mis-leading

DOC Some of FAQ and Genesets info should be together

OTHER SPECIES ISSUES

  1. When on results page, click NEW SEARCH button goes back to drosophila pge https://www.flyrnai.org/tools/pangea/web/home/7227 [added issue to Testing sheet]

  2. Worm example list and background are the same [added issue to Testing sheet]

  3. Worm phenotype section contains expression [added issue to Testing sheet]

    Screenshot 2023-02-01 at 18 50 54
  4. . For Rat and Z. fish the test set when combined with an EBI CP enrichment brings up a query error [added issue to Testing sheet]

Screenshot 2023-02-01 at 12 25 45

Other

  1. Could we open a new tab for other visualization pages - would be nice if we could keep the selection sets from the first page
  2. Rename category FlyBase phenotype for classical alleles to FlyBase Phenotype (we can use doc to explain this as these are mainly classical and insertion alleles). [added issue to Testing sheet]
  3. Rename any AGR refs to Alliance or they will be sad e.g. Disease annotation AGR to Alliance Disease annotation Expression annotation AGR to Alliance Expression annotation
  4. Can't go back to query page with input - would be good as you could add more sets if you had another thought [FIXED]

TESTERS SHEET: https://docs.google.com/spreadsheets/d/1CwWgdn4DZlNryiu8vzj0e6e6js3PWwUy2CXQ4Fm3roI/edit#gid=0

hattrill commented 1 year ago

Nice Examples for paper: to show also non-overlapping set

Screenshot 2023-01-11 at 14 28 44
hattrill commented 1 year ago

Example of where groups overlap and don't

Screenshot 2023-01-11 at 14 43 11
hattrill commented 1 year ago

Good example of how different BP and CCs group . With more colours, this would be clearer.

Screenshot 2023-01-11 at 20 02 36
hattrill commented 1 year ago

TEST SETS FLY QuickSearch, pheno_anat fat body fat_body_affected.txt

WORM using phenotype ontology viewer for query: List of 247 genes that were annotated with WBPhenotype:0000598 alimentary system morphology variant or any of its transitive descendant genes_direct_and_inferred_for_WBPhenotype 0000598.txt

hattrill commented 1 year ago

For Worm BP using hier. showing overlap between endocytosis digestive tract morphogenesis tube development enrichment(1)

hattrill commented 1 year ago

Note about advantages: can use enrichment and classification on same sets

hattrill commented 1 year ago

Nice illustration of how certain diseases are associated with certain cell features/exp using http://preview.flybase.org/reports/FBgg0000056.html gene group.

enrichment(8)

and with even more anat/cc/disease enrichment(9)

gantonazzo commented 1 year ago

Still a few issues remain when running a multiple search query:

1) When I run a multiple search I never get to the result page automatically, I always have to click on the “Results” button for redirection

Screenshot 2023-02-08 at 15 05 25

2) Typo after submission: “Cacluation progress” instead of “Calculation progress” (see previous screenshot)

3) It seems when I select all rows to plot, no graph gets generated. It works as intended when you just select a subset of rows

4) overlapping labels in graph is still a issue (Gene

Screenshot 2023-02-08 at 15 04 07

5) Maybe this is more like a preference, but it would be nice if the multiple search results table could show p-value corrections like in the single search

Multiple search

Screenshot 2023-02-08 at 15 09 00

Single search

Screenshot 2023-02-08 at 15 11 08