biocore / microsetta-interface

The Microsetta participant facing user interface
BSD 3-Clause "New" or "Revised" License

[How you compare] page specific revisions #93

Open wasade opened 3 years ago

wasade commented 3 years ago

Reduce technical language in text.

Neighbor comparisons to groups of samples:

wasade commented 3 years ago

@gwarmstrong are these comparisons feasible with the current API?

gwarmstrong commented 3 years ago

It depends on how you quantify "more similar", and what the metadata contain.

If, e.g., there is a binary metadata field for "eats a lot of plants", then you could easily query for the k nearest neighbors and see how many have a True in the "eats a lot of plants" category.

Does that sound reasonable? If not, tell me how you think it would make sense to turn the statement above into a proper mathematical specification, e.g., more similar to A or B -> argmin(avg. dist to A, avg. dist to B), and I can tell you if it's possible or evaluate the effort needed to make it possible.
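To make the two options concrete, here is a minimal sketch in plain Python. It is not the current API: the function names, the distance values, and the labels are all hypothetical, and it assumes the distances from one participant's sample to the other samples are already available as a list.

```python
from statistics import mean


def knn_label_fraction(distances, labels, k):
    """Fraction of the k nearest samples whose binary label is True.

    distances[i] is the (hypothetical) distance from the participant's
    sample to sample i; labels[i] is that sample's binary metadata value.
    """
    order = sorted(range(len(distances)), key=lambda i: distances[i])
    nearest = order[:k]
    return sum(1 for i in nearest if labels[i]) / len(nearest)


def closer_group(distances, group_a, group_b):
    """Return "A" or "B": argmin(avg. dist to A, avg. dist to B)."""
    avg_a = mean(distances[i] for i in group_a)
    avg_b = mean(distances[i] for i in group_b)
    return "A" if avg_a <= avg_b else "B"


# Made-up distances from one sample to five other samples.
distances = [0.10, 0.80, 0.25, 0.60, 0.30]
eats_plants = [True, False, True, False, False]

fraction = knn_label_fraction(distances, eats_plants, k=3)
group = closer_group(distances, group_a=[0, 2], group_b=[1, 3, 4])
print(fraction, group)
```

The first function needs binary metadata; the second only needs group membership, at the cost of averaging over every sample in each group.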

wasade commented 3 years ago

If the former is doable and easy, then let's do that. We do not have binary metadata for these, so it will need to pick the best from the possible

wasade commented 3 years ago

Additional comments from Justin

- the blue highlighted text looks like hyperlinks but is not; I suggest using a non-blue color, as blue is associated with hyperlinks
- I suggest replacing 'features' with 'taxa', and defining 'taxa'/'taxon' as a unit for species that we use because species concepts in microbes are hard to define
- I suggest using commas or periods in the numeric quantities where relevant (e.g., 1,000 vs. 1000)
    - I realize this varies from EU to USA, but perhaps you can have the website detect the VPN and vary based on that?
- I suggest moving this text to the beginning: "The most common types of organisms (in a very broad sense) that we observe in human associated microbiome samples are bacteria and archaea. But what are bacteria and archaea? Well, these are two domains of life that diverged from each other billions of years ago. Humans, on the other hand, are part of the eukaryotic domain of life."
- I suggest modifying this text: "How diverse is your microbiome? How many types of organisms are there? Did your sample capture everything there is to see- even the rarest of microbes? You can check several measures of this on the Your sample diversity tab." to something similar to "We just told you the number of taxa there are in your microbiome, but which important groups do they represent? How diverse is your sample? What is diversity? Did your sample capture everything there is to see- even the rarest of microbes? You can check several measures of this on the Your sample diversity tab."
- I suggest removing the significant figures from the age in the third paragraph
- I suggest modifying this text: "What are these organisms in your sample? To check what types of organisms we observed in your sample, please select Taxonomy from the navigation bar." to something similar to "What are the names of the organisms in your sample? To find out, please select Taxonomy from the navigation bar."
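
For the numeric-quantity suggestion above, a minimal sketch of what the formatting could look like in Python. The `format_count` helper and its `style` values are hypothetical, not part of the interface, and how the site would decide which style a visitor gets is left open here. It only handles whole-number counts.

```python
def format_count(value, style="us"):
    """Format a whole-number count with a thousands separator.

    style="us" gives 1,000; style="eu" gives 1.000. Both style names
    are made up for this sketch.
    """
    grouped = f"{int(value):,}"  # built-in grouping uses commas
    if style == "eu":
        return grouped.replace(",", ".")
    return grouped


print(format_count(1000))
print(format_count(1234567, style="eu"))
```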