rega-cev / phylogeotool

4 stars 3 forks source link

Google Chart - Amount of taxa / Amount of sequences in the cluster #21

Closed Ewout1988 closed 7 years ago

Ewout1988 commented 7 years ago

@GuyBaele is more confused by the fact that the amount of taxa isn't really lining up with the amount of sequences in a cluster. There's two reasons for this:

  1. The number to the right of the gradient legend is the maximum amount of sequences in one country. If you take for example http://phylogeotool.gbiomed.kuleuven.be/euresist/ then 11,169 are the number of sequences originating from Italy as that's the maximum value and the most intensive colour.
  2. Even if we'd sum up all values from all countries shown, we wouldn't come to the amount of taxa in our tree/in a cluster because there are sequences for which we don't know the country of origin.
GuyBaele commented 7 years ago

I'm fine with this, as long as we explain this clearly somewhere.

plibin commented 7 years ago

In the user manual?

GuyBaele commented 7 years ago

Sure.

ktheyss commented 7 years ago

The following sentence was added to the user manual: "The maximum value of the gradient bar shown denotes the highest number of sequences in the dataset for a single country.".

The complete paragraph is then as follows: "The top left panel shows a map where each country is colored according to a gradient, where a darker color signifies that more sequences are originating from this country. The maximum value of the gradient bar shown denotes the highest number of sequences in the dataset for a single country. A drop-down box allows you to select the geographic region on which you want to focus (e.g. Europe, North America, \ldots)."

Ewout1988 commented 7 years ago

That seems to be complete. Is this issue resolved then?