SpeciesFileGroup / taxonworks

Workbench for biodiversity informatics.
http://taxonworks.org
Other
87 stars 26 forks source link

Botanical name modeling #4088

Open camwebb opened 3 days ago

camwebb commented 3 days ago

There have been several issues related to modeling botanical nomenclature, and several conversations on Matrix, and I thought I would try to write down the outstanding concerns, as I understand them. A Github issue seems as good a place as any to put this, for future reference. No immediate issue resolution is expected.

Perhaps the best way to start is with the minimal elements of a data model that I see as being necessary:

This can be visualized as:

A a X --> A a X ex Y --+---+---------> A a X ex Y             || synonym
                        \   \
                         \   +-------> A b var. a (X ex Y) Q  || synonym
                          \
                           +---------> B a (X ex Y) Z         || accepted

Here is a real world example:

Based on my exploration of TW - but not a comprehensive understanding of it! - several of the above capabilities seem not to be present:

In essence, It seems that TW offers a linear path from basionym to one of the new combinations, with a single preferred name to use for OTUs. Currently, in order to model branching a graph such as indicated above, it is not possible to just create a single protonym with combinations, but one must follow one of two approaches (omitting the complication of the ex):

  1. for each name (A a, B a, A b var. a) create a new protonym A a, then create the combinations for B a and A b var. a, and then indicate synonymy with relationship homotypic synonym between A a and B a, and A b var. a and B a, or,
  2. for each name (A a, B a, A b var. a) directly create new protonyms without combinations, using verbatim authorship to make the parentheses ((X) Q, (X) Z), and then indicate synonymy with relationship homotypic synonym between A a and B a, and A b var. a and B a, and is basionym of relationships from B a to A a, and A b var. a to A a.

However the creation of these additional protonyms seems to confuse the combination logic engine and the various independently created protonyms get ‘tangled up’ (demonstration offered, when we meet).

If there were to be no further discussion/development of this, approach (2) would be the way to go, being simpler, and completely avoiding the combination logic engine. However, this seems a shame, and currently does not export basionym relationships in the CoL dump.

None of this is a criticism of TW as it stands - these are just barriers I currently see for implementing a botanical monograph in TW. Thanks as ever for this powerful informatics tool.

StefanieBond commented 3 days ago

The code also uses “nomenclatural novelty”, it refers to any or all of the categories: name of a new taxon, new combination, name at new rank, and replacement name. I understand you are specifically talking about new combinations, but these can at the same time be a name at new rank (comb. & stat. nov.); a nomenclatural novelty with a basionym need not be either of these.