Closed cpfaff closed 8 years ago
Well that is quite a bunch of information packed into the first post. The fist thing to be implemented is what we we already discussed. _A more generic structure for organism taxonomy_. EML can serve as a inspiration for that. We also need to _implement a recursive structure for the generic taxonomical naming with ranks and values_.
The new generic and recursive structure now allows for things like
<organism>
<organismicCoverage>
<organismItem>
<taxonomicClassification>
<taxonRankName>class</taxonRankName>
<taxonRankValue>coniferopsida</taxonRankValue>
<taxonomicClassification>
<taxonRankName>order</taxonRankName>
<taxonRankValue>coniferales</taxonRankValue>
<taxonomicClassification>
<taxonRankName>family</taxonRankName>
<taxonRankValue>piceoideae</taxonRankValue>
<taxonomicClassification>
<taxonRankName>subfamily</taxonRankName>
<taxonRankValue>piceoideae</taxonRankValue>
<taxonomicClassification>
<taxonRankName>genus</taxonRankName>
<taxonRankValue>picea</taxonRankValue>
<taxonomicClassification>
<taxonRankName>speices</taxonRankName>
<taxonRankValue>Picea abies</taxonRankValue>
</taxonomicClassification>
</taxonomicClassification>
</taxonomicClassification>
</taxonomicClassification>
</taxonomicClassification>
</taxonomicClassification>
</organismItem>
</organismicCoverage>
</organism>
The separation for the naming into botanical, zoological, virus and bacterial is interesting. We may benefit from that separation and granularity as well. This would have to be best implemented into the generic structure we have now.
While the split into atomic fields for the species names is interesting and useful in general I ask myself now if we really need to do that. This would extend our deepest level of our taxonomic classification just to be able to store separate parts of a string (species name) in separate fields. I am not sure about the benefit in the search. Just adding the elements of ABCD would also mean a dublication at least to a certain degree as. For example there is fields in ABCD for e.g. epithet, and genus. We allow for this in the generic structure with taxonRankName = gensus ..., taxonRankName = species. Ok the full species name and the genus would also mean a slight dublication as the species name can contain the genus plus the epithet. Maybe we just skip this. @EichenbergBEF What do you mean?
Another possible way to go if we decide to go with more detail would be to copy ABCD and then somehow prevent that somebody provides the species and the genus in the taxonomic classification to prevent redundant information.
Not sure if I fully understand. You think that it does not really make sense to split the taxonomic classification into separate field. However, as we discussed, this would allow a higher flexibility within the search in terms of abstraction to e.g. higher taxonomic levels. Example already given: I have data on Picea abies and now aslo want to find all datasets concerning the same genus (i.e. Picea). But maybe there are also other ways you already thought of to allow this.
Am 18.12.2015 um 12:45 schrieb Claas-Thido Pfaff:
While the split into atomic fields for the species names is interesting and useful in general I ask myself now if we really need to do that. This would extend our deepest level of our taxonomic classification just to be able to store separate parts of a string (species name) in separate fields. I am not sure about the benefit in the search. Just adding the elements of ABCD would also mean a dublication at least to a certain degree as. For example there is fields in ABCD for e.g. epithet, and genus. We allow for this in the generic structure with taxonRankName = gensus ..., taxonRankName = species. Maybe we just skip this. @EichenbergBEF https://github.com/EichenbergBEF What do you mean?
— Reply to this email directly or view it on GitHub https://github.com/cpfaff/cas/issues/17#issuecomment-165759158.
Dr. rer. nat. David Eichenberg BEF-China research consortium Data manager (BEF China Dataportal) Tel: 0049-341-9738587 Department of Systematic Botany and Functional Biodiversity University of Leipzig Room 120 Johannisalles 21 04103 Leipzig GERMANY
Now we can do the following for example:
<organism>
<organismicCoverage>
<organismItem>
<taxonomicClassification>
<taxonRankName>class</taxonRankName>
<taxonRankValue>coniferopsida</taxonRankValue>
<taxonomicClassification>
<taxonRankName>order</taxonRankName>
<taxonRankValue>coniferales</taxonRankValue>
<taxonomicClassification>
<taxonRankName>family</taxonRankName>
<taxonRankValue>piceoideae</taxonRankValue>
<taxonomicClassification>
<taxonRankName>subfamily</taxonRankName>
<taxonRankValue>piceoideae</taxonRankValue>
</taxonomicClassification>
</taxonomicClassification>
</taxonomicClassification>
</taxonomicClassification>
<organismName>
<botanicalName>
<genusOrMonomial>
picea
</genusOrMonomial>
<firstEpithet>
abies
</firstEpithet>
<authorTeamParenthesis>
(L.)
</authorTeamParenthesis>
<authorTeam>
H.Karst
</authorTeam>
</botanicalName>
</organismName>
</organismItem>
</organismicCoverage>
</organism>
In Organisms we do:
ABCD does:
EML does
DwC does: