NCEAS / z-test-issues

Test issue imports from redmine
0 stars 0 forks source link

xs:string to ComplexType TextType, minOccurs=0, judiciously applied #437

Open mbjones opened 7 years ago

mbjones commented 7 years ago

Author Name: Margaret O'Brien (Margaret O'Brien) Original Redmine Issue: 3181, https://projects.ecoinformatics.org/ecoinfo/issues/3181 Original Date: 2008-03-21 Original Assignee: James Brunt


This is a summary of a recent discussion on eml-dev which does not appear to have been entered in bugzilla. Several people have expressed a need for additional structure in leaf nodes that are currently designated xs:string, generally to accommodate formatting for species binomials, chemical notation and lists. Examples include , <method>, and <protocol>. </p> <p>One solution is to change these from xs:string to txt:TextType. Since TextType is mixed content, it will not affect existing documents containing strings. The nodes to apply this change should be agreed on by this group, and this is not meant to be a work-around for eml which needs enhancement. Database implementations will need to correctly interpret the data typing when searching these elements. For more info on TextType, see bug 2703, and the docbook schema (<a rel="noreferrer nofollow" target="_blank" href="http://www.docbook.org/specs/">http://www.docbook.org/specs/</a>).</p> <p>EML 2.0.1 title element:</p> <xs:element name="title" type="xs:string" maxOccurs="unbounded"> EML 2.0.2 proposed title element: <xs:element name="title" type="txt:TextType" maxOccurs="unbounded"> Either of these is valid: <eml> <dataset> <title>Uptake of nitrogen by Alnus tenuifolia and Alnus crispa in six different successional habitats ... Uptake of nitrogen by <emphasis>Alnus tenuifolia</emphasis> and <emphasis>Alnus crispa</emphasis> in six different successional habitats ...

mbjones commented 7 years ago

Original Redmine Comment Author Name: Margaret O'Brien (Margaret O'Brien) Original Date: 2008-03-21T23:29:06Z


changed summary only, -- this is not a change to txt:TextType, since that would require adding children elements (para or section) late friday.

mbjones commented 7 years ago

Original Redmine Comment Author Name: Margaret O'Brien (Margaret O'Brien) Original Date: 2008-09-22T19:20:22Z


targeting for 2.1.0, although may drop back to unspecified.

mbjones commented 7 years ago

Original Redmine Comment Author Name: Margaret O'Brien (Margaret O'Brien) Original Date: 2008-10-01T16:57:17Z


Summary 2008-09-30: Matt Jones, Margaret O'Brien, James Brunt, Mark Servilla, Inigo San Gil, Chris Jones, Corinna Gries, Ken Ramsey

Retyping from xs:string to TextType to allow formatting in certain fields is not an ideal solution for 2 reasons (also discussed in previous comments to this bug)

  1. any embedded elements within titles and abstracts will create barriers to searching in both knb and other catalogs, and
  2. the practice of adding visual formatting to EML does not conform to the basic principle that XML should encode semantics not style.

A better solution might be to identify the terms which need additional formatting and create structures to accomodate them which can be transformed as needed. Known terms include species binomials and chemical formulae, which might be surrounded by tags such as and . would have simple content that could be transformed (e.g., to in stylesheets). Authors would need to employ another schema (e.g. ChemicalML) inside . Other types of terms used by EML authors will need to be identified.

A subgroup of eml-dev (James Brunt, Chris Jones) will consider possible olutions. Current uses of txt:TextType in instance docs should be examined to determine whether these are semantic or presentational, and to identify terms to consider for additional structures. Other schema languages could also be examined (FDGC has no provisions for formatting resource titles).

mbjones commented 7 years ago

Original Redmine Comment Author Name: Redmine Admin (Redmine Admin) Original Date: 2013-03-27T21:22:32Z


Original Bugzilla ID was 3181