PASTAplus / PASTA

Repository for the Provenance Aware Synthesis Tracking Architecture (PASTA) project.
12 stars 3 forks source link

PASTA EML parsing results in addition of whitespace when <superscript> and <subscript> are adjacent #33

Closed servilla closed 3 years ago

servilla commented 3 years ago

An Nguyen (BLE) reported that the rendering of two adjacent <superscript> and <subscript> elements contains extra whitespace when viewed on the portal-s when the original EML XML does not have whitespace between the elements:

On Thu, Oct 22, 2020 at 2:20 PM An T. Nguyen enthusiast@utexas.edu wrote: Hello EDI,

We noticed that in this dataset on staging, the abstract and method sections have a minor formatting error that we think occurs on EDI end. E.g. if there are subscripts and superscript text elements next to each other, the HTML display inserts a space between. See screenshot:

-- An T. Nguyen Beaufort Lagoon Ecosystems Long Term Ecological Research Network The University of Texas at Austin

knb-lter-ble 17 1

This can also be seen by reviewing the data package summary page (aka landing page) at: https://portal-s.edirepository.org/nis/metadataviewer?packageid=knb-lter-ble.17.1

As an example of the raw XML element of the EML <abstract>

<abstract>
      <para>
Permafrost cores (4.5-7.5 m long) were collected along a geomorphic gradient near Drew Point, Alaska to characterize active layer and permafrost geochemistry and material properties. Cores were collected from a young drained lake basin, an ancient drained lake basin, and primary surface that has not been reworked by thaw lake cycles. Measurements of total organic carbon (TOC) and total nitrogen (TN) content, stable carbon isotope ratios (δ<superscript>13</superscript>C) and radiocarbon (<superscript>14</superscript>C) analyses of bulk soils/sediments were conducted on 45 samples from 3 permafrost cores. Porewaters were extracted from these same core sections and used to measure salinity, dissolved organic carbon (DOC), total dissolved nitrogen (TDN), anion (Cl<superscript>-</superscript>, Br<superscript>-</superscript>, SO<subscript>4</subscript><superscript>2-</superscript>, NO<subscript>3</subscript><superscript>-</superscript>), and trace metal (Ca, Mn, Al, Ba, Sr, Si, and Fe) concentrations. Radiogenic strontium (<superscript>87</superscript>Sr/<superscript>86</superscript>Sr) was measured on a subset of porewater samples. Cores were also sampled for material property measurements such as dry bulk density, water content, and grain size fractions.
</para>
    </abstract>
servilla commented 3 years ago

Deployed to production.