adsabs / ADSImportPipeline

Data ingest pipeline for ADS classic->ADS+
GNU General Public License v3.0
1 stars 12 forks source link

Index native author field #156

Open aaccomazzi opened 7 years ago

aaccomazzi commented 7 years ago

IOP at least provides some native author data (see e.g. 2017ApJ...843...68J), and ADSCachedExports currently exports it in its output, but we are not using it further downstream. I think we should index this in a separate author field, or even better in the same field and with the same position if possible at all.

      <author nr="1">
        <name>
          <western>Jiang, Yan-Fei</western>
          <native>&#x59DC;&#x71D5;&#x98DE</native>
          <normalized>Jiang, Y</normalized>
        </name>
        <type>regular</type>
      </author>
aaccomazzi commented 6 years ago

As a clarification, we don't want to use synonyms to handle situations such as this one, since the westerinized name (Jiang, Yan-Fei) is a lossy translation of its original script. So this really involves creating a new field.