ericphanson / arxiv-search

Elasticsearch-backed rewrite of arxiv-sanity
MIT License

Author carnage on 1801.06898v1 #25

Open EdAyers opened 6 years ago

EdAyers commented 6 years ago

Has country names as authors!

ericphanson commented 6 years ago

Hmm, something must be wrong with how we get the metadata. Via the API (http://export.arxiv.org/api/query?search_query=all:1801.06898&start=0&max_results=10), we get

    <author>
      <name>Eduard Vorobyov</name>
      <arxiv:affiliation xmlns:arxiv="http://arxiv.org/schemas/atom">Institute of Fluid Mechanics and Heat Transfer, TU Wien, 1060, Vienna, Austria</arxiv:affiliation>
      <arxiv:affiliation xmlns:arxiv="http://arxiv.org/schemas/atom">Research Institute of Physics, Southern Federal University, Stachki Ave. 194, 344090, Rostov-on-Don, Russia</arxiv:affiliation>
      <arxiv:affiliation xmlns:arxiv="http://arxiv.org/schemas/atom">Department of Astrophysics, University of Vienna, Vienna, 1180, Austria</arxiv:affiliation>
    </author>
    <author>
      <name>Vitaly Akimkin</name>
      <arxiv:affiliation xmlns:arxiv="http://arxiv.org/schemas/atom">Institute of Astronomy, Russian Academy of Sciences, Pyatnitskaya str. 48, 119017, Moscow, Russia</arxiv:affiliation>
    </author>
    <author>
      <name>Olga Stoyanovskaya</name>
      <arxiv:affiliation xmlns:arxiv="http://arxiv.org/schemas/atom">Novosibirsk State University, Lavrentieva str. 2, 630090, Novosibirsk, Russia and Boreskov Institute of Catalysis, Lavrentieva str. 5, 630090, Novosibirsk, Russia</arxiv:affiliation>
    </author>
    <author>
      <name>Yaroslav Pavlyuchenkov</name>
      <arxiv:affiliation xmlns:arxiv="http://arxiv.org/schemas/atom">Institute of Astronomy, Russian Academy of Sciences, Pyatnitskaya str. 48, 119017, Moscow, Russia</arxiv:affiliation>
    </author>
    <author>
      <name>Hauyu Baobab Liu</name>
      <arxiv:affiliation xmlns:arxiv="http://arxiv.org/schemas/atom">European Southern Observatory</arxiv:affiliation>
    </author>

which encodes each institution as a separate affiliation element under its author. However, the metadata we downloaded via the OAI interface (in the arXivRaw format) encodes the same information as

    <authors>Eduard Vorobyov (1,2,3), Vitaly Akimkin (4), Olga Stoyanovskaya (5),
      Yaroslav Pavlyuchenkov (4), and Hauyu Baobab Liu (6) ((1) Institute of Fluid
      Mechanics and Heat Transfer, TU Wien, 1060, Vienna, Austria, (2) Research
      Institute of Physics, Southern Federal University, Stachki Ave. 194, 344090,
      Rostov-on-Don, Russia, (3) Department of Astrophysics, University of Vienna,
      Vienna, 1180, Austria, (4) Institute of Astronomy, Russian Academy of
      Sciences, Pyatnitskaya str. 48, 119017, Moscow, Russia, (5) Novosibirsk State
      University, Lavrentieva str. 2, 630090, Novosibirsk, Russia and Boreskov
      Institute of Catalysis, Lavrentieva str. 5, 630090, Novosibirsk, Russia, (6)
      European Southern Observatory (ESO), Karl-Schwarzschild-Str. 2, D-85748
      Garching, Germany)</authors>

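which packs authors and affiliations into one free-text string instead of separate fields. That is presumably why places like Austria and Russia end up listed as authors.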

Maybe we're better off just using the API?
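A rough sketch of what reading authors from the API response might look like, using only the standard library. `parse_authors` and the `SAMPLE` document are illustrative: the `<entry>` wrapper and the Atom default namespace are added here so the snippet parses on its own, and only two of the five authors from above are included.

```python
import xml.etree.ElementTree as ET

# Namespace prefixes used for lookups below.
NS = {
    "atom": "http://www.w3.org/2005/Atom",
    "arxiv": "http://arxiv.org/schemas/atom",
}

# Trimmed sample; the <entry> wrapper is an assumption made for this sketch.
SAMPLE = """<entry xmlns="http://www.w3.org/2005/Atom">
  <author>
    <name>Vitaly Akimkin</name>
    <arxiv:affiliation xmlns:arxiv="http://arxiv.org/schemas/atom">Institute of Astronomy, Russian Academy of Sciences, Pyatnitskaya str. 48, 119017, Moscow, Russia</arxiv:affiliation>
  </author>
  <author>
    <name>Hauyu Baobab Liu</name>
    <arxiv:affiliation xmlns:arxiv="http://arxiv.org/schemas/atom">European Southern Observatory</arxiv:affiliation>
  </author>
</entry>"""


def parse_authors(xml_text):
    """Return one dict per <author>, keeping names and affiliations apart."""
    root = ET.fromstring(xml_text)
    authors = []
    for author in root.iter("{http://www.w3.org/2005/Atom}author"):
        name = author.findtext("atom:name", default="", namespaces=NS).strip()
        affiliations = [
            aff.text.strip()
            for aff in author.findall("arxiv:affiliation", NS)
            if aff.text
        ]
        authors.append({"name": name, "affiliations": affiliations})
    return authors


authors = parse_authors(SAMPLE)
for entry in authors:
    print(entry["name"], "|", entry["affiliations"])
```

Because each name sits in its own `<name>` element, commas inside an affiliation never leak into the author list.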

EdAyers commented 6 years ago

Also occurs on 1803.11046v1.
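To compare both affected papers through the API in one request, something like the following could build the URL. It relies on the API's `id_list` parameter; `build_query_url` is a hypothetical helper, and no request is actually sent here.

```python
from urllib.parse import urlencode

API_BASE = "http://export.arxiv.org/api/query"


def build_query_url(arxiv_ids, max_results=10):
    """Build an arXiv API URL that looks up the given IDs via id_list."""
    params = {
        "id_list": ",".join(arxiv_ids),
        "start": 0,
        "max_results": max_results,
    }
    # safe="," keeps the commas between IDs readable instead of %2C.
    return API_BASE + "?" + urlencode(params, safe=",")


url = build_query_url(["1801.06898v1", "1803.11046v1"])
print(url)
```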

EdAyers commented 6 years ago

I think that we should use the API and ignore OAI.