MPDL / dataverse

Open source research data repository software
http://dataverse.org
Other
1 stars 0 forks source link

Add a mapping for the different organization names #41

Closed helkv closed 1 year ago

helkv commented 2 years ago

The Author Organization Facet lists different names for the same Max Planck Institutes. (For example "MPI for Chemical Ecology" and "Max Planck Institute for Chemical Ecology" are two spellings of the same institute).

Add a mapping to the Solr index, so that each Max Planck Institute with different spelling is mapped to the official name of the Institute (ideally the ROR name). This way the submitted organization name is not changed, but organizations with different names are listed under one name in the Author Organization Facet.

Use the following mappings for the Institute names:

Max Planck Institut für Chemie -> Max Planck Institute for Chemistry
Max-Planck-Institut für Kolloid- und Grenzflächenforschung -> Max Planck Institute of Colloids and Interfaces
MPI for -> Max Planck Institute for
MPI of -> Max Planck Institute of
helkv commented 2 years ago

Implementation, see commit: ccffebf46839c82371d92fd6a3bf9cf2d69db6c1

wfvs commented 2 years ago

changes to schema.xml and the appropriate charFilterMapping are here: https://github.com/MPDL/dataverse/commit/ccffebf46839c82371d92fd6a3bf9cf2d69db6c1 After copying the files to Solr, a complete reindex is required:

  1. /api/admin/index/clear
  2. check if Solr index data is empty (/server/solr/collection1/data/index)
  3. remove remaining files and restart Solr.
  4. /api/admin/index
helkv commented 2 years ago

Testserver: qa-edmond2.mpdl.mpg.de Browser: ff Version: v5.11-mpdl-1 (07188f3) User: User Result: OK

wfvs commented 2 years ago

updated filter_mapping.txt according to https://jira.mpdl.mpg.de/browse/FDM-135

hofmannc commented 1 year ago

Testserver: qa-edmond2.mpdl.mpg.de Browser: ff Version: v5.12.1-mpdl-1 User: Admin Testcase: create data set -> enter author organization of left column-> save -> author organization of right column is displayed in facet -> dataset is included to right column Result: nok, mapping in qa not implemented

Max Planck institute for the science of human history, Jena, Germany | Max Planck Institute for the Science of Human History -- | -- MPI für Evolutionsbiologie | Max Planck Institute for Evolutionary Biology MPI für Eisenforschung GmbH | Max-Planck-Institut für Eisenforschung GmbH Department of Evolutionary Neuroethology, Prof. B. S. Hansson, Max Planck Institute for Chemical Ecology, Max Planck Society | Max Planck Institute for Chemical Ecology Max-Planck Institute of colloids and interfaces | Max Planck Institute of Colloids and Interfaces Max-Planck-Institut für empirische Ästhetik | Max Planck Institute for Empirical Aesthetics MPDL | Max Planck Digital Library max planck institute for empirical aesthetics | Max Planck Institute for Empirical Aesthetics Atmospheric Chemistry, Max Planck Institute for Chemistry, Max Planck Society | Max Planck Institute for Chemistry Andrew Mackenzie, Physics of Quantum Materials, Max Planck Institute for Chemical Physics of Solids, Max Planck Society | Max Planck Institute for Chemical Physics of Solids MPI für Kohlenforschung | Max-Planck-Institut für Kohlenforschung
hofmannc commented 1 year ago

Testserver: qa-edmond2.mpdl.mpg.de Browser: ff Version: v5.12.1-mpdl-1 User: Admin Testcase: create data set -> enter author organization of left column-> save -> author organization of right column is displayed in facet -> dataset is included to right column Result: nok, MPDL -> Max Planck Digital Library is not included to mapping. All other mapping pairs are ok.

haarli commented 1 year ago

Can be closed, as discussed