scieloorg / Web

SciELO Web
www.scielo.br
6 stars 21 forks source link

(BUG) DOIs duplicados para os artigos listados abaixo: #694

Closed robertatakenaka closed 5 years ago

robertatakenaka commented 5 years ago

Problema está na base de dados doi/query

robertatakenaka commented 5 years ago

doi_br_mst.txt

G4 hercules:/bases/scl.000/bases/doi/ci/v27n2/v27n2

G4 hercules:/bases/scl.000/bases/doi $ mx ci/v27n2/v27n2 S0100-19651998000200002
       1  S0100-19651998000200002
       1  Set #000000001
Hits=1
mfn=    19
880  "S0100-19651998000200002"
  1  "10.1590/S0100-19651998000200002"
  2  "doi^d20071122 155727 4 325"
  3  "art"
 91  "20071122 155727 4 325"
999  "^uhttp://dx.doi.org/10.1590/S0100-19651998000200002^lCrossRef"
900  "scielo_crs/databases/crossref/crossref_DOIReport^snew^d200711170950"
->x
G4 hercules:/bases/scl.000/bases/doi $ mx ci/v27n2/v27n2 S0100-19651998000200001
       1  S0100-19651998000200001
       1  Set #000000001
Hits=1
mfn=     4
880  "S0100-19651998000200001"
  1  "10.1590/S0100-19651998000200001"
  2  "doi^d20071122 155726 4 325"
  3  "art"
 91  "20071122 155726 4 325"
999  "^uhttp://dx.doi.org/10.1590/S0100-19651998000200001^lCrossRef"
900  "scielo_crs/databases/crossref/crossref_DOIReport^snew^d200711170950"

G4 hercules:/bases/scl.000/bases/doi/query

G4 hercules:/bases/scl.000/bases/doi $ mx query pid=S0100-19651998000200001
       1  PID=S0100-19651998000200001
       1  Set #000000001
Hits=1
mfn=640870
237  "10.1590/S0100-19651998000200002"
  2  "j"
880  "S1415-6555200800030001100024^cscl^d20110705 083557 2 185"
880  "S0100-1965199900030000400003^cscl^d20110705 083557 2 185"
880  "S0100-1965199900030000500001^cscl^d20110705 083557 2 185"
880  "S0100-1965200600030000400003^cscl^d20110705 083557 2 185"
880  "S0100-1965200700020000100003^cscl^d20110705 083557 2 185"
880  "S0100-1965200900020000700004^cscl^d20110705 083557 2 185"
880  "S0100-1965200900030001200002^cscl^d20110705 083557 2 185"
880  "S1516-8484200200010000200005^cscl^d20110713 085849 3 193"
880  "S0034-8910199900010000100001^cspa^d20110829 101210"
880  "S1413-8123200400020001100016^cspa^d20110829 101210"
880  "S0100-19651998000200001^cscl^d20110909 104221"
880  "S0100-19651998000200002^cscl^d20110909 104221"
->

G4 hercules:/bases/scl.000/bases/doi $ mx query pid=S0100-19651998000200002
       1  PID=S0100-19651998000200002
       1  Set #000000001
Hits=1
mfn=640870
237  "10.1590/S0100-19651998000200002"
  2  "j"
880  "S1415-6555200800030001100024^cscl^d20110705 083557 2 185"
880  "S0100-1965199900030000400003^cscl^d20110705 083557 2 185"
880  "S0100-1965199900030000500001^cscl^d20110705 083557 2 185"
880  "S0100-1965200600030000400003^cscl^d20110705 083557 2 185"
880  "S0100-1965200700020000100003^cscl^d20110705 083557 2 185"
880  "S0100-1965200900020000700004^cscl^d20110705 083557 2 185"
880  "S0100-1965200900030001200002^cscl^d20110705 083557 2 185"
880  "S1516-8484200200010000200005^cscl^d20110713 085849 3 193"
880  "S0034-8910199900010000100001^cspa^d20110829 101210"
880  "S1413-8123200400020001100016^cspa^d20110829 101210"
880  "S0100-19651998000200001^cscl^d20110909 104221"
880  "S0100-19651998000200002^cscl^d20110909 104221"
->

Esta base doi/query era gerada por um processamento que consultava o CrossRef e retornava o PID do DOI consultado e também o PID das referências. Está guardando os PID no campo v880. Em algum momento deste processamento a consulta estava trazendo o valor incorreto do DOI.

robertatakenaka commented 5 years ago

Estava analisando a base doi/query e aparentemente era erro na query. Mas o pessoal da produção corrigiu os dados e o PID e DOI ficaram corretos na base artigo:

G4 hercules:/home/roberta.takenaka $ mx /bases/scl.000/bases/artigo/artigo hr=S0100-19651998000200002
       1  HR=S0100-19651998000200002
       1  Set #000000001
Hits=1
mfn=2824333
  4  "v27n2"
702  "V:\Scielo\serial\ci\v27n2\markup\scielo.html"
705  "S"
706  "h"
700  "2"
701  "1"
709  "article"
708  "1"
 71  "oa"
 40  "pt"
  1  "br1.1"
 42  "1"
120  "2.0"
 38  "FIG"
 38  "TAB"
121  "02"
 49  "CI020"
 30  "Ci. Inf."
 31  "27"
 32  "2"
 65  "19980000"
 35  "0100-1965"
 14  "^fnd^lnd"
237  "10.1590/S0100-19651998000200001"