scieloorg / PC-Programs

SciELO PC Programs
7 stars 24 forks source link

[xc] Tratar o traceback sobre documento vazio ao adicionar o pid v3 #3291

Closed robertatakenaka closed 4 years ago

robertatakenaka commented 4 years ago

Descrição do problema

Aparentemente falta um tratamento de exceção para quando o XML está inválido ao inserir o pid v3. Analisar esta hipótese

Traceback (most recent call last):
  File "/home/scielo/xc/app/xc_2020/xml/.venv/src/scielo-production-tools/src/scielo/bin/xml/prodtools/xc.py", line 163, in convert_package
    scilista_items, xc_status, mail_info = self.proc.convert_package(package)
  File "/home/scielo/xc/app/xc_2020/xml/.venv/src/scielo-production-tools/src/scielo/bin/xml/prodtools/processing/pkg_processors.py", line 421, in convert_package
    conversion.register_pids_and_update_xmls(self.pid_manager)
  File "/home/scielo/xc/app/xc_2020/xml/.venv/src/scielo-production-tools/src/scielo/bin/xml/prodtools/processing/pkg_processors.py", line 127, in register_pids_and_update_xmls
    update_article_with_aop_status=self.db.get_valid_aop,
  File "/home/scielo/xc/app/xc_2020/xml/.venv/src/scielo-production-tools/src/scielo/bin/xml/prodtools/data/kernel_document.py", line 79, in add_article_id_to_received_documents
    tree = xml_utils.get_xml_object(file_path)
  File "/home/scielo/xc/app/xc_2020/xml/.venv/src/scielo-production-tools/src/scielo/bin/xml/prodtools/utils/xml_utils.py", line 262, in get_xml_object
    return etree.parse(file_path, parser)
  File "src/lxml/etree.pyx", line 3521, in lxml.etree.parse
  File "src/lxml/parser.pxi", line 1839, in lxml.etree._parseDocument
  File "src/lxml/parser.pxi", line 1865, in lxml.etree._parseDocumentFromURL
  File "src/lxml/parser.pxi", line 1769, in lxml.etree._parseDocFromFile
  File "src/lxml/parser.pxi", line 1163, in lxml.etree._BaseParser._parseDocFromFile
  File "src/lxml/parser.pxi", line 601, in lxml.etree._ParserContext._handleParseResultDoc
  File "src/lxml/parser.pxi", line 711, in lxml.etree._handleParseResult
  File "src/lxml/parser.pxi", line 640, in lxml.etree._raiseParseError
  File "/tmp/tmpid36bp1_/scielo_package/1413-8123-csc-25-07-2755.xml", line 1
lxml.etree.XMLSyntaxError: Document is empty, line 1, column 1

Passos para reproduzir o problema

  1. Executar o XC para o pacote:
    • 1413-8123-csc-25-07.zip (1413-8123-csc-25-07-2755.xml)

Comportamento esperado

que o processo do xc termine sem levantar exceção

Screenshots ou vídeos

n/a

Anexos

n/a

Ambiente utilizado

dsteste possivelmente arquivo esteja em 2020_archive