sul-dlss / dlme-airflow

This is a new repository to capture the work related to the DLME ETL Pipeline and establish airflow
Apache License 2.0

Manchester endpoint returning a 404 #376

Closed aaron-collier closed 3 weeks ago

aaron-collier commented 1 year ago
(dlme-airflow-py3.10) ➜  dlme-airflow git:(readme) bin/get manchester nashriyah
Traceback (most recent call last):
  File "/Users/edsummers/Projects/sul-dlss/dlme-airflow/bin/get", line 68, in <module>
    main(opts)
  File "/Users/edsummers/Projects/sul-dlss/dlme-airflow/bin/get", line 38, in main
    df = collection.catalog.read()
  File "/Users/edsummers/Projects/sul-dlss/dlme-airflow/dlme_airflow/drivers/oai_xml.py", line 255, in read
    self._load_metadata()
  File "/Users/edsummers/Library/Caches/pypoetry/virtualenvs/dlme-airflow-4uPLCoq4-py3.10/lib/python3.10/site-packages/intake/source/base.py", line 285, in _load_metadata
    self._schema = self._get_schema()
  File "/Users/edsummers/Projects/sul-dlss/dlme-airflow/dlme_airflow/drivers/oai_xml.py", line 207, in _get_schema
    self._open_set()
  File "/Users/edsummers/Projects/sul-dlss/dlme-airflow/dlme_airflow/drivers/oai_xml.py", line 81, in _open_set
    oai_records = self._collection.ListRecords(**opts)
  File "/Users/edsummers/Library/Caches/pypoetry/virtualenvs/dlme-airflow-4uPLCoq4-py3.10/lib/python3.10/site-packages/sickle/app.py", line 150, in ListRecords
    return self.iterator(self, params, ignore_deleted=ignore_deleted)
  File "/Users/edsummers/Library/Caches/pypoetry/virtualenvs/dlme-airflow-4uPLCoq4-py3.10/lib/python3.10/site-packages/sickle/iterator.py", line 135, in __init__
    super(OAIItemIterator, self).__init__(sickle, params, ignore_deleted)
  File "/Users/edsummers/Library/Caches/pypoetry/virtualenvs/dlme-airflow-4uPLCoq4-py3.10/lib/python3.10/site-packages/sickle/iterator.py", line 46, in __init__
    self._next_response()
  File "/Users/edsummers/Library/Caches/pypoetry/virtualenvs/dlme-airflow-4uPLCoq4-py3.10/lib/python3.10/site-packages/sickle/iterator.py", line 138, in _next_response
    super(OAIItemIterator, self)._next_response()
  File "/Users/edsummers/Library/Caches/pypoetry/virtualenvs/dlme-airflow-4uPLCoq4-py3.10/lib/python3.10/site-packages/sickle/iterator.py", line 84, in _next_response
    self.oai_response = self.sickle.harvest(**params)
  File "/Users/edsummers/Library/Caches/pypoetry/virtualenvs/dlme-airflow-4uPLCoq4-py3.10/lib/python3.10/site-packages/sickle/app.py", line 130, in harvest
    http_response.raise_for_status()
  File "/Users/edsummers/Library/Caches/pypoetry/virtualenvs/dlme-airflow-4uPLCoq4-py3.10/lib/python3.10/site-packages/requests/models.py", line 1021, in raise_for_status
    raise HTTPError(http_error_msg, response=self)
requests.exceptions.HTTPError: 404 Client Error: 404 for url: https://luna.manchester.ac.uk/luna/servlet/oai?set=Manchester~18~18&metadataPrefix=oai_dc&verb=ListRecords
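
One way to narrow this down is to check whether the whole OAI endpoint is gone or only the `ListRecords` request for this set is failing. The sketch below is not part of dlme-airflow; `oai_url`, `probe`, and `diagnose` are hypothetical helper names, and it uses only the standard library rather than the Sickle client shown in the traceback. It assumes the server answers the standard OAI-PMH `Identify` verb at the same base URL.

```python
from urllib.error import HTTPError, URLError
from urllib.parse import urlencode
from urllib.request import urlopen


def oai_url(base_url, **params):
    """Build an OAI-PMH request URL from a base URL and query parameters."""
    return f"{base_url}?{urlencode(params)}"


def probe(url, opener=urlopen, timeout=30):
    """Return the HTTP status code for url, or None if no HTTP response arrived."""
    try:
        with opener(url, timeout=timeout) as response:
            return response.status
    except HTTPError as err:
        # HTTPError is a subclass of URLError, so it must be caught first.
        return err.code
    except URLError:
        return None


def diagnose(base_url, oai_set, opener=urlopen):
    """Distinguish a missing endpoint from a failing set request (hypothetical helper)."""
    identify = probe(oai_url(base_url, verb="Identify"), opener)
    if identify != 200:
        return "endpoint unavailable"
    records = probe(
        oai_url(base_url, set=oai_set, metadataPrefix="oai_dc", verb="ListRecords"),
        opener,
    )
    if records != 200:
        return "set request failing"
    return "ok"
```

Calling `diagnose("https://luna.manchester.ac.uk/luna/servlet/oai", "Manchester~18~18")` would issue the two requests against the live server. If it returns "endpoint unavailable", the base URL in the catalog has probably changed; if it returns "set request failing", the set name or metadata prefix is the more likely culprit.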