Esri / geoportal-server-harvester

Metadata Harvester for Esri Geoportal Server
http://esri.github.io/geoportal-server/
Apache License 2.0
31 stars 24 forks source link

PyCSW harvest ESRI Geoportal 2.6.5 (CSW) failed #186

Closed leevydanomalik closed 4 months ago

leevydanomalik commented 2 years ago

Hi Guys,

I would like to harvest ESRI Geoportal CSW from pyCSW engine adn I got the following error (PyCSW Logs), I checked no null identifier. pls help <?xml version="1.0" encoding="UTF-8" standalone="no"?>

Harvest (insert) failed: (psycopg2.errors.NotNullViolation) null value in column "identifier" of relation "pycsw" violates not-null constraint DETAIL: Failing row contains (null, csw:Record, http://www.opengis.net/cat/csw/2.0.2, local, 2022-05-22T05:56:56Z, <csw:Capabilities xmlns:csw="http://www.opengis.net/cat/csw/3.0"..., CSW A catalogue service that conforms to the HTTP protocol bi..., null, application/xml, null, null, null, null, null, null, null, null, null, null, null, null, null, null, null, null, null, null, null, null, null, null, null, null, null, null, http://10.10.254.246:8082/geoportal/csw, null, null, null, null, null, null, null, null, null, null, null, null, null, null, null, null, null, null, null, null, null, null, null, null, null, null, null, null, null). [SQL: INSERT INTO pycsw (typename, schema, mdsource, insert_date, xml, anytext, metadata, metadata_type, language, type, title, title_alternate, abstract, keywords, keywordstype, parentidentifier, relation, time_begin, time_end, topicategory, resourcelanguage, creator, publisher, contributor, organization, securityconstraints, accessconstraints, otherconstraints, date, date_revision, date_creation, date_publication, date_modified, format, source, crs, geodescode, denominator, distancevalue, distanceuom, wkt_geometry, servicetype, servicetypeversion, operation, couplingtype, operateson, operatesonidentifier, operatesoname, degree, classification, conditionapplyingtoaccessanduse, lineage, responsiblepartyrole, specificationtitle, specificationdate, specificationdatetype, platform, instrument, sensortype, cloudcover, bands, links, anytext_tsvector, wkb_geometry) VALUES (%(typename)s, %(schema)s, %(mdsource)s, %(insert_date)s, %(xml)s, %(anytext)s, %(metadata)s, %(metadata_type)s, %(language)s, %(type)s, %(title)s, %(title_alternate)s, %(abstract)s, %(keywords)s, %(keywordstype)s, %(parentidentifier)s, %(relation)s, %(time_begin)s, %(time_end)s, %(topicategory)s, %(resourcelanguage)s, %(creator)s, %(publisher)s, %(contributor)s, %(organization)s, %(securityconstraints)s, %(accessconstraints)s, %(otherconstraints)s, %(date)s, %(date_revision)s, %(date_creation)s, %(date_publication)s, %(date_modified)s, %(format)s, %(source)s, %(crs)s, %(geodescode)s, %(denominator)s, %(distancevalue)s, %(distanceuom)s, %(wkt_geometry)s, %(servicetype)s, %(servicetypeversion)s, %(operation)s, %(couplingtype)s, %(operateson)s, %(operatesonidentifier)s, %(operatesoname)s, %(degree)s, %(classification)s, %(conditionapplyingtoaccessanduse)s, %(lineage)s, %(responsiblepartyrole)s, %(specificationtitle)s, %(specificationdate)s, %(specificationdatetype)s, %(platform)s, %(instrument)s, %(sensortype)s, %(cloudcover)s, %(bands)s, %(links)s, %(anytext_tsvector)s, %(wkb_geometry)s)] [parameters: {'typename': 'csw:Record', 'schema': 'http://www.opengis.net/cat/csw/2.0.2', 'mdsource': 'local', 'insert_date': '2022-05-22T05:56:56Z', 'xml': '<csw:Capabilities xmlns:csw="http://www.opengis.net/cat/csw/3.0" xmlns:fes="http://www.opengis.net/fes/2.0" xmlns:gml="http://www.opengis.net/gml/3.2 ... (11463 characters truncated) ... poralOperator name="TOverlaps"/>\n </fes:TemporalOperators>\n </fes:Temporal_Capabilities>\n </fes:Filter_Capabilities>\n</csw:Capabilities>', 'anytext': ' CSW A catalogue service that conforms to the HTTP protocol binding of the OpenGIS Catalogue Service specification (version 3.0) CSW CSW 3.0.0 ... (824 characters truncated) ... lse false false false false false false false false en true true ', 'metadata': None, 'metadata_type': 'application/xml', 'language': None, 'type': None, 'title': None, 'title_alternate': None, 'abstract': None, 'keywords': None, 'keywordstype': None, 'parentidentifier': None, 'relation': None, 'time_begin': None, 'time_end': None, 'topicategory': None, 'resourcelanguage': None, 'creator': None, 'publisher': None, 'contributor': None, 'organization': None, 'securityconstraints': None, 'accessconstraints': None, 'otherconstraints': None, 'date': None, 'date_revision': None, 'date_creation': None, 'date_publication': None, 'date_modified': None, 'format': None, 'source': 'http://10.10.254.246:8082/geoportal/csw', 'crs': None, 'geodescode': None, 'denominator': None, 'distancevalue': None, 'distanceuom': None, 'wkt_geometry': None, 'servicetype': None, 'servicetypeversion': None, 'operation': None, 'couplingtype': None, 'operateson': None, 'operatesonidentifier': None, 'operatesoname': None, 'degree': None, 'classification': None, 'conditionapplyingtoaccessanduse': None, 'lineage': None, 'responsiblepartyrole': None, 'specificationtitle': None, 'specificationdate': None, 'specificationdatetype': None, 'platform': None, 'instrument': None, 'sensortype': None, 'cloudcover': None, 'bands': None, 'links': None, 'anytext_tsvector': None, 'wkb_geometry': None}] (Background on this error at: https://sqlalche.me/e/14/gkpj).
mhogeweg commented 2 years ago

can you try this public Geoportal instance? https://gpt.geocloud.com/geoportal2/csw?request=GetCapabilities&service=CSW&version=2.0.2

If you let me know when you try harvesting, I can check the logs if needed.