This pull request provides support for STAC harvesting.
STAC repositories are organized into catalogs, which in turn can hold sub-catalogs and/or items. The entire structure is expressed through JSON files: a catalog is a JSON document with one array of URLs pointing to sub-catalogs and a separate array of URLs pointing to items. The STAC broker crawls the catalog structure, fetches the items, and publishes them accordingly.
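The crawl described above can be sketched as a short recursive traversal. This is an illustration only, not the broker's actual code: the names `crawl_catalog` and `fetch_json` are hypothetical, and the sketch assumes each catalog exposes its references in a `links` array whose entries carry `rel` ("child" for sub-catalogs, "item" for items) and `href`. The exact JSON layout may differ between STAC versions.

```python
import json
from urllib.parse import urljoin
from urllib.request import urlopen


def fetch_json(url):
    """Download a URL and parse the body as JSON (hypothetical helper)."""
    with urlopen(url) as response:
        return json.load(response)


def crawl_catalog(url, fetch=fetch_json, seen=None):
    """Yield every item document reachable from the catalog at `url`.

    Assumes a `links` array with `rel` and `href` keys; `seen` guards
    against catalogs that link back to each other.
    """
    seen = set() if seen is None else seen
    if url in seen:
        return
    seen.add(url)
    catalog = fetch(url)
    for link in catalog.get("links", []):
        # Links may be relative, so resolve them against the catalog URL.
        target = urljoin(url, link["href"])
        if link.get("rel") == "child":
            # Sub-catalog: descend into it.
            yield from crawl_catalog(target, fetch, seen)
        elif link.get("rel") == "item" and target not in seen:
            # Item: fetch it once and hand it to the caller.
            seen.add(target)
            yield fetch(target)
```

The `fetch` parameter accepts any callable that maps a URL to a parsed JSON document, so the traversal can be exercised against an in-memory dictionary instead of a live repository.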
Testing
The current implementation has been tested with the following repositories:
Note: some of the other mentioned catalogs were unavailable at the time of development and testing:
Important!
By default, Elasticsearch refuses to accept documents that would bring the number of fields (properties) in an index above 1000. More about this limit can be found at https://www.elastic.co/guide/en/elasticsearch/reference/master/mapping-settings-limit.html. If this is the case, the workaround is to increase the limit. This can be done with freely available tools such as Postman or curl. The following example would increase that limit:
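As a minimal sketch in Python rather than Postman or curl, the request below targets the index settings endpoint and raises `index.mapping.total_fields.limit`. The host `http://localhost:9200`, the index name `my-index`, and the new limit of 2000 are placeholders chosen for illustration; substitute the values of the actual deployment. The helper name `build_field_limit_request` is hypothetical.

```python
import json
from urllib.request import Request, urlopen


def build_field_limit_request(base_url, index, limit):
    """Build a PUT request that raises the total-fields limit of one index."""
    body = json.dumps({"index.mapping.total_fields.limit": limit}).encode("utf-8")
    return Request(
        url=f"{base_url.rstrip('/')}/{index}/_settings",
        data=body,
        headers={"Content-Type": "application/json"},
        method="PUT",
    )


if __name__ == "__main__":
    # Placeholder host, index name, and limit; adjust before running.
    request = build_field_limit_request("http://localhost:9200", "my-index", 2000)
    with urlopen(request) as response:
        print(response.read().decode("utf-8"))
```

The same body sent with `PUT <host>/<index>/_settings` from Postman or curl has the same effect. A higher limit lets wider documents in, at the cost of a larger mapping, so it is worth raising it only as far as needed.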