gbif / content-crawler

Crawls CMS and articles from Mendeley into ElasticSearch indexes
Apache License 2.0

composition pages not indexed properly #57

Open MortenHofft opened 9 months ago

MortenHofft commented 9 months ago

In Contentful there is a contentType, Composition, that has an ordered list of blocks such as [header, image, list of cards, ...]. This allows editors more freedom when creating pages. This list of blocks is also used on programme pages and the home page.

But the list of blocks isn't reflected in the Elasticsearch index.

Example entry from dev: http://cms-search.gbif-dev.org:9200/composition/_search?q=id:3fvWSwDCj8tZBpRFiWC8QQ. The same entry in Contentful: https://app.contentful.com/spaces/uo17ejk9rkwj/environments/dev/entries/3fvWSwDCj8tZBpRFiWC8QQ

issues:

We want these entries to be searchable even though all the text lives in other entries. And, obviously, we want to be able to display them.

We could either include all the information in the Elasticsearch document, or we could use GraphQL to unfold it. If we opt for the latter, then we need to add those contentTypes to ES as well (we do not currently index all the block types, like featureBlock, etc.).
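
To make the first option concrete, here is a rough sketch of what "include all the information in the Elasticsearch document" could look like. It is only an illustration, not how the crawler works today: the function name `flatten_composition`, the field names `blocks` and `searchableText`, the `TEXT_FIELDS` list, and the shape of the input dictionaries are all assumptions made up for this example.

```python
from typing import Any, Dict, List

# Assumption: these are the block fields that carry human-readable text.
# The real block content types may use different field names.
TEXT_FIELDS = ("title", "body", "description")


def flatten_composition(
    composition: Dict[str, Any],
    entries_by_id: Dict[str, Dict[str, Any]],
) -> Dict[str, Any]:
    """Build one index document from a composition and its referenced blocks.

    `composition["blocks"]` is assumed to be an ordered list of entry ids,
    and `entries_by_id` a lookup of already fetched block entries.
    """
    blocks: List[Dict[str, Any]] = []
    text_parts: List[str] = []

    title = composition.get("title")
    if isinstance(title, str) and title:
        text_parts.append(title)

    for block_id in composition.get("blocks", []):
        block = entries_by_id.get(block_id)
        if block is None:
            # Unresolved reference (e.g. unpublished entry): skip it.
            continue
        # Keep the full block, in order, so the page can be displayed.
        blocks.append(block)
        # Collect the block's text so the composition itself is searchable.
        for field in TEXT_FIELDS:
            value = block.get(field)
            if isinstance(value, str) and value:
                text_parts.append(value)

    return {
        "id": composition.get("id"),
        "contentType": "composition",
        "title": title,
        "blocks": blocks,
        "searchableText": " ".join(text_parts),
    }
```

With this shape, a search only needs to hit `searchableText` on the composition document, and the UI can render `blocks` in order without further lookups. The trade-off is that a composition has to be re-indexed whenever one of its blocks changes.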

@fmendezh I need your input here. I do not know what is best/easiest for you.