open-contracting / ocds-index

A command-line tool and library to index OCDS documentation in Elasticsearch
https://ocds-index.readthedocs.io
BSD 3-Clause "New" or "Revised" License
0 stars 0 forks source link

Check if extracted text ought to include JSON text #5

Closed jpmckinney closed 3 years ago

jpmckinney commented 3 years ago

The extracted text is the same as in standard-search (see https://github.com/OpenDataServices/standard-search/issues/29).

However, we might not want to include JSON text or text from other directives.

The data.json file OCDS Index generates can be reviewed.

yolile commented 3 years ago

If the JSON key that someone could search is relevant in the documentation where it is mentioned, I think that probably it will appear also in the Markdown, so I think that it is fine to not include JSON texts in the results to avoid having too many non-related results, especially with some common keys as "tender", "ocid", etc

jpmckinney commented 3 years ago

Closed in a082325