Open ali786XI opened 1 year ago
@aliabbas-elastic Where should this content go? It looks like it's quite generic and can be referenced by various integrations.
@alaudazzi Yes this is quite generic content related to reindexing a particular data stream when there are conflicts in particular fields. Ideally this should be in a generic place from where we can add that link in all of our integrations README.
@aliabbas-elastic and I met and discussed the next steps to fix the docs, which would impact both ES and Integrations doc pages:
@lalit-satapathy @SubhrataK These steps are quite complex and might not be appropriate for a regular user. Unless there is an easier way to provide this information, these steps should go into the public doc. Please check if the suggested approach is OK, or if you think there is any better alternative.
These two procedures can be integrated into the current Reindex API page, in the Elasticsearch doc set
It will be nice. I think sometime back we decided to move all these reindexing documentation from inside the integration document and point to relevant elasticsearch document instead. I hope those changes are done @agithomas . I agree to any further simplification of these docs, but we may have some differences of re-indexing in TSDB index vs non-TSDB index.
Thanks for the additional background @agithomas! Please check if the following actions make sense:
[ES team] Add these procedures to the ES docs:
[Arianna] As we try to avoid 4 levels in the nav tree, put these two sections after the page Reindex a TSDS:
[Arianna] Create the links from Integrations to ES docs
Provide the list of integrations that are TSDB or NON-TSDB enabled from where these two procedures should be referenced.
Shared the details separately
Main Issue
Reindexing steps document
Related issues
7624
Description
This issue provides the detailed reindexing steps for TSDB enabled data streams that need to be followed when there are field conflicts have been found because of the mismatched datatype.
For example, let's say host.ip field is shown conflicted under
metrics-*
data view, then this issue can be solved by reindexing the particular data stream's indices.To reindex the data, the following steps must be performed.
Step 1 Stop the data stream by going to
Integrations -> <integration_name> -> Integration policies
open the configuration of integration and disable the impacted data stream and save the integration.Step 2 Copy data into the temporary index by performing the following steps in the Dev tools.
Example:
Step 3 Note down the following values from the backing indices and index template of the data stream to be re indexed.
Stack Management -> Index Management -> Index Templates
index.time_series.start_time
andindex.time_series.end_time
index settings to match the lowest and highest@timestamp
values in the old data stream. ( Set the values to one second before and after the lowest and highest@timestamp
values )index.number_of_shards
index setting to the sum of all primary shards of all backing indices of the old data stream.index.number_of_replicas
to zero and unset the index.lifecycle.name index setting.Step 4 Create the index template after setting all the parameters mentioned in Step 3. (Here we will create a clone template hence the name metrics-dummy.cluster-copy)
Step 5 Now navigate to the created index template
Stack Management -> Index Management -> Index Templates
and click on theManage-> Edit
. Under Logistics, enable theCreate data stream
and set the priority to 300 (it should be greater than that of the metrics-dummy.cluster-default index template).Step 6 Delete the existing data stream by performing the following steps in the Dev tools.
Example:
Step 7 Copy data from the temporary index to the new index by performing the following steps in the Dev tools.
Example:
Step 8 Verify data is reindexed completely and the conflicts are resolved.
Step 9 Now navigate to the created index template
Stack Management -> Index Management -> Index Templates
and click on theManage-> Edit
. Under Logistics, unset the priority which was set in Step 5.Step 10 Invoke the rollover api on the destination data stream without any conditions set.
Example:
Step 11 Delete temporary index and index template by performing the following step in the Dev tools.
Step 12 Start the data stream by going to the
Integrations -> <integration_name> -> Integration policies
and open configuration of integration and enable theCollect <integration_name> metrics
toggle and save the integration.