elastic / kibana


[Fleet] Improve data streams API efficiency #116428

Open · hop-dev opened this issue 3 years ago

hop-dev commented 3 years ago

Kibana version:

7.15.0, 7.16.0, master

Description of the problem including expected versus actual behavior:

Originally pointed out by @joshdover here:

The data stream view can be quite slow to load when there are a lot of streams. We currently get all data streams in one request without pagination and perform an aggregation per data stream.

This issue is to look into ways of improving the performance, current options discussed:

1. Using the data stream name to extract the type, dataset and namespace instead of aggregating

Currently, there is no guarantee that the constant_keyword values in the data match the data stream name. @ruflin suggested we could file a feature request for Elasticsearch to validate the constant_keyword values against the data stream name, which would allow us to rely on this link.

However, we are now looking at adding another aggregation as part of https://github.com/elastic/integrations/issues/768 so there may no longer be a big efficiency gain to be found here.
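
For illustration, here is a minimal sketch of what option 1 could look like, assuming the `<type>-<dataset>-<namespace>` naming scheme (e.g. `logs-nginx.access-default`); the helper name is hypothetical and this is not Fleet's actual implementation:

```ts
// Hypothetical helper: derive type, dataset and namespace from a data stream
// name of the form <type>-<dataset>-<namespace>. Assumes the type never
// contains "-" and the namespace is everything after the last "-".
interface DataStreamParts {
  type: string;
  dataset: string;
  namespace: string;
}

function parseDataStreamName(name: string): DataStreamParts | undefined {
  const firstDash = name.indexOf('-');
  const lastDash = name.lastIndexOf('-');
  // We need at least <type>-<dataset>-<namespace>, i.e. two distinct separators.
  if (firstDash === -1 || lastDash === firstDash) {
    return undefined;
  }
  return {
    type: name.slice(0, firstDash),               // e.g. "logs"
    dataset: name.slice(firstDash + 1, lastDash), // e.g. "nginx.access"
    namespace: name.slice(lastDash + 1),          // e.g. "default"
  };
}

// parseDataStreamName('logs-nginx.access-default')
// => { type: 'logs', dataset: 'nginx.access', namespace: 'default' }
```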

2. Introducing pagination

We could introduce pagination to limit the amount of work we do per request, though this would come with some challenges.
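
As a rough sketch only (the `page`/`perPage` parameters and function shape here are hypothetical, not the real Fleet API surface), pagination would mean slicing the data stream list first and only doing the expensive per-stream work for the requested slice:

```ts
import type { Client } from '@elastic/elasticsearch';

// Hypothetical paginated handler: fetch the full list of data stream names
// (cheap), then run the expensive per-stream aggregations/stats calls only
// for the page that was requested.
async function getDataStreamsPage(esClient: Client, page: number, perPage: number) {
  const { data_streams: allStreams } = await esClient.indices.getDataStream({ name: '*' });
  const start = (page - 1) * perPage;
  const pageOfStreams = allStreams.slice(start, start + perPage);

  // ...per-stream aggregations / stats calls would go here, for pageOfStreams only...

  return {
    total: allStreams.length,
    page,
    perPage,
    items: pageOfStreams.map((ds) => ds.name),
  };
}
```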

3. Combine individual aggregations into one aggregation

I am not sure this is possible. We would need to find a way to use filters and sub-aggregations to get the namespace, dataset and type for each data stream in one query. I believe we would need a filter query to distinguish each data stream, and the only way to distinguish them would be to use the very values we are querying for!
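
Purely as an assumption on my part (not a conclusion from this thread), one way a single combined query might be shaped is a composite aggregation over the `data_stream.*` constant_keyword fields: it returns every type/dataset/namespace combination in one request, with a sub-aggregation for last activity, at the cost of relying on those field values being correct:

```ts
import { Client } from '@elastic/elasticsearch';

// Sketch: one search request that buckets documents by their
// data_stream.type / dataset / namespace constant_keyword values.
async function getAllDataStreamMetadata(esClient: Client) {
  const response = await esClient.search({
    index: '*-*-*', // assumption: indices follow the data stream naming scheme
    size: 0,
    aggs: {
      streams: {
        composite: {
          size: 1000, // page through with after_key if there are more buckets
          sources: [
            { type: { terms: { field: 'data_stream.type' } } },
            { dataset: { terms: { field: 'data_stream.dataset' } } },
            { namespace: { terms: { field: 'data_stream.namespace' } } },
          ],
        },
        aggs: {
          last_activity: { max: { field: '@timestamp' } },
        },
      },
    },
  });
  return response.aggregations;
}
```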

Steps to reproduce:

  1. Set up Fleet & Fleet Server
  2. Create an agent policy with many integrations to create many data streams
  3. Go to /app/fleet/data-streams
  4. Note that the page can be quite slow to load
elasticmachine commented 3 years ago

Pinging @elastic/fleet (Team:Fleet)

joshdover commented 3 years ago

@elastic/kibana-stack-management have you all looked at optimizing your usage of the Data Streams stats API? I noticed that by default, stats are excluded from your Data Streams UI (you have to switch on a toggle in the top right). Curious if there's any history behind this decision and whether we should also consider excluding stats by default or removing them from the list view entirely.

cjcenizal commented 3 years ago

@joshdover We haven't had an opportunity to revisit that functionality since it was first implemented. Because loading the data stream stats requires hitting a separate API (https://github.com/elastic/kibana/pull/75107/files#diff-0db7f035e2e41be22bac202848c325fabf209f626b8a934d09cce5e9e074941bR34), and the stats themselves might take a while to fetch, retrieving the data streams along with their stats can be slow. I recommend pinging the ES Data Management team for more detailed and up-to-date info.
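
For context, here is a minimal sketch of that separate round trip, assuming the standard `_data_stream/_stats` API via the Elasticsearch JS client (this is not the Index Management code; field names follow that API's response):

```ts
import { Client } from '@elastic/elasticsearch';

// Sketch: fetch size and last-activity stats for all data streams in one call
// to the data stream stats API, which is separate from the list/get API.
async function getDataStreamStats(esClient: Client) {
  const { data_streams: stats } = await esClient.indices.dataStreamsStats({ name: '*' });
  return stats.map((s) => ({
    name: s.data_stream,
    backingIndices: s.backing_indices,
    storeSizeBytes: s.store_size_bytes,
    maximumTimestamp: s.maximum_timestamp,
  }));
}
```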

joshdover commented 2 years ago

This continues to be a problem for what I expect to be most Fleet customers. In my test cluster, I have ~60 data streams with ~300 backing indices, and the request to GET /api/fleet/data_streams is timing out in Kibana after 2 minutes, resulting in a 502 error in Cloud, likely from the proxy layer (`backend closed connection`).

I don't think this is anywhere close to a large amount of data (I'm only ingesting data from ~6 integrations on 2 laptops that aren't even always in use).

@jen-huang I'm going to add this to our iteration board to look at in the next testing cycle. I think we should try to get a fix in for the 7.x series as well.

joshdover commented 2 years ago

I did some further digging in our production data here and I'm seeing that about 2.5% of customers who attempted to use this page in the last 7 days were affected by this bug. I haven't dug in deeper, but my guess is that this affects our largest, most mature adopters of Fleet, which is an important segment. While the incidence rate isn't incredibly high, 97.5% isn't exactly a great SLA. I think prioritizing this is the right call.

thunderwood19 commented 2 years ago

@joshdover

Any update on this? I am one of the affected customers who relies heavily on Fleet. If I can help with any logs/testing, I would be more than happy to!

joshdover commented 2 years ago

Hi @thunderwood19, we have this prioritized to be worked on soon, but we have not yet dug in further. In the meantime, I do suggest using the UI in Stack Management > Index Management > Data streams.


Related to this, in https://github.com/elastic/kibana/issues/126067 it was discovered that the user needs the manage cluster privilege in order to access the Data stream stats API. This limits the usability of this page now that we're allowing non-superusers to use Fleet.

I think this requirement gives us further reason to explore decoupling the request to the Data stream stats API from fetching the list of data streams. If we loaded the stats separately, we may be able to show the main list quicker while also providing a more progressive UI for users with lower privileges.
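
A sketch of that decoupling idea, with all names hypothetical (this is not Fleet's code): render the list as soon as it arrives, and only fill in the stats if the privileged stats call succeeds:

```ts
interface DataStreamListItem { name: string; }
interface DataStreamStats { name: string; sizeBytes: number; }

// Hypothetical progressive loader: the list call works for lower-privileged
// users; the stats call needs the manage cluster privilege and may fail.
async function loadDataStreamsPage(deps: {
  fetchList: () => Promise<DataStreamListItem[]>;
  fetchStats: () => Promise<DataStreamStats[]>;
  renderList: (items: DataStreamListItem[]) => void;
  renderStats: (stats: DataStreamStats[]) => void;
  showStatsUnavailable: () => void;
}) {
  const list = await deps.fetchList();
  deps.renderList(list); // show the table immediately

  try {
    const stats = await deps.fetchStats();
    deps.renderStats(stats); // progressively enhance the rows
  } catch {
    deps.showStatsUnavailable(); // e.g. a 403 for non-superusers
  }
}
```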

joshdover commented 2 years ago

@thunderwood19 Have you had a chance to test this on 8.1? We've made some improvements and I'm no longer seeing this issue as widespread in our production data or in my personal cluster on Elastic Cloud.

thunderwood19 commented 2 years ago

> @thunderwood19 Have you had a chance to test this on 8.1? We've made some improvements and I'm no longer seeing this issue as widespread in our production data or in my personal cluster on Elastic Cloud.

Yep! I let my support know yesterday; I can see the data streams via the Fleet GUI just fine now on 8.1.0.

joshdover commented 2 years ago

Fantastic to hear. @jen-huang I'm going to de-prioritize this for now.

joshdover commented 2 years ago

Some improvements are being made in https://github.com/elastic/kibana/pull/130973 to switch to using the terms enum API instead of aggregations for some of the calculations. This increases the request count, but should be a big improvement on overall perf.
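
For illustration only (my sketch, not the change in that PR): the terms enum API can list the distinct values of a keyword field for a data stream without running an aggregation, e.g.:

```ts
import { Client } from '@elastic/elasticsearch';

// Sketch: use the terms enum API to enumerate the namespace values present
// in a data stream instead of running a terms aggregation.
async function getNamespaces(esClient: Client, dataStreamName: string) {
  const { terms } = await esClient.termsEnum({
    index: dataStreamName,
    field: 'data_stream.namespace',
    size: 10,
  });
  return terms;
}
```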

Pagination would still be welcome to avoid the n+1 query problem we have right now.

nimarezainia commented 2 years ago

@joshdover What remains for us to do in this regard? Should we track this for 8.5 (for Fleet scaling)?

joshdover commented 2 years ago

I think we mostly need to do the pagination work at this point. I don't think it's super high priority right now though. It doesn't affect control plane scaling, mostly data plane.