elastic / elasticsearch

Free and Open, Distributed, RESTful Search Engine
https://www.elastic.co/products/elasticsearch
Other
68.52k stars 24.34k forks source link

geo-line aggregation outlier detection #87708

Open nreese opened 2 years ago

nreese commented 2 years ago

Description

Geo-line metric aggregation aggregates all geo_point values within a bucket into a LineString ordered by the chosen sort field. It would be very useful to have the aggregation remove outlier points to provide cleaner results.

For example, suppose geo_point values are GPS positions from a bicycle rider tracking their route. Because of a sensor error or messy data etc, the dataset contains an invalid point that is 1000 km away from the other positions in the track. The current results of geo-line aggregation would contain the error and display a track that spikes to the outlier data point. Instead, it would be useful for the geo-line aggregation to detect and ignore outlier positions that fall outside of realistic travel speeds and not include those points in geo-line results.

elasticmachine commented 2 years ago

Pinging @elastic/es-analytics-geo (Team:Analytics)