jdinkla / location-based-nearest-neighbours

Using k-d trees with Apache Spark and Scala
Apache License 2.0
11 stars 3 forks

Can I use this algorithm to query datapoints in another dataframe? #1

Open munitech4u opened 6 years ago

munitech4u commented 6 years ago

The example shows the case where all the points are stored in the same dataframe and we get the nearest neighbour of each data point. Is it possible to use this algorithm to query points in another dataframe and return, for every point in dataframe one, the count of its nearest neighbours?

jdinkla commented 6 years ago

Hi munitech4u,

If you use more than one dataframe, you have to split the data and combine the results.

Regards,

Jörn


From: munitech4u Sent: Thursday, 30 November 2017 10:13 To: jdinkla/location-based-nearest-neighbours Cc: Subscribed Subject: [jdinkla/location-based-nearest-neighbours] Can I use this algorithm to query datapoints in another dataframe? (#1)

munitech4u commented 6 years ago

Thanks. But my objective is to return the nearest points from another dataframe only (not from the same dataframe). Is it possible to do that?
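To make the objective concrete, here is a minimal sketch of one way to read it: build a k-d tree from the points of the second dataframe only, then look up each point of the first dataframe in that tree. This is not the repository's API. `CrossSetNearest`, `build`, `nearest` and `countWithin` are hypothetical names, the points are plain 2-d tuples rather than Spark rows, and "count of nearest neighbours" is interpreted here as "number of reference points within a given radius", which is an assumption.

```scala
// Hypothetical sketch: a small 2-d k-d tree built from one point set
// and queried with points from a different set. Not the repository's code.
object CrossSetNearest {
  type Point = (Double, Double)

  sealed trait KdTree
  case object Empty extends KdTree
  final case class Node(point: Point, axis: Int, left: KdTree, right: KdTree) extends KdTree

  private def coord(p: Point, axis: Int): Double = if (axis == 0) p._1 else p._2

  // Squared Euclidean distance; enough for comparisons.
  private def sqDist(a: Point, b: Point): Double = {
    val dx = a._1 - b._1
    val dy = a._2 - b._2
    dx * dx + dy * dy
  }

  // Build the tree from the reference points only, splitting at the median
  // and alternating between the x and y axis at each level.
  def build(points: Vector[Point], depth: Int = 0): KdTree =
    if (points.isEmpty) Empty
    else {
      val axis = depth % 2
      val sorted = points.sortBy(coord(_, axis))
      val mid = sorted.length / 2
      Node(sorted(mid), axis,
        build(sorted.take(mid), depth + 1),
        build(sorted.drop(mid + 1), depth + 1))
    }

  // Nearest reference point to `query`, or None if the tree is empty.
  def nearest(tree: KdTree, query: Point): Option[Point] = {
    def go(t: KdTree, best: Option[Point]): Option[Point] = t match {
      case Empty => best
      case Node(p, axis, l, r) =>
        val better = best match {
          case Some(b) if sqDist(b, query) <= sqDist(p, query) => best
          case _ => Some(p)
        }
        val diff = coord(query, axis) - coord(p, axis)
        val (near, far) = if (diff < 0) (l, r) else (r, l)
        val afterNear = go(near, better)
        // Visit the far side only if the splitting plane is closer than the best match.
        val bestDist = afterNear.map(sqDist(_, query)).getOrElse(Double.MaxValue)
        if (diff * diff < bestDist) go(far, afterNear) else afterNear
    }
    go(tree, None)
  }

  // Number of reference points within `radius` of `query`.
  def countWithin(tree: KdTree, query: Point, radius: Double): Int = tree match {
    case Empty => 0
    case Node(p, axis, l, r) =>
      val self = if (sqDist(p, query) <= radius * radius) 1 else 0
      val diff = coord(query, axis) - coord(p, axis)
      // Descend into a side only if the query ball can reach across the split.
      val leftCount = if (diff - radius <= 0) countWithin(l, query, radius) else 0
      val rightCount = if (diff + radius >= 0) countWithin(r, query, radius) else 0
      self + leftCount + rightCount
  }
}
```

With `val tree = CrossSetNearest.build(referencePoints)`, each query point `q` from the other set can be resolved with `CrossSetNearest.nearest(tree, q)` or `CrossSetNearest.countWithin(tree, q, radius)`. In Spark, one possible arrangement (again an assumption, and only workable if the reference set fits in memory) would be to collect the reference dataframe, build the tree on the driver, share it via `SparkContext.broadcast`, and map over the query dataframe.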