Closed SamPassmore closed 2 years ago
also pinging @HedvigS and @RustyGray
Well, it confirms what we suspected - it varies a lot. That's not a problem in itself, of course. I think this was suggested as a first step to evaluating whether really we ought to be using x nearest neighbours as the spatial covariance predictor. I think based on this, that is still an option. What should x be? I don't know. 10? 20? 50?
Thanks Sam -- I was worried that we would be averaging n=1 or something embarrassing. I think I prefer the k approach rather than n neighbors as more elegant.
I agree. R.
On 11. Jul 2022, at 10:34, Simon J Greenhill @.***> wrote:
Thanks Sam -- I was worried that we would be averaging n=1 or something embarrassing. I think I prefer the k approach rather than n neighbors as more elegant.
— Reply to this email directly, view it on GitHub https://github.com/grambank/grambank-analysed/issues/65#issuecomment-1180117532, or unsubscribe https://github.com/notifications/unsubscribe-auth/AEETOPBYTSCI7WURPZNMLLTVTPL77ANCNFSM53FYH4GA. You are receiving this because you were mentioned.
What is the k approach? Is that what we've been doing?
yep. Sorry, should have been clearer
In our last meeting @QuentinAtkinson @blasid and @SimonGreenhill suggested we should look at how many neighbors each languages would have based on a distance of approximately 1000km radius. I will display these results below, alongside the number of neighbors with a non-zero covariance. The results are histograms showing the number of languages that have N neighbors. Note that this is a count of neighbors from the sample of languages used in the analyses. Also note that the covariance matrix doesn't hit approximately zero at exactly 1000 kilometers, hence the slight difference in the summary statistics.
Number of neighbors within 1000km:
Summary statistics:
From the earlier conversation, it was the lower end of the extremes that people wanted to investigate, so here is the count of languages who have 1 to 10 neighbors.
Number of neighbors with more than 0.01 Covariance:
Number of languages with 1 - 10 Neighbors based on covariance:
Script to reproduce these results which can be run from the R_Grambank directory of this repository