The coevolve package allows the user to fit Bayesian dynamic coevolutionary phylogenetic models in Stan. These models can be used to estimate how variables have coevolved over evolutionary time and to assess causal directionality (X → Y vs. Y → X) and contingencies (X, then Y) in evolution.
While existing methods only allow pairs of binary traits to coevolve (e.g., BayesTraits), the coevolve package allows users to include multiple traits of different data types, including binary, ordinal, count, and continuous traits.
To use the coevolve package, you must first install the cmdstanr
package (see full installation instructions here:
https://mc-stan.org/cmdstanr/).
install.packages("cmdstanr", repos = c("https://mc-stan.org/r-packages/", getOption("repos")))
You can then install the development version of coevolve with:
# install.packages("devtools")
devtools::install_github("ScottClaessens/coevolve")
library(coevolve)
As an example, we analyse the coevolution of political and religious authority in 97 Austronesian societies. These data were compiled and analysed in Sheehan et al. (2023). Both variables are four-level ordinal variables reflecting increasing levels of authority. We use a phylogeny of Austronesian languages to assess patterns of coevolution.
fit <-
coev_fit(
data = authority$data,
variables = list(
political_authority = "ordered_logistic",
religious_authority = "ordered_logistic"
),
id = "language",
tree = authority$phylogeny,
# manually set prior
prior = list(A_offdiag = "normal(0, 2)"),
# arguments for cmdstanr
parallel_chains = 4,
iter_sampling = 500,
refresh = 0,
seed = 1
)
#> Running MCMC with 4 parallel chains...
#>
#> Chain 3 finished in 972.1 seconds.
#> Chain 1 finished in 976.9 seconds.
#> Chain 4 finished in 995.5 seconds.
#> Chain 2 finished in 1003.6 seconds.
#>
#> All 4 chains finished successfully.
#> Mean chain execution time: 987.0 seconds.
#> Total execution time: 1004.0 seconds.
#> Warning: 35 of 2000 (2.0%) transitions ended with a divergence.
#> See https://mc-stan.org/misc/warnings for details.
The results can be investigated using:
summary(fit)
#> Variables: political_authority = ordered_logistic
#> religious_authority = ordered_logistic
#> Data: authority$data (Number of observations: 97)
#> Phylogeny: authority$phylogeny (Number of trees: 1)
#> Draws: 4 chains, each with iter = 500; warmup = 1000; thin = 1
#> total post-warmup draws = 2000
#>
#> Autoregressive selection effects:
#> Estimate Est.Error 2.5% 97.5% Rhat Bulk_ESS Tail_ESS
#> political_authority -0.67 0.53 -1.97 -0.02 1.00 1015 769
#> religious_authority -0.78 0.57 -2.15 -0.04 1.00 1265 1061
#>
#> Cross selection effects:
#> Estimate Est.Error 2.5% 97.5% Rhat Bulk_ESS Tail_ESS
#> political_authority ⟶ religious_authority 2.28 0.98 0.31 4.20 1.01 874 951
#> religious_authority ⟶ political_authority 1.71 1.09 -0.36 3.95 1.01 494 1012
#>
#> Drift parameters:
#> Estimate Est.Error 2.5% 97.5% Rhat Bulk_ESS Tail_ESS
#> sd(political_authority) 1.98 0.80 0.30 3.51 1.01 402 344
#> sd(religious_authority) 1.28 0.77 0.07 2.93 1.01 447 776
#> cor(political_authority,religious_authority) 0.27 0.31 -0.42 0.77 1.00 1303 1340
#>
#> Continuous time intercept parameters:
#> Estimate Est.Error 2.5% 97.5% Rhat Bulk_ESS Tail_ESS
#> political_authority 0.20 0.95 -1.62 2.05 1.00 2342 1495
#> religious_authority 0.27 0.90 -1.42 2.02 1.01 2094 1229
#>
#> Ordinal cutpoint parameters:
#> Estimate Est.Error 2.5% 97.5% Rhat Bulk_ESS Tail_ESS
#> political_authority[1] -1.34 0.86 -2.97 0.33 1.00 1113 1468
#> political_authority[2] -0.58 0.83 -2.22 1.08 1.00 1257 1585
#> political_authority[3] 1.61 0.86 -0.00 3.31 1.00 1371 1540
#> religious_authority[1] -1.50 0.92 -3.29 0.34 1.00 1647 1280
#> religious_authority[2] -0.81 0.90 -2.54 0.93 1.00 1693 1464
#> religious_authority[3] 1.61 0.94 -0.17 3.52 1.00 1751 1648
#> Warning: There were 35 divergent transitions after warmup.
#> http://mc-stan.org/misc/warnings.html#divergent-transitions-after-warmup
The summary provides general information about the model and details on the posterior draws for the model parameters. In particular, the output shows the autoregressive selection effects (i.e., the effect of a variable on itself in the future), the cross selection effects (i.e., the effect of a variable on another variable in the future), the amount of drift, continuous time intercept parameters for the schocastic differential equation, and cutpoints for the ordinal variables.
While this summary output is useful as a first glance, it is difficult to interpret these parameters directly to infer directions of coevolution. Another approach is to “intervene” in the system. We can hold variables of interest at their average values and then increase one variable by a standardised amount to see how this affects the optimal trait value for another variable.
The coev_plot_delta_theta()
function allows us to visualise
$\Delta\theta{z}$ for all variable pairs in the model.
$\Delta\theta{z}$ is defined as the change in the optimal trait value
of one variable which results from a one median absolute deviation
increase in another variable.
coev_plot_delta_theta(fit)
#> Warning: Removed 123 rows containing non-finite outside the scale range (`stat_density()`).
This plot suggests that both variables influence one another in their coevolution. A standardised increase in political authority results in an increase in the optimal trait value for religious authority, and vice versa. In other words, these two variables reciprocally coevolve over evolutionary time.
When using the coevolve package, please cite the following papers: