Paganini is a lightweight python library meant to help with the design of combinatorial samplers. Given a combinatorial specification, expressed using a domain-specific language closely resembling Flajolet and Sedgewick's symbolic method, Paganini gives its users some additional control over the distribution of structures constructed using the designed samplers.
If you use Paganini or its components for published work, we encourage you to cite the accompanying papers:
Maciej Bendkowski, Olivier Bodini, Sergey Dovgal
Consider the following example. Suppose that we are interested in designing a sampler for plane trees of unbounded degree (i.e. with an arbitrary number of children), specified as
T = Z SEQ(T)
where SEQ(T)
stands for a (possibly empty) sequence of trees and Z
marks the size of a node.
In Paganini, we write the following snippet defining the same combinatorial class:
sp = Specification()
z, T = Variable(), Variable()
sp.add(T, z * Seq(T))
Now, if we we want to construct a corresponding sampler, say analytic (or
Boltzmann) sampler, we have to find a specific value of Z
and use it to
compute branching probabilities governing the random choices of our sampler
(essentially the number of children for each of the constructed nodes). What
value of Z
should be choose if we are interested in large, uniform, and
unbounded in size trees? With Paganini, this task amounts to invoking
sp.run_singular_tuner(z)
... and that's it! Paganini determines the corresponding value of z
for us.
Once tuned, variables are decorated with appropriate numerical values:
z.value // 0.25
T.value // 0.5
Paganini allows its users to focus on the design of specifications, taking care of the rest.
With the help of Paganini, users can demand the computation of tuning variables with specific, finite target expectations. Suppose that we are interested in designing an analytic sampler for Motzkin trees (i.e. plane unary-binary trees) however we would also like to demand that the outcome trees consists of around 1000 nodes, among which around 200 are unary. To achieve this goal, we construct the following specification:
sp = Specification()
z, u, M = Variable(1000), Variable(200), Variable()
sp.add(M, z + u * z * M + z * M ** 2)
sp.run_tuner(M)
Here z
and u
are two marking variables standing for the tree size
and the number of unary nodes, respectively. Once we run the tuner, all three variables
are decorated with respective numerical values, which the user can then use to compute
respective branching probabilities. A sampler designed with such values is guaranteed
to output Motzkin trees for which the expected size and mean number of unary nodes obey
the design specification.
Paganini is under constant development, supporting a growing class of so-called admissible constructors. For more specifications, please visit our
Paganini is available as a pip package.
pip install paganini
Paganini relies on published work of numerous excellent authors. Below, you can find a short (and definitely inexhaustive) list of papers on the subject:
If you are interested in the practical design of analytic samplers, we encourage you to check out the related Boltzmann Brain software.