PhaseGen: Exact Solutions for Time-Inhomogeneous Multivariate Coalescent Distributions under Diverse Demographies
Sendrowski, J.; Hobolth, A.
Abstract
Phase-type theory is emerging as a powerful framework for modeling coalescent processes, allowing for the exact computation of quantities of interest. These include moments of tree height, total branch length, and the site-frequency spectrum (SFS), as well as the full distribution of the time to the most recent common ancestor (TMRCA). However, prior applications have largely been limited to time-homogeneous settings, with constant population sizes and migration rates, restricting the range of demographic scenarios that can be modeled. In this study, we apply time-inhomogeneous phase-type theory to enable the exact computation of (cross-)moments of arbitrary order and reward structure under piecewise-constant demographies. This extension enables the modeling of significantly more complex demographic scenarios, including population expansions, contractions, bottlenecks, and splits. It furthermore supports the fitting of demographic models to data through gradient-based optimization. To support these advancements, we introduce PhaseGen, a software package designed for the numerically stable computation of exact moments under diverse demographic scenarios, with support for gradient-based parameter estimation.
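To make the idea of exact moments under a piecewise-constant demography concrete, the following is a minimal sketch of the simplest case: the expected TMRCA of n lineages in a single population whose relative size changes at fixed epoch boundaries. It is not PhaseGen's API or implementation. All function names are hypothetical, time is assumed to be measured in units where a pair of lineages coalesces at rate 1 in a population of relative size 1, and the matrix exponential is a plain scaling-and-squaring Taylor series chosen only to keep the example free of dependencies. The sketch uses a backward recursion over epochs: if h holds the mean absorption times under constant size 1, then within an epoch of size N and length dt the expected remaining time is (I - P) N h + P m_next, with P = exp(S dt / N).

```python
import math


def kingman_subintensity(n):
    """Sub-intensity matrix over transient states n, n-1, ..., 2 lineages
    for relative population size 1 (pairwise coalescence rate 1)."""
    m = n - 1
    S = [[0.0] * m for _ in range(m)]
    for i, k in enumerate(range(n, 1, -1)):
        rate = k * (k - 1) / 2.0
        S[i][i] = -rate
        if i + 1 < m:
            S[i][i + 1] = rate  # k lineages -> k - 1 lineages
    return S


def mat_mul(A, B):
    cols = list(zip(*B))
    return [[sum(a * b for a, b in zip(row, col)) for col in cols] for row in A]


def mat_vec(A, v):
    return [sum(a * x for a, x in zip(row, v)) for row in A]


def mat_exp(A, t, terms=25):
    """exp(A * t) via scaling and squaring with a truncated Taylor series."""
    m = len(A)
    norm = max(sum(abs(x) for x in row) for row in A) * abs(t)
    squarings = max(0, math.ceil(math.log2(norm)) + 1) if norm > 0 else 0
    scale = t / 2 ** squarings
    B = [[x * scale for x in row] for row in A]
    result = [[float(i == j) for j in range(m)] for i in range(m)]
    term = [row[:] for row in result]
    for k in range(1, terms):
        term = [[x / k for x in row] for row in mat_mul(term, B)]
        result = [[r + x for r, x in zip(rr, tr)] for rr, tr in zip(result, term)]
    for _ in range(squarings):
        result = mat_mul(result, result)
    return result


def mean_absorption_times(S):
    """Solve (-S) h = 1 by back substitution (S is upper triangular here)."""
    m = len(S)
    h = [0.0] * m
    for i in range(m - 1, -1, -1):
        rhs = 1.0 + sum(S[i][j] * h[j] for j in range(i + 1, m))
        h[i] = rhs / -S[i][i]
    return h


def expected_tmrca(n, epoch_starts, pop_sizes):
    """Expected TMRCA of n lineages; epoch e starts at epoch_starts[e]
    with relative size pop_sizes[e], and the last epoch is unbounded."""
    S = kingman_subintensity(n)
    h = mean_absorption_times(S)
    # In an epoch of size N the rates are S / N, so mean times scale by N.
    m = [pop_sizes[-1] * x for x in h]
    for e in range(len(pop_sizes) - 2, -1, -1):
        dt = epoch_starts[e + 1] - epoch_starts[e]
        P = mat_exp(S, dt / pop_sizes[e])  # transition matrix across epoch e
        h_e = [pop_sizes[e] * x for x in h]
        Ph, Pm = mat_vec(P, h_e), mat_vec(P, m)
        m = [a - b + c for a, b, c in zip(h_e, Ph, Pm)]
    return m[0]  # the process starts with all n lineages


# Constant size recovers the classical result 2 * (1 - 1 / n).
constant = expected_tmrca(10, [0.0], [1.0])
# Two lineages, size doubling (backwards in time) at t = 0.5;
# the analytical value is 1 + exp(-0.5).
doubling = expected_tmrca(2, [0.0, 0.5], [1.0, 2.0])
```

Higher moments, other reward structures (such as total branch length or SFS entries), and multiple populations with migration would need a larger state space and reward-transformed matrices, which this sketch deliberately leaves out.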