Understanding how hidden probabilities change over time presents a significant challenge in many fields, from tracking social trends to modelling genetic diversity. Marco Dalla Pria from University of Torino, Matteo Ruggiero from Stern at NYU Abu Dhabi, and Dario Spanò from University of Warwick, along with their colleagues, now offer a powerful new approach to this problem. Their research introduces a method for accurately tracking these evolving, unobserved probabilities using a sophisticated statistical model, even when only summarised cluster data are available. The team achieves this by developing a unique inferential framework that avoids complex simulations and instead relies on a clever mathematical duality, allowing them to calculate exact probabilities and make predictions with full uncertainty quantification, representing a substantial advancement over existing techniques like particle filtering. This breakthrough promises to improve analysis in areas where understanding dynamic, hidden structures is crucial.
To address the computational challenges of analyzing complex models, researchers developed an inferential framework that avoids direct simulation of the latent state, instead exploiting a duality between the model and a pure-death process on partitions, alongside operators that capture the impact of new data.
Bayesian Clustering with Flexible Poisson-Dirichlet Models
This research introduces a new method for estimating the current state of a system evolving over time, using a statistical model combining a two-parameter Poisson-Dirichlet diffusion and a Hidden Markov Model structure. The key innovation lies in utilizing a dual process, a related, simpler process that allows for exact filtering in certain cases, a significant achievement given the usual difficulty of filtering in these types of models. The team demonstrates that this dual process approach yields smoother, more accurate estimates compared to traditional methods, particularly with limited data, and reveals connections to population genetics. This method offers potential benefits including exact inference, improved accuracy, computational efficiency, and broad applicability to fields like population genetics, ecology, finance, machine learning, and bioinformatics. In essence, this work provides a powerful tool for analyzing complex dynamic systems with hidden states and evolving group structures, offering a way to overcome computational challenges and achieve more reliable inference.
Hidden Probability Tracking via Poisson-Dirichlet Models
Scientists have developed a new method for tracking the evolution of probability distributions over time, even when those distributions are hidden. The work employs a nonparametric approach that analyzes discrete data consisting of unlabelled partitions, grouping similar items without pre-defined categories, and utilizes a two-parameter Poisson-Dirichlet distribution to represent complex, changing probabilities. To overcome computational challenges, the team devised an inferential framework that avoids enumerating all possible groupings or simulating the hidden process directly, exploiting a mathematical duality between the model and a pure-death process on partitions. This delivers closed-form, recursive updates for both forward and backward inference, enabling precise calculations of the latent state at any point in time and accurate predictions of future partitions. Experiments demonstrate that this method achieves higher accuracy and lower variance compared to particle filtering techniques, while also providing computational gains, successfully recovering interpretable patterns in time-varying data. The breakthrough delivers a structurally exact filter and smoother for a continuous-time model, leveraging the finite combinatorics of the dual process.
Tracking Latent Dynamics From Limited Data
This research presents a new method for tracking unobserved probability distributions that change over time, using only limited, aggregate data. The team developed a statistical framework based on a two-parameter Poisson-Dirichlet process, allowing them to model latent processes from discrete observations of unlabelled groupings, such as those found in social and genetic data. Crucially, they devised an efficient inferential approach that avoids computationally intensive techniques, instead leveraging a duality between the model and a pure-death process on partitions. The resulting method achieves accurate estimations of both past and future states of the latent process, along with reliable measures of uncertainty, and demonstrates improved accuracy and computational speed compared to existing techniques. Validation through experiments and an application to dynamic social networks confirms the method’s ability to recover meaningful patterns from complex data, offering potential for broader impact across various scientific fields.
👉 More information
🗞 Exact inference via quasi-conjugacy in two-parameter Poisson-Dirichlet hidden Markov models
🧠 ArXiv: https://arxiv.org/abs/2512.22098
