| 25 Jun 2026 |
Tiffany Vlaar
University of Glasgow
|
Dissecting Deep Neural Networks
|
|
First, I will discuss the role played by individual layers and substructures of deep neural networks. Layer-wise sensitivity to the choice of initialisation and optimiser hyperparameter settings varies. I will show that training neural network layers differently can lead to enhanced generalisation performance and/or reduced computational cost, as well as affect resulting adversarial robustness. I will then consider the role of data and how image annotator variability in real-world ecological datasets affects the performance of deep learning models.
|
| 18 Jun 2026 |
Alexander Heinlein
Delft University of Technology
|
Surrogate Models for Complex Physical Systems
|
|
Many problems in science and engineering require the repeated simulation of complex physical systems. Applications such as uncertainty quantification, optimization, parameter estimation, and related many-query tasks can therefore become computationally prohibitive when relying solely on high-fidelity numerical simulations. Machine-learning surrogate models are designed to approximate such simulations at a significantly reduced computational cost while enabling the integration of observational and experimental data. In this talk, we discuss surrogate modeling approaches based on convolutional and graph neural networks, as well as operator-learning methods that approximate mappings between function spaces. Particular emphasis is placed on applications involving complex geometries, multiscale phenomena, high-dimensional parametrizations, and limited training data. We further discuss how physical knowledge can be incorporated into these models and combined with data, with applications in fluid dynamics, geoscience, Earth system modeling, and biomedical simulations.
|
| 5 Jun 2026 |
Matti Vihola
University of Jyväskylä
|
Mixing time of the conditional backward sampling particle filter
|
JRSSB
|
The conditional backward sampling particle filter (CBPF; also known as the particle Gibbs with ancestor sampling, PGAS) is a powerful Markov chain Monte Carlo sampler for general state space hidden Markov model smoothing. It was proposed as an improvement over the conditional particle filter, which is known to have an O(T^2) computational time complexity under a general ‘strong mixing’ assumption of the model, where T is the time horizon. We provide the first proof that the CBPF admits an O(T log T) computational complexity under strong mixing, complementing strong empirical evidence of the superiority of the CBPF in practice. In particular, the CBPF’s mixing time is upper bounded by O(log T), for any sufficiently large number of particles N that depends only on the mixing constants and not T. We show that an O(log T) mixing time is optimal. The proof involves the analysis of a novel coupling of two CBPFs, which involves a maximal coupling of two particle systems at each time instant. The coupling is implementable, and thus can also be used to construct unbiased, finite variance, estimates of functionals which have arbitrary dependence on the latent state’s path, with a total expected cost of O(Tlog T).
The talk is based on joint work with Joona Karjalainen, Sumeetpal S. Singh and Anthony Lee.
|
| 4 Jun 2026 |
Tamas Papp
National University of Singapore
|
Efficiency and tuning of unbiased MCMC
|
Paper [1]
Paper [2]
Paper [3]
|
The recently proposed unbiased MCMC methodology of Jacob et al. [1] eliminates the burn-in bias of standard MCMC, simplifying downstream inference and allowing more effective use of parallel computing resources. Although the method has received continued attention since its release [e.g. 2,3], key questions remain under-studied:
How does the efficiency of unbiased MCMC compare to that of standard MCMC? How should one tune unbiased MCMC for optimal performance?
In this talk, I will discuss our ongoing work tackling these related questions. For the former, we provide sharp non-asymptotic bounds that demonstrate that unbiased MCMC is as efficient as standard MCMC, while additionally being unbiased. For the latter, we provide principled yet generic tuning guidelines that achieve near-optimal efficiency, and that increase the efficiency by up to 10^6× (!!) compared to prior tuning. Our work demonstrates that unbiased MCMC is a competitive alternative to standard MCMC.
This talk is based on joint work with Sam Power (University of Bristol).
|
| 28 May 2026 |
Edwin Fong
University of Hong Kong
|
Quantile Martingale Posteriors
|
|
In this talk, we introduce a novel Bayesian nonparametric method for quantile estimation/regression based on the martingale posterior (MP) framework. The core idea of the MP is that posterior sampling is equivalent to predictive imputation, which allows us to break free of the stringent likelihood-prior specification. We demonstrate that a recursive estimate of a smooth quantile function, subject to a martingale condition, is entirely sufficient for full nonparametric Bayesian inference. We term the resulting posterior distribution as the quantile martingale posterior (QMP), which arises from an implicit generative predictive distribution. Associated with the QMP is an expedient, MCMC-free and parallelizable posterior computation scheme, which can be further accelerated with an asymptotic approximation based on a Gaussian process. Furthermore, the well-known issue of monotonicity in quantile estimation is naturally alleviated through increasing rearrangement due to the connections to the Bayesian bootstrap, and the QMP has a particularly tractable form that allows for comprehensive theoretical study.
|
| 21 May 2026 |
Gabriel Diaz-Aylwin
Lancaster University
|
Constrained Bayesian Optimisation of Field-Valued Constraints with an application to Fusion Reactors.
|
|
I will discuss an approach for constrained Bayesian optimisation which uses ideas from PDE-constrained optimisation to pre-compute an approximation of the feasible subset of the optimisation domain. This approximation is injected as a prior into the traditional optimisation loop. I will also introduce and discuss its application to divertor optimisation in tokamak fusion reactors via the Grad-Shafranov equation. This is work in progress, in collaboration with the UK Atomic Energy Authority.
|
| 14 May 2026 |
Chris Nemeth
Lancaster University
|
Hypergraph Generation via Structured Stochastic Diffusion
|
|
Hypergraphs model higher-order interactions, but realistic hypergraph generation remains difficult because incidence, hyperedge-size heterogeneity, and overlap structure are not faithfully captured by pairwise reductions. We propose HEDGE, a generative model defined directly on relaxed incidence matrices via a structured stochastic diffusion. The forward process combines a hypergraph-specific two-sided heat operator with an Ornstein–Uhlenbeck component, preserving structure-aware noising near the data while yielding an explicit Gaussian terminal law. Conditional on an observed hypergraph, this forward process is linear-Gaussian, so conditional means, covariances, scores, and reverse-drift targets are available in closed form. We therefore learn a permutation-equivariant state-only reverse-drift field in incidence space by regressing onto exact conditional targets, and generate samples by simulating a learned reverse-time SDE from the Gaussian base law. We establish exactness in the ideal state-only setting together with finite-horizon stability guarantees, and empirically show improved hypergraph generation quality relative to strong baselines.
|
| 30 Apr 2026 |
Liam Llamazares-Elias
Lancaster University
|
Non-stationary Gaussian fields and Penalized Complexity Priors
|
Slides
|
Gaussian random fields (GFs) are fundamental tools in spatial modeling and can be represented flexibly and efficiently as solutions to stochastic partial differential equations (SPDEs). The SPDEs depend on specific parameters, which govern various field behaviors and can be estimated using Bayesian inference. Informative priors are essential to ensure meaningful posterior covariance structures. This study builds on previous work by constructing penalized complexity (PC) priors for a smooth, invertible parameterization of the correlation range, diffusion matrix, and variance of a non-stationary GF. The formulated prior is weakly informative, effectively penalizing complexity by pushing the model towards stationarity while allowing for enough flexibility to capture non-stationary behavior. The model is applied to model precipitation in Spain, particulate matter in California, and electoral data in France with promising results.
|
| 23 Apr 2026 |
Rui Zhang
Lancaster University
|
Why Should We Care About Wasserstein Gradient Flows?
|
Slides
|
Wasserstein gradient flow (WGF) has emerged as a useful tool in computational statistics and machine learning from both a theoretical and a methodological point of view. From the theoretical side, WGF helps to interpret the bias of unadjusted Langevin, as well as establish new convergence bounds. From the methodological side, WGF formulation fosters novel sampling algorithms such as Stein variational gradient descent. In addition, the WGF formulation allows us to sample from posteriors that are unattainable via classical methods like MCMC, such as the post-Bayesian predictively-oriented posteriors. In this talk, we will survey some of these developments, especially those that are more relevant to BayesComp methodologies, and (hopefully) keep the mathematics approachable.
|