Home Seminars & Events CSE Community Seminar | Octobe…

CSE Community Seminar | October 31, 2025

Presenter Themistoklis Sapsis, Professor of Mechanical and Ocean Engineering, Department of Mechanical Engineering, MIT

Talk Title Optimal sampling and extreme-event aware learning for probabilistic modeling of extremes in complex systems

Abstract

Analysis of real-world physical and engineering systems is characterized by unique computational challenges associated with high dimensionality of parameter spaces, large cost of simulations or experiments, as well as existence of uncertainty. For a wide range of these problems the goal is to either quantify uncertainty and compute risk for critical events, optimize parameters or control strategies, and/or making decisions. In this talk we will discuss two aspects related to the modeling of extreme events: i) optimal sampling of data for the characterization of extreme events, and ii) extreme-event aware learning, i.e. learning methods that do not necessarily assume the existence of extreme events in the training data.

In the first part of the talk, we will discuss how Bayesian active learning provides a flexible framework for i) identifying the most informative data for extremes, which is relevant for engineering problems where optimal experimental design is feasible, or ii) quantify the value of data in a prescribed dataset, e.g. relevant for climate modeling. Despite its attractive properties, the Bayesian framework is often prohibitively expensive in terms of the required simulations or experiments, even in the active learning setting. We introduce a new class of acquisition functions that utilize a likelihood-weighted ratio that accounts for the importance of the output relative to the input. This ratio acts essentially as a probabilistic sampling weight and guides the sampling algorithm towards regions of the input space where the objective function assumes abnormal values, resulting in significant savings of computational or experimental resources needed for convergence. We show that the adopted acquisition functions can be rigorously derived as the asymptotic limit of an optimal acquisition function that has a minimax form over a functional space. Subsequently, we demonstrate their favorable properties compared to standard methods on benchmark functions commonly used in the optimization community as well as real world applications involving turbulence, fluid-structure interaction problems and optimal sensor placement.

In the second part of the talk, we examine the problem of learning for extremes without the assumption of extreme events in the training data. We introduce the framework of Extreme Event Aware (e2a or eta) or $\eta$-learning which does not assume the existence of extreme events in the available data. $\eta$-learning reduces the uncertainty even in `unchartered’ extreme event regions, by enforcing the extreme event statistics of a few observables during training, which can be available or assumed through qualitative arguments or other forms of analysis. This type of statistical regularization results in models that fit the observed data, but also enforces consistency with the prescribed or assumed statistics of some observables, enabling the generation of unprecedented extreme events even when the training data lack extremes therein. A series of theoretical results based on optimal transport theory offers a rigorous justification and highlights the optimality of the introduced method. The favorable properties $\eta$-learning are demonstrated in synthetic examples as well as a real world example involving modeling extreme precipitation events.