Morning Session (chair: TBA)
Afternoon Session (chair: TBA)
Morning Session (chair: TBA)
Afternoon Session (chair: TBA)
Morning Session (chair: Emtiyaz Khan)
Abstract: Modern deep learning systems impress with their capabilities but, at the same time, face considerable challenges: they require massive datasets and enormous computational resources, and they raise growing concerns about transparency and trustworthiness. In this talk, we will ask whether it really has to be this way and discuss some of the major challenges that limit the success of deep learning at smaller scales. We will offer algorithmic and theoretical insights into sparsity, overparameterization, and their implicit biases. Exploiting these theoretical insights, we will adapt the implicit bias of standard optimizers according to a dynamic sparsity principle, which achieves performance boosts comparable to SAM, yet with a complementary, novel mechanism.
Bio: I am a tenure-track faculty member at the Helmholtz Center CISPA, where I lead the Relational Machine Learning Group. Our research combines robust algorithm design and complex network science with the quest for a theoretical understanding of deep learning. Based on theoretical and experimental insights, we develop efficient models and algorithms that are robust to noise, adapt to a changing environment, and integrate information available from small amounts of data and various forms of domain knowledge. This makes our approach well suited for the biomedical domain and the sciences in general. While we care about solving real-world problems in collaboration with domain experts, we have a special interest in problems related to glycans, gene regulation, and its alterations during cancer progression.
Abstract: In this talk, I describe how sparse modeling techniques can be extended and adapted to facilitate dynamic sparsity in neural models, where different neural pathways are activated depending on the input. The building block is a family of sparse transformations induced by Tsallis entropies called alpha-entmax, a drop-in replacement for softmax which contains sparsemax as a particular case. Entmax transformations are differentiable and, unlike softmax, can return sparse probability distributions, which are useful for routing, interpretability, efficiency, and length generalization, and are less prone to phenomena such as dispersion, oversquashing, and representational collapse. They can also be used to design new Fenchel-Young loss functions, replacing the cross-entropy loss. Variants of these sparse transformations and losses have been applied with success to machine translation, natural language inference, visual question answering, Hopfield networks, reinforcement learning, and other tasks. I will discuss AdaSplash, an efficient implementation of entmax attention (https://arxiv.org/abs/2502.12082), and recent applications of these sparse losses to conformal prediction (https://arxiv.org/abs/2502.14773) and to generalized Bayesian inference (https://arxiv.org/abs/2502.10295).
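As a concrete illustration of the sparsity these transformations provide, here is a minimal NumPy sketch of sparsemax (the alpha = 2 case of alpha-entmax, following Martins & Astudillo, 2016); the input logits are made-up values for illustration only.

```python
import numpy as np

def sparsemax(z):
    """Euclidean projection of logits z onto the probability simplex.

    Unlike softmax, the result can contain exact zeros."""
    z = np.asarray(z, dtype=float)
    z_sorted = np.sort(z)[::-1]            # sort logits in descending order
    k = np.arange(1, len(z) + 1)
    cssv = np.cumsum(z_sorted)
    # support: the largest k with 1 + k * z_sorted[k-1] > cumulative sum
    support = 1 + k * z_sorted > cssv
    k_z = k[support][-1]
    tau = (cssv[support][-1] - 1.0) / k_z  # threshold subtracted from logits
    return np.maximum(z - tau, 0.0)

p = sparsemax([1.5, 1.0, -1.0])
# p = [0.75, 0.25, 0.0]: a sparse distribution, whereas softmax would
# assign strictly positive mass to all three entries
```

The exact zeros in the output are what makes such transformations attractive for routing and interpretability, since low-scoring entries are pruned entirely rather than merely downweighted.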
Bio: I am an Associate Professor at the Computer Science Department (DEI) and the Electrical and Computer Engineering Department (DEEC) at Instituto Superior Técnico. I am also the VP of AI Research at Unbabel in Lisbon, Portugal, and a Senior Researcher at the Instituto de Telecomunicações, where I lead the SARDINE Lab. Until 2012, I was a PhD student in the joint CMU-Portugal program in Language Technologies, at Carnegie Mellon University and at Instituto Superior Técnico, where I worked under the supervision of Mário Figueiredo, Noah Smith, Pedro Aguiar and Eric Xing. My research interests revolve around natural language processing and machine learning, more specifically sparse and structured transformations, uncertainty quantification, interpretability, and multimodal processing applied to machine translation, natural language generation, quality estimation, and evaluation. My research has been funded by an ERC Starting Grant (DeepSPIN) and a Consolidator Grant (DECOLLAGE), among other grants, and has received several paper awards at ACL conferences. I co-founded and co-organize the Lisbon Machine Learning School (LxMLS). I am a Fellow of the ELLIS society and a co-director of the ELLIS Program in Natural Language Processing. I am a member of the Lisbon Academy of Sciences and of the Research & Innovation Advisory Group (RIAG) of the EuroHPC Joint Undertaking.
Afternoon Session (chair: Joe Austerweil)
Abstract: Sampling-based inference is often regarded as the gold standard for posterior inference in Bayesian neural networks (BNNs), yet it continues to face skepticism regarding its practicality in large-scale or complex models. This perception has been challenged by recent methodological and computational advances that significantly broaden the scope of feasible applications. The presentation examines how sampling operates in BNNs, how performance can be improved through targeted adaptations, and why not all sampling procedures are equally effective. It further explores the role of implicit regularization induced by both the network architecture and the sampling dynamics. The discussion points toward future opportunities where sampling may redefine Bayesian deep learning, contingent on addressing current challenges in scalability, efficiency, and inference cost.
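As a minimal illustration of the kind of sampling the talk examines, the following sketch runs stochastic-gradient Langevin dynamics (Welling & Teh, 2011) on a toy one-dimensional posterior; the target distribution, step size, and burn-in length are illustrative choices, not the specific methods covered in the talk.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy target posterior: N(mu=2.0, var=0.5). A Langevin step is a
# gradient-ascent step on the log-posterior plus Gaussian noise.
mu, var = 2.0, 0.5

def grad_log_post(theta):
    return -(theta - mu) / var

lr = 1e-2
theta = 0.0
samples = []
for t in range(20000):
    noise = rng.normal(scale=np.sqrt(lr))          # injected noise, std sqrt(lr)
    theta += 0.5 * lr * grad_log_post(theta) + noise
    if t > 2000:                                   # discard burn-in
        samples.append(theta)

print(np.mean(samples), np.var(samples))  # ≈ 2.0 and ≈ 0.5
```

In a real BNN the gradient would be a minibatch estimate over network weights, which is exactly where the scalability and efficiency questions raised in the abstract come in.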
Bio: I am an Associate Professor at LMU Munich, heading the Munich Uncertainty Quantification AI Lab, an ELLIS Member, an Associated Fellow of the Konrad Zuse School of Excellence in Reliable AI (relAI), and a Principal Investigator of the Munich Center for Machine Learning (MCML). My current research involves the development of uncertainty quantification for deep learning approaches (using, e.g., a Bayesian paradigm), the unification of concepts from statistics and deep learning, and the study of overparametrization in neural networks. See also my Google Scholar profile for some of my recent research.
The day will feature talks on many recent works from the CREST team.
Morning Session (chair: Pierre Alquier)
Abstract: Research achievements of the ABI team and CREST.
Abstract: Variational learning with the Improved Variational Online Newton method (IVON) can consistently match or outperform Adam for training large networks such as GPT-2 and ResNets from scratch. IVON's computational costs are nearly identical to Adam's, but its predictive uncertainty is better. The talk gives a broad overview of several projects that have used IVON since its publication last year. In particular, I will talk about a theoretical analysis of IVON's promising performance, connections to adaptive label smoothing, and its usefulness for multimodal models, language generation with LLMs, and federated learning. Link to the paper
Bio (Thomas): Thomas Möllenhoff received his PhD in Informatics from the Technical University of Munich in 2020. From 2020 to 2023, he was a post-doc in the Approximate Bayesian Inference Team at RIKEN. Since 2023 he has worked at RIKEN as a research scientist, and since 2025 as a senior research scientist. His research focuses on optimization and Bayesian deep learning and has received several awards, including the Best Paper Honorable Mention at CVPR 2016 and first place at the NeurIPS 2021 Challenge on Approximate Inference.
Abstract: Humans and animals have a natural ability to autonomously learn and continually adapt, but modern AI models, despite their amazing performance, cannot do so and remain extremely costly to train. I will present a new learning paradigm called Adaptive Bayesian Intelligence to bridge this gap. I will show that a wide range of adaptive methods can all be seen as different ways of “correcting” the approximate posteriors. Better posteriors lead to smaller corrections, which in turn imply faster and cheaper adaptation. The result is obtained by using a dual-perspective of the Bayesian Learning Rule (Khan and Rue, 2023), giving rise to a new Bayesian-Duality principle. I will demonstrate the effectiveness of the new principle on continual and federated deep learning, as well as merging and finetuning of LLMs. Link to the paper
Afternoon Session (chair: Jonghyun Choi)
Abstract: We provide new connections between two distinct federated learning approaches based on (i) ADMM and (ii) Variational Bayes (VB), and propose new variants by combining their complementary strengths. Specifically, we show that the dual variables in ADMM naturally emerge through the 'site' parameters used in VB with isotropic Gaussian covariances. Using this, we derive two versions of ADMM from VB that use flexible covariances and functional regularisation, respectively. Through numerical experiments, we validate the resulting performance improvements. The work shows a connection between two fields previously believed to be fundamentally different and combines them to improve federated learning. Link to the paper
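For readers unfamiliar with the ADMM side of this connection, here is a toy consensus-ADMM sketch on a synthetic quadratic "federated" problem; the client losses, penalty rho, and iteration count are illustrative choices, not the paper's setup. The scaled dual variables u below are the quantities the abstract relates to the VB 'site' parameters.

```python
import numpy as np

# Toy federated problem: client i holds f_i(w) = 0.5 * ||w - b_i||^2,
# so the consensus solution is the mean of the b_i.
b = np.array([[1.0, 2.0], [3.0, 0.0], [-1.0, 4.0]])  # one row per client
rho = 1.0
n, d = b.shape
w = np.zeros((n, d))   # local models
u = np.zeros((n, d))   # scaled dual variables
z = np.zeros(d)        # global consensus model

for _ in range(100):
    w = (b + rho * (z - u)) / (1 + rho)  # local step (closed form for quadratics)
    z = (w + u).mean(axis=0)             # server aggregation
    u += w - z                           # dual ascent

print(z)  # → converges to b.mean(axis=0) = [1., 2.]
```

For general deep-learning losses the local step becomes a few epochs of SGD rather than a closed-form solve, but the three-step structure (local update, aggregation, dual update) is the core that has remained unchanged.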
Abstract: ADMM is a popular method for federated deep learning. It originated in the 1970s and, even though many new variants have been proposed since then, its core algorithmic structure has remained unchanged. In this talk, we will introduce a structure called Bayesian Duality, which exploits a duality of the posterior distributions obtained by solving a variational-Bayesian reformulation of the original problem. We show that this naturally recovers the original ADMM when isotropic Gaussian posteriors are used, and yields non-trivial extensions for other posterior forms. For instance, full-covariance Gaussians lead to Newton-like variants of ADMM, while diagonal covariances result in a cheap Adam-like variant. This is especially useful for handling heterogeneity in federated deep learning, giving up to 7% accuracy improvements over recent baselines. Link to the paper
Bio: Thomas Möllenhoff received his PhD in Informatics from the Technical University of Munich in 2020. From 2020 to 2023, he was a post-doc in the Approximate Bayesian Inference Team at RIKEN. Since 2023 he has worked at RIKEN as a research scientist, and since 2025 as a senior research scientist. His research focuses on optimization and Bayesian deep learning and has received several awards, including the Best Paper Honorable Mention at CVPR 2016 and first place at the NeurIPS 2021 Challenge on Approximate Inference.
Abstract: In this talk, I will explain the role played by the Mathematical Science Team at RIKEN AIP and some of the things we have been working on. I will talk about my original research field of arithmetic theory and the application of such thinking to the field of statistical mechanics. Within the first part of my talk, Akinori Tanaka of our team will give a five-minute introduction to his research. Then I will give a short introduction to the theory of Lie Group Updates, developed by members of Emti's team and our team: Mehmet, Thomas, Koiichi, and Emti. Link to the paper
Abstract: In the Lie-group Bayesian learning framework, assuming a log-normal distribution over the weights leads to multiplicative weight dynamics, including both parameter updates and noise injection. We propose a novel Log-Normal Multiplicative Dynamics (LMD) optimizer that scales to models at the scale of Vision Transformer (ViT) and GPT-2. We show that leveraging LMD enables forward matrix multiplications during training to be executed in low-precision formats without significant performance degradation in large neural networks. Link to the paper
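To give a feel for multiplicative weight dynamics, here is an illustrative toy sketch under simplifying assumptions (this is not the LMD optimizer itself): a positive weight is parameterized log-normally, so additive noisy updates in log-space become multiplicative updates and multiplicative noise on the weight. The objective and all constants are made up for the example.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy objective L(w) = 0.5 * (w - 3.0)^2, minimized at w = 3.
# Log-normal parameterization: w = exp(m + s * eps), eps ~ N(0, 1),
# so noise enters multiplicatively and updates on m scale w multiplicatively.
m, s, lr = 0.0, 0.1, 0.05
for _ in range(2000):
    eps = rng.normal()
    w = np.exp(m + s * eps)   # multiplicative noise injection
    grad_w = w - 3.0          # dL/dw
    m -= lr * grad_w * w      # chain rule: dL/dm = dL/dw * dw/dm = grad_w * w
    # this multiplies exp(m) by exp(-lr * grad_w * w): a multiplicative update

print(np.exp(m))  # ≈ 3, up to stochastic fluctuation and a small noise-induced bias
```

Because the weight never changes sign and updates act on its logarithm, dynamics of this kind interact naturally with low-precision formats, which is the regime the abstract targets at ViT and GPT-2 scale.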
Abstract: Bayesian methods offer clear theoretical advantages and have performed well on modest-sized problems. Scaling them to today's billion-parameter language models, however, remains challenging. In this talk, I will review recent efforts to bring Bayesian ideas into LLM adaptation. I will then share our latest results on improving Low-Rank Adaptation (LoRA) fine-tuning of LLMs with the IVON optimizer.
Bio: I am a second-year Ph.D. student in the School of Computing at the Institute of Science Tokyo, working with Prof. Rio Yokota. I hold a B.E. in Biomedical Engineering from Shanghai Jiao Tong University and an M.Eng. in Computer Science from Tokyo Institute of Technology. I have research experience in high-performance computing, deep learning optimizers, and Bayesian deep learning, and I am currently interested in scaling Bayesian methods to large models. I am part of the IVON project and serve as a maintainer of the IVON repository on GitHub. Link to the paper
Morning Session (chair: Siddharth Swaroop)
Abstract: In this talk I will share a few insights into in-context learning from recent works I have been involved in, and take a step back to discuss the interactions between ICL and adaptation. While some of the hypotheses I will put forward are admittedly speculative, the more grounded part of the talk will focus on https://arxiv.org/pdf/2505.00661, https://arxiv.org/pdf/2503.21676 and related work. In particular, I will discuss early empirical evidence that the reversal curse behaves differently when resolved in context versus learned by gradient descent. I will also look at how transformers might learn and store facts, and how that can affect adapting or unlearning them. Overall, the hope is to stimulate questions and to argue that sequential models, and transformers specifically, can have particular learning dynamics that are worth studying and considering when we think about adaptation.
Bio: I'm currently a Research Scientist at DeepMind and an Affiliate Member at MILA. I grew up in Romania and studied computer science and electrical engineering as an undergraduate in Germany. I received my MSc from Jacobs University, Bremen in 2009 under the supervision of Prof. Herbert Jaeger, and my PhD from the University of Montreal in 2014 under the supervision of Prof. Yoshua Bengio. My PhD thesis can be found here. I was involved in developing Theano and helped write some of the deep learning tutorials for Theano. I've published several papers on topics surrounding deep learning and deep reinforcement learning (see my scholar page). I'm one of the organizers of EEML (www.eeml.eu) and part of the organizing team of AIRomania. As part of the AIRomania community, I have organized RomanianAIDays since 2020 and helped build a course on AI aimed at high school students.
Abstract: How large should a neural network be? We argue that adjusting neural network size to what the problem requires is important in continual learning settings, as well as for computational efficiency. We study this problem in the single-layer case and give philosophical, theoretical, and practical arguments for why approximations to Gaussian processes are a natural way to solve it. This leads to an elegant method for growing and shrinking the neuron count in approximate Gaussian processes, with principles that prescribe how most hyperparameters should be set, making the method very automatic. We show sophisticated behaviours of this method depending on dataset characteristics, e.g. saturation of the neuron count in continual learning when data are redundant, and a drop in the neuron count when "grokking" occurs. We believe these ideas will be helpful in the generalisation to deep models.
Bio: I am an Associate Professor in the Department of Computer Science at the University of Oxford, researching machine learning, and a Tutorial Fellow at Hertford College. Together with my research group, I work on three central questions: How do we find general patterns that allow generalization beyond the training set, without humans manually encoding them? (Equivariance, causality, continual learning…). How can we create neurons that automatically assemble their connectivity structure (architecture), while minimising the computational costs of the network as a whole? (Generalisation bounds, Bayesian model selection, MDL, meta-learning). How do we interact with the environment, while avoiding risk but learning as quickly as possible? (Bayesian optimisation, foundation models for industrial applications e.g. chemistry). These improvements are relevant for both small-data statistics (Gaussian processes), and large-data machine learning (neural networks). A wide range of research topics contribute towards these improvements, such as invariance, Bayesian inference, causality, meta-learning, local learning rules and generative modelling. My work has been presented at the leading machine learning conferences (e.g. NeurIPS and ICML), and includes a best paper award.
Afternoon Session (chair: Weiwei Pan)
Abstract: Processing multimodal signals reliably is (still) a challenge. In this talk, I will argue that we need strong multimodal representations, good reasoning, and, finally, models that can become self-aware. Along these dimensions, I will discuss various research directions we are currently pursuing, including, among others, video & language modeling, selective prediction for visual question answering, multimodal fact checking, and backdoor attacks for unlearning in diffusion models. I will conclude with a discussion of future work.
Bio: I have started a new lab, "Multimodal Reliable AI", at TU Darmstadt, Germany, as an Alexander von Humboldt Professor and with a LOEWE Professorship (W3 / Full Professor). Previously, I was a FAIR Research Scientist at Meta AI (2017-2023) and a PostDoc at the University of California, Berkeley (EECS and ICSI) with Trevor Darrell (2014-2017), and I did my PhD at the Max Planck Institute for Informatics with Bernt Schiele (2010-2014). My interests include computer vision, computational linguistics, and machine learning, as well as how to make models reliable so we can trust them.
Abstract: Driven by the goals of "augmenting diversity", increasing speed, and reducing cost, the use of synthetic data as a replacement for human participants is gaining traction in AI research and product development (Agnew et al., 2024). This talk critically examines the claim that synthetic data can "augment diversity", arguing that this notion is empirically unsubstantiated, conceptually flawed, and epistemically harmful. While speed and cost-efficiency may be achievable, they often come at the expense of rigor, insight, and robust science. Drawing on research from dataset audits, model evaluations, Black feminist scholarship, and complexity science, I argue that replacing human participants with synthetic data risks producing real-world and epistemic harms at worst, and superficial knowledge and cheap science at best.
Bio: I am a cognitive scientist researching human behaviour, social systems, and responsible and ethical Artificial Intelligence (AI). I recently finished my PhD, in which I explored the challenges and pitfalls of automating human behaviour through critical examination of existing computational models and audits of large-scale datasets. I am currently a Senior Fellow in Trustworthy AI at the Mozilla Foundation. I am also an Adjunct Lecturer/Assistant Professor in the School of Computer Science and Statistics at Trinity College Dublin, Ireland.