Towards Denotational Semantics of AD for Higher-Order, Recursive, Probabilistic LanguagesRemote
Automatic differentiation (AD) aims to compute derivatives of user-defined functions, but in Turing-complete languages, this simple specification does not fully capture AD’s behavior: AD sometimes disagrees with the true derivative of a differentiable program, and when AD is applied to non-differentiable or effectful programs, it is unclear what guarantees (if any) hold of the resulting code. We study an expressive differentiable programming language, with piecewise-analytic primitives, higher-order functions, and general recursion. Our main result is that even in this general setting, a version of Lee et al. (2020)’s correctness theorem (originally proven for a first-order language without partiality or recursion) holds: all programs denote so-called omega-PAP functions, and AD computes correct intensional derivatives of them. Mazza and Pagani (2021)’s recent theorem, that AD disagrees with the true derivative of a differentiable recursive program at a measure-zero set of inputs, is derived as a corollary of this fact. We also apply the framework to study probabilistic programs, and recover a recent result from Mak et al. (2021) via a novel denotational argument.
Slides (LAFI2022_Alex_Lew.pdf) | 3.10MiB |
Sun 16 JanDisplayed time zone: Eastern Time (US & Canada) change
15:05 - 16:20 | |||
15:05 18mTalk | Towards Denotational Semantics of AD for Higher-Order, Recursive, Probabilistic LanguagesRemote LAFI Alexander K. Lew Massachusetts Institute of Technology, USA, Mathieu Huot Oxford University, Vikash K. Mansinghka MIT File Attached | ||
15:23 18mTalk | A Language and Smoothed Semantics for Convergent Stochastic Gradient DescentRemote LAFI File Attached | ||
15:42 18mTalk | Nonparametric Involutive Markov Chain Monte CarloRemote LAFI Carol Mak University of Oxford, Fabian Zaiser University of Oxford, C.-H. Luke Ong University of Oxford File Attached | ||
16:01 18mTalk | Rigorous Approximation of Posterior Inference for Probabilistic ProgramsRemote LAFI Fabian Zaiser University of Oxford, Raven Beutner CISPA Helmholtz Center for Information Security, Germany, C.-H. Luke Ong University of Oxford File Attached |