9–12 Sept 2024
Imperial College London
Europe/London timezone

Exhaustive Symbolic Regression: Learning Astrophysics directly from Data

10 Sept 2024, 18:11
1m
Lecture Theatre 2, Blackett Laboratory (Imperial College London)

Lecture Theatre 2, Blackett Laboratory

Imperial College London

Poster Social

Speaker

Harry Desmond (University of Portsmouth)

Description

A key challenge in the field of AI is to make machine-assisted discovery interpretable, enabling it not only to uncover correlations but also to improve our physical understanding of the world. A nascent branch of machine learning – Symbolic Regression (SR) – aims to discover the optimal functional representations of datasets, producing perfectly interpretable outputs (equations) by construction.

SR is traditionally done using a “genetic algorithm”which stochastically selects trial functions by analogy with natural selection; I will describe the more ambitious approach of exhaustively searching and evaluating function space.

Coupled to an information-theoretic model selection principle based on minimum description length, our algorithm “Exhaustive Symbolic Regression” (ESR) is guaranteed to find the simple functions that optimally balance accuracy with simplicity on a dataset. This gives it broad application across science. I will detail the method, its relation to Bayesian statistics and an optional language model-based prior on functions designed to enhance their physicality. Then I will use ESR to quantify the extent to which state-of-the-art astrophysical theories – FLRW cosmology, General Relativity and Inflation – are implied by the current data.

Presentation materials