9–12 Sept 2024
Imperial College London
Europe/London timezone

Exhaustive Symbolic Regression: Learning Astrophysics directly from Data

Not scheduled
2m
Lecture Theatre 2, Blackett Laboratory (Imperial College London)

Lecture Theatre 2, Blackett Laboratory

Imperial College London

Poster Social

Speaker

Harry Desmond (University of Portsmouth)

Description

A key challenge in the field of AI is to make machine-assisted discovery interpretable, enabling it not only to uncover correlations but also to improve our physical understanding of the world. A nascent branch of machine learning -- Symbolic Regression (SR) -- aims to discover the optimal functional representations of datasets, producing perfectly interpretable outputs (equations) by construction. SR is traditionally done using a “genetic algorithm” which stochastically selects trial functions by analogy with natural selection; I will describe the more ambitious approach of exhaustively searching and evaluating function space.

Coupled to an information-theoretic model selection principle based on minimum description length, our algorithm "Exhaustive Symbolic Regression" (ESR) is guaranteed to find the simple functions that optimally balance accuracy with simplicity on a dataset. This gives it broad application across science. I will detail the method, its relation to Bayesian statistics and an optional language model-based prior on functions designed to enhance their physicality. Then I will use ESR to quantify the extent to which state-of-the-art astrophysical theories -- FLRW cosmology, General Relativity and Inflation -- are implied by the current data.

Presentation materials

There are no materials yet.