Speaker
Michael Douglas
(Stony Brook University)
Description
Machine learning critically depends on high quality datasets. In a theoretical subject like string theory, we can generate datasets, but what sort of data should we generate and study? We discuss this question from several perspectives: mathematical (generating solutions), statistical (getting representative samples), and methodological (improving access to prior work).
Primary author
Michael Douglas
(Stony Brook University)