Speaker
Jorge Fernández de Cossío Díaz
(CEA Paris-Saclay)
Description
Over the last decade machine learning has had tremendous impact on biological sequence data analysis. In this talk, I will begin by introducing general issues related to biological sequence modeling. I will then review a selection of recent works on this topic, including: i) generative models for sequence design, ii) sampling of evolutionary paths between natural sequences of different classes, and iii) predictive models of directed evolution. I will also discuss some sources of uncertainty that arise with biological sequence data in different contexts (alignment, phylogenetic correlations, sampling noise, …), their potential impact on the models, and efforts to mitigate it.
Author
Jorge Fernández de Cossío Díaz
(CEA Paris-Saclay)