30 January 2024 to 27 February 2024
University of Houston - Main Campus
US/Central timezone

On a Kneading Theory for Gene-Splicing

24 Feb 2024, 14:12
12m
University of Houston - Main Campus

University of Houston - Main Campus

101 Farish Hall
Talk Biological and Statistical Physics Statistical Physics and Condensed Matter and Others

Speaker

Ethan Speakman

Description

Two well-known facets in protein synthesis in eukaryotic cells are transcription of DNA to pre-RNA in the nucleus and the translation of messenger-RNA (mRNA) to proteins in the cytoplasm. A critical intermediate step is the removal of segments (introns) containing ∼97% of the nucleic acid sites in pre-RNA and sequential alignment of the retained segments (exons) to from mRNA through a process referred to as splicing. Alternative forms of splicing enrich the proteome while abnormal splicing can enhance the likelihood of a cell developing cancer or other diseases. Mechanisms for splicing and origins of splicing errors are only partially known.
Our goal is to determine if rules on splicing can be inferred from data analytics on nucleic acid sequences. We represent a nucleic acid site as a point in a plane defined in terms of the anterior and posterior sub-sequences of the site. It is found that point-sets for exons and introns are visually different, and that the differences can be quantified using a family of generalized moments. We design a machine-learning algorithm that can recognize individual exons or introns with 91% accuracy. Point-set distributions and generalized moments are found to differ between organisms.

Academic year 4th year
Research Advisor Gemunu Gunaratne

Authors

Ethan Speakman Dr Gemunu Gunaratne (University of Houston)

Presentation materials