Bioinformatics:Decoding the Genome (1/5)
by
Introduction and presentation of the cycle
by Chris Jones/CERN
Material:
Announcement -
transparencies
Extracting the fundamental genomic sequence from the DNA
From Genome to Sequence
Biology in the early 21st century has been radically transformed by the availability of the full genome sequences of an ever increasing number of life forms, from bacteria to major crop plants and to humans. The lecture will concentrate on the computational challenges associated with the production, storage and analysis of genome sequence data, with an emphasis on mammalian genomes. The quality and usability of genome sequences is increasingly conditioned by the careful integration of strategies for data collection and computational analysis, from the construction of maps and libraries to the assembly of raw data into sequence contigs and chromosome-sized scaffolds. Once the sequence is assembled, a major challenge is the mapping of biologically relevant information onto this sequence: promoters, introns and exons of protein-encoding genes, regulatory elements, functional RNAs, pseudogenes, transposons, etc. The methodological approaches and data requirements for genome annotation will be discussed, as well as user interfaces for exploring genomes.