Some interesting data: Cryptic DNA sequence periodicities are ubiquitous, organism specific and distinguish introns from exons.
- đ¤ Speaker: Giselle Walker
- đ Date & Time: Wednesday 28 February 2007, 14:00 - 15:00
- đ Venue: TCM Seminar Room, Cavendish Laboratory, Department of Physics
Abstract
Nucleotide bases in DNA are usually assumed to evolve independently. However, within a genome each of the 16 dinucleotide pairs tends to be consistently either commoner or rarer than expected, to the point where together they constitute taxon-specific genomic signatures. Here we extend these observations to explore gapped dinucleotide motifs (GDMs): dinucleotide pairs with 0-9 intervening bases. As with simple dinucleotides, GDMs define signatures that are highly consistent across a genome. Trees constructed from matrices of GDM similarity suggest that phylogenetic signal exists but can sometimes be swamped by convergence. To find the likely basis of GDM signatures we used a principal component analysis to summarise GDM profiles and show that the dominant predictors of both the strength and nature of the patterning are the %AT composition of the genome and the optimal growth temperature of the organism, particularly among prokaryotes. For eukaryotes, other factors such as genome size and chromosome number exert a minor but significant influence. Dividing eukaryotic sequences into intronic and exonic sequences reveals a striking difference in GDM patterns, exons being dominated by 3-base periodicities that are largely absent from introns, and from any of the whole genome sequence profiles. By implication, GDMs have potential to be used to identify anonymous DNA fragments, either to taxon among species, or perhaps even by function within a species. GDMs also illuminate constraints leading to convergent evolution. Either way, the strength of the GDM profiles suggest that DNA evolution is less random than is often assumed.
Series This talk is part of the Inference Group series.
Included in Lists
- All Cavendish Laboratory Seminars
- All Talks (aka the CURE list)
- Biology
- Cambridge Neuroscience Seminars
- Cambridge talks
- Centre for Health Leadership and Enterprise
- Chris Davis' list
- dh539
- dh539
- Featured lists
- Guy Emerson's list
- Hanchen DaDaDash
- Inference Group
- Inference Group Summary
- Interested Talks
- Joint Machine Learning Seminars
- Life Science
- Life Sciences
- Machine Learning Summary
- ME Seminar
- ML
- Neurons, Fake News, DNA and your iPhone: The Mathematics of Information
- Neuroscience
- Neuroscience Seminars
- Neuroscience Seminars
- Required lists for MLG
- rp587
- School of Physical Sciences
- Stem Cells & Regenerative Medicine
- TCM Seminar Room, Cavendish Laboratory, Department of Physics
- Thin Film Magnetic Talks
- yk373's list
Note: Ex-directory lists are not shown.
![[Talks.cam]](/static/images/talkslogosmall.gif)


Wednesday 28 February 2007, 14:00-15:00