Principal components analysis in tree space
- Speaker: Nye, T (Newcastle)
- Date & Time: Tuesday 21 June 2011, 16:30 - 16:50
- Venue: Seminar Room 1, Newton Institute
Abstract
Phylogenetic analysis commonly gives rise to a collection or sample of inferred evolutionary trees, each differing from the others. There is a need for methods that visualize, compare, and quantify variability in such sets of trees, in terms of both topological and geometrical differences. Standard tools of multivariate analysis such as multi-dimensional scaling and clustering have been applied to sets of trees, but Principal Components Analysis (PCA) cannot be applied directly since the space of evolutionary trees on a fixed set of taxa is not a vector space. I propose a novel geometrical approach to PCA in tree-space that works in an analogous way to standard linear Euclidean PCA. Given a data set of phylogenetic trees, a geodesic path is sought that maximises the variance of the data under a form of projection within tree-space onto the path. Geodesic paths identified in this way reveal and quantify the principal sources of variation in the original collection of trees in terms of both topology and branch lengths, and can be visualized as animations of smoothly changing alternative evolutionary trees. The potential of the approach is illustrated by applying tree-space PCA to experimental data from metazoa and a simulation study of long-branch attraction.
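For readers less familiar with the Euclidean case that the abstract takes as its analogy, the following is a minimal sketch of ordinary PCA phrased as a search: among straight lines through the centroid, find the one that maximises the variance of the orthogonally projected points. It is not the tree-space method from the talk; in tree-space the lines would be replaced by geodesic paths and the projection would have to be defined within tree-space. The function names `projected_variance` and `first_principal_line`, the grid search, and the toy 2D data are all assumptions made for illustration.

```python
import math

def projected_variance(points, angle):
    """Variance of 2D points after orthogonal projection onto the line
    through their centroid with direction (cos(angle), sin(angle))."""
    n = len(points)
    cx = sum(p[0] for p in points) / n
    cy = sum(p[1] for p in points) / n
    ux, uy = math.cos(angle), math.sin(angle)
    # Signed position of each projected point along the line.
    scores = [(x - cx) * ux + (y - cy) * uy for x, y in points]
    return sum(s * s for s in scores) / n

def first_principal_line(points, steps=1800):
    """Grid search over directions in [0, pi) for the largest projected variance.
    (Hypothetical helper; standard PCA would use an eigendecomposition instead.)"""
    angles = [math.pi * k / steps for k in range(steps)]
    best = max(angles, key=lambda a: projected_variance(points, a))
    return best, projected_variance(points, best)

# Toy data lying roughly along the diagonal y = x (illustrative only).
points = [(0.0, 0.1), (1.0, 0.9), (2.0, 2.1), (3.0, 2.9), (4.0, 4.0)]
angle, variance = first_principal_line(points)
print(f"direction angle (radians): {angle:.3f}, projected variance: {variance:.3f}")
```

The explicit search is used here, rather than an eigendecomposition, because it mirrors the abstract's wording: a path is sought that maximises the variance of the projected data, which is the formulation that can carry over to a space that is not a vector space.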
Series
This talk is part of the Isaac Newton Institute Seminar Series.