University of Cambridge > Talks.cam > Isaac Newton Institute Seminar Series > What the 1000 genomes project tells us about systematic bias and batch effects in sec-gen data

What the 1000 genomes project tells us about systematic bias and batch effects in sec-gen data

Add to your list(s) Download to your calendar using vCal

If you have a question about this talk, please contact Mustapha Amrani.

Statistical Challenges Arising from Genome Resequencing

First, we will report findings on systematic biases in this technology gleaned from analysis of a single chromosome from the 1000 Genomes Project Data. While it is known that coverage of whole-genome re-sequencing data is not uniform, its variation is highly correlated across multiple samples in unrelated populations with a strong platform and date effects. We will describe how some of these systematic biases affect SNP and CNV calls. Second, we will describe our first steps towards statistical solutions for these problems. Our approach is based on specific genome features that are highly correlated to differences in coverage in genome regions. Note that this work is preliminary and that the technology is moving fast. So by the time I give this talk, in 3 months, it might have nothing to do with this abstract.

This talk is part of the Isaac Newton Institute Seminar Series series.

Tell a friend about this talk:

This talk is included in these lists:

Note that ex-directory lists are not shown.

 

© 2006-2024 Talks.cam, University of Cambridge. Contact Us | Help and Documentation | Privacy and Publicity