Improved imputation of low-frequency and rare variants using the UK10K haplotype reference panel

J. Huang, B. Howie, S. Mccarthy, Y. Memari, K. Walter, J.L. Min, P. Danecek, G. Malerba, E. Trabetti, H.-F. Zheng, G. Gambaro, J.B. Richards, R. Durbin, N.J. Timpson, J. Marchini, N. Soranzo, S. Al Turki, A. Amuzu, C.A. Anderson, R. AnneyD. Antony, M.S. Artigas, M. Ayub, S. Bala, J.C. Barrett, I. Barroso, P. Beales, M. Benn, J. Bentham, S. Bhattacharya, E. Birney, D. Blackwood, M. Bobrow, E. Bochukova, P.F. Bolton, R. Bounds, C. Boustred, G. Breen, M. Calissano, K. Carss, J.P. Casas, J.C. Chambers, R. Charlton, K. Chatterjee, L. Chen, A. Ciampi, S. Cirak, P. Clapham, G. Clement, G. Coates, M. Cocca, D.A. Collier, C. Cosgrove, T. Cox, N. Craddock, L. Crooks, S. Curran, D. Curtis, A. Daly, I.N.M. Day, A. Day-Williams, G. Dedoussis, T. Down, Y. Du, C.M. Van Duijn, I. Dunham, S. Edkins, R. Ekong, P. Ellis, D.M. Evans, I.S. Farooqi, D.R. Fitzpatrick, P. Flicek, J. Floyd, A.R. Foley, C.S. Franklin, M. Futema, L. Gallagher, P. Gasparini, T.R. Gaunt, M. Geihs, D. Geschwind, C. Greenwood, H. Griffin, D. Grozeva, X. Guo, H. Gurling, D. Hart, A.E. Hendricks, P. Holmans, L. Huang, T. Hubbard, S.E. Humphries, M.E. Hurles, P. Hysi, V. Iotchkova, A. Isaacs, D.K. Jackson, Y. Jamshidi, J. Johnson, C. Joyce, K.J. Karczewski, J. Kaye, T. Keane, J.P. Kemp, K. Kennedy, A. Kent, J. Keogh, F. Khawaja, M.E. Kleber, M. Van Kogelenberg, A. Kolb-Kokocinski, J.S. Kooner, G. Lachance, C. Langenberg, C. Langford, D. Lawson, I. Lee, E.M. Van Leeuwen, M. Lek, R. Li, Y. Li, J. Liang, H. Lin, R. Liu, J. Lönnqvist, L.R. Lopes, M. Lopes, J. Luan, D.G. Macarthur, M. Mangino, G. Marenne, W. März, J. Maslen, A. Matchan, I. Mathieson, P. Mcguffin, A.M. Mcintosh, A.G. Mckechanie, A. Mcquillin, S. Metrustry, N. Migone, H.M. Mitchison, A. Moayyeri, J. Morris, R. Morris, D. Muddyman, F. Muntoni, B.G. Nordestgaard, K. Northstone, M.C. O'Donovan, S. O'Rahilly, A. Onoufriadis, K. Oualkacha, M.J. Owen, A. Palotie, K. Panoutsopoulou, V. Parker, J.R. Parr, L. Paternoster, T. Paunio, F. Payne, S.J. Payne, J.R.B. Perry, O. Pietilainen, V. Plagnol, R.C. Pollitt, S. Povey, M.A. Quail, L. Quaye, L. Raymond, K. Rehnström, C.K. Ridout, S. Ring, G.R.S. Ritchie, N. Roberts, R.L. Robinson, D.B. Savage, P. Scambler, S. Schiffels, M. Schmidts, N. Schoenmakers, R.H. Scott, R.A. Scott, R.K. Semple, E. Serra, S.I. Sharp, A. Shaw, H.A. Shihab, S.-Y. Shin, D. Skuse, K.S. Small, C. Smee, G.D. Smith, L. Southam, O. Spasic-Boskovic, T.D. Spector, D. St. Clair, B. St. Pourcain, J. Stalker, E. Stevens, J. Sun, G. Surdulescu, J. Suvisaari, P. Syrris, I. Tachmazidou, R. Taylor, J. Tian, M.D. Tobin, D. Toniolo, M. Traglia, A. Tybjaerg-Hansen, A.M. Valdes, A.M. Vandersteen, A. Varbo, P. Vijayarangakannan, P.M. Visscher, L.V. Wain, J.T.R. Walters, G. Wang, J. Wang, Y. Wang, K. Ward, E. Wheeler, P. Whincup, T. Whyte, H.J. Williams, K.A. Williamson, C. Wilson, Scott Wilson, K. Wong, C. Xu, J. Yang, G. Zaza, E. Zeggini, F. Zhang, P. Zhang, W. Zhang

Research output: Contribution to journalArticlepeer-review

235 Citations (Scopus)

Abstract

© 2015 Macmillan Publishers Limited. All rights reserved. Imputing genotypes from reference panels created by whole-genome sequencing (WGS) provides a cost-effective strategy for augmenting the single-nucleotide polymorphism (SNP) content of genome-wide arrays. The UK10K Cohorts project has generated a data set of 3,781 whole genomes sequenced at low depth (average 7x), aiming to exhaustively characterize genetic variation down to 0.1% minor allele frequency in the British population. Here we demonstrate the value of this resource for improving imputation accuracy at rare and low-frequency variants in both a UK and an Italian population. We show that large increases in imputation accuracy can be achieved by re-phasing WGS reference panels after initial genotype calling. We also present a method for combining WGS panels to improve variant coverage and downstream imputation accuracy, which we illustrate by integrating 7,562 WGS haplotypes from the UK10K project with 2,184 haplotypes from the 1000 Genomes Project. Finally, we introduce a novel approximation that maintains speed without sacrificing imputation accuracy for rare variants.
Original languageEnglish
Pages (from-to)1-9
JournalNature Communications
Volume6
DOIs
Publication statusPublished - 2015

Fingerprint

Dive into the research topics of 'Improved imputation of low-frequency and rare variants using the UK10K haplotype reference panel'. Together they form a unique fingerprint.

Cite this