TY - JOUR
T1 - Genome-wide variant calling in reanalysis of exome sequencing data uncovered a pathogenic TUBB3 variant
AU - Solve-RD SNV-indel working group
AU - Solve-RD DITF-ITHACA
AU - de Boer, Elke
AU - Yaldiz, Burcu
AU - Denommé-Pichon, Anne Sophie
AU - Matalonga, Leslie
AU - Laurie, Steve
AU - Steyaert, Wouter
AU - de Reuver, Rick
AU - Gilissen, Christian
AU - Kwint, Michael
AU - Pfundt, Rolph
AU - Verloes, Alain
AU - Willemsen, Michèl A.A.P.
AU - de Vries, Bert B.A.
AU - Vitobello, Antonio
AU - Kleefstra, Tjitske
AU - Vissers, Lisenka E.L.M.
AU - Johari, Mridul
N1 - Copyright © 2021 The Authors. Published by Elsevier Masson SAS.. All rights reserved.
PY - 2022/1
Y1 - 2022/1
N2 - Almost half of all individuals affected by intellectual disability (ID) remain undiagnosed. In the Solve-RD project, exome sequencing (ES) datasets from unresolved individuals with (syndromic) ID (n = 1,472 probands) are systematically reanalyzed, starting from raw sequencing files, followed by genome-wide variant calling and new data interpretation. This strategy led to the identification of a disease-causing de novo missense variant in TUBB3 in a girl with severe developmental delay, secondary microcephaly, brain imaging abnormalities, high hypermetropia, strabismus and short stature. Interestingly, the TUBB3 variant could only be identified through reanalysis of ES data using a genome-wide variant calling approach, despite being located in protein coding sequence. More detailed analysis revealed that the position of the variant within exon 5 of TUBB3 was not targeted by the enrichment kit, although consistent high-quality coverage was obtained at this position, resulting from nearby targets that provide off-target coverage. In the initial analysis, variant calling was restricted to the exon targets ± 200 bases, allowing the variant to escape detection by the variant calling algorithm. This phenomenon may potentially occur more often, as we determined that 36 established ID genes have robust off-target coverage in coding sequence. Moreover, within these regions, for 17 genes (likely) pathogenic variants have been identified before. Therefore, this clinical report highlights that, although compute-intensive, performing genome-wide variant calling instead of target-based calling may lead to the detection of diagnostically relevant variants that would otherwise remain unnoticed.
AB - Almost half of all individuals affected by intellectual disability (ID) remain undiagnosed. In the Solve-RD project, exome sequencing (ES) datasets from unresolved individuals with (syndromic) ID (n = 1,472 probands) are systematically reanalyzed, starting from raw sequencing files, followed by genome-wide variant calling and new data interpretation. This strategy led to the identification of a disease-causing de novo missense variant in TUBB3 in a girl with severe developmental delay, secondary microcephaly, brain imaging abnormalities, high hypermetropia, strabismus and short stature. Interestingly, the TUBB3 variant could only be identified through reanalysis of ES data using a genome-wide variant calling approach, despite being located in protein coding sequence. More detailed analysis revealed that the position of the variant within exon 5 of TUBB3 was not targeted by the enrichment kit, although consistent high-quality coverage was obtained at this position, resulting from nearby targets that provide off-target coverage. In the initial analysis, variant calling was restricted to the exon targets ± 200 bases, allowing the variant to escape detection by the variant calling algorithm. This phenomenon may potentially occur more often, as we determined that 36 established ID genes have robust off-target coverage in coding sequence. Moreover, within these regions, for 17 genes (likely) pathogenic variants have been identified before. Therefore, this clinical report highlights that, although compute-intensive, performing genome-wide variant calling instead of target-based calling may lead to the detection of diagnostically relevant variants that would otherwise remain unnoticed.
KW - ERN ITHACA
KW - Exome sequencing
KW - Genome-wide variant calling
KW - Solve-RD
KW - TUBB3
UR - http://www.scopus.com/inward/record.url?scp=85120818105&partnerID=8YFLogxK
U2 - 10.1016/j.ejmg.2021.104402
DO - 10.1016/j.ejmg.2021.104402
M3 - Article
C2 - 34863918
SN - 1769-7212
VL - 65
JO - European Journal of Medical Genetics
JF - European Journal of Medical Genetics
IS - 1
M1 - 104402
ER -