A novel taxon selection method, aimed at minimizing recombination, clarifies the discovery of a new sub-population of Helicobacter pylori from Australia

Research output: Contribution to journalArticle

Abstract

We present a novel method for taxon selection, the aim being to minimize problems arising from highly recombinant species such as Helicobacter pylori. Helicobacter pylori has accompanied modern-human migration out of Africa and is marked by a phylogeographic strain distribution, which has been exploited to add an extra layer of information about human migrations to that obtained from human sources. However, H. pylori's genome has high sequence heterogeneity combined with a very high rate of recombination, causing major allelic diversification across strains. On the other hand, recombination events that have become preserved in sub-populations are a useful source of phylogenetic information. This creates a potential problem in selecting representative strains for particular genetic or phylogeographic clusters and generally ameliorating the impact on analyses of extensive low-level recombination. To address this issue, we perform multiple population structure-based analyses on core genomes to select exemplar strains, called 'quintessents', which exhibit limited recombination. In essence, quintessent strains are representative of their specific phylogenetic clades and can be used to refine the current MLST concatenation-based population structure classification system. The use of quintessents reduces the noise due to local recombination events, while preserving recombination events that have become fixed in sub-populations. We illustrate the method with an analysis of core genome concatenations from 185 H. pylori strains, which reveals a recent speciation event resulting from the recombination of strains from phylogeographic clade hpSahul, carried by Aboriginal Australians, and hpEurope, carried by some of the people who arrived in Australia over the past 200 years. The signal is much clearer when based on quintessent strains, but absent from the analysis based on MLST concatenations.

Original languageEnglish
Number of pages12
JournalEvolutionary Applications
DOIs
Publication statusE-pub ahead of print - 18 Sep 2019

Cite this

@article{85c176888bea4abb8c81f7d35b305f33,
title = "A novel taxon selection method, aimed at minimizing recombination, clarifies the discovery of a new sub-population of Helicobacter pylori from Australia",
abstract = "We present a novel method for taxon selection, the aim being to minimize problems arising from highly recombinant species such as Helicobacter pylori. Helicobacter pylori has accompanied modern-human migration out of Africa and is marked by a phylogeographic strain distribution, which has been exploited to add an extra layer of information about human migrations to that obtained from human sources. However, H. pylori's genome has high sequence heterogeneity combined with a very high rate of recombination, causing major allelic diversification across strains. On the other hand, recombination events that have become preserved in sub-populations are a useful source of phylogenetic information. This creates a potential problem in selecting representative strains for particular genetic or phylogeographic clusters and generally ameliorating the impact on analyses of extensive low-level recombination. To address this issue, we perform multiple population structure-based analyses on core genomes to select exemplar strains, called 'quintessents', which exhibit limited recombination. In essence, quintessent strains are representative of their specific phylogenetic clades and can be used to refine the current MLST concatenation-based population structure classification system. The use of quintessents reduces the noise due to local recombination events, while preserving recombination events that have become fixed in sub-populations. We illustrate the method with an analysis of core genome concatenations from 185 H. pylori strains, which reveals a recent speciation event resulting from the recombination of strains from phylogeographic clade hpSahul, carried by Aboriginal Australians, and hpEurope, carried by some of the people who arrived in Australia over the past 200 years. The signal is much clearer when based on quintessent strains, but absent from the analysis based on MLST concatenations.",
keywords = "core genomes, Helicobacter pylori, phylogeography, recombination, taxon selection, PHYLOGENETIC ANALYSIS, POPULATION-STRUCTURE, GENOME, EVOLUTION, PREVALENCE, INFERENCE, GENES, ASSOCIATION, ADAPTATION, ALGORITHM",
author = "Binit Lamichhane and Wise, {Michael J.} and Chua, {Eng Guan} and Marshall, {Barry J.} and Tay, {Chin Yen}",
year = "2019",
month = "9",
day = "18",
doi = "10.1111/eva.12864",
language = "English",
journal = "Evolutionary Applications",
issn = "1752-4563",
publisher = "Wiley-Blackwell",

}

TY - JOUR

T1 - A novel taxon selection method, aimed at minimizing recombination, clarifies the discovery of a new sub-population of Helicobacter pylori from Australia

AU - Lamichhane, Binit

AU - Wise, Michael J.

AU - Chua, Eng Guan

AU - Marshall, Barry J.

AU - Tay, Chin Yen

PY - 2019/9/18

Y1 - 2019/9/18

N2 - We present a novel method for taxon selection, the aim being to minimize problems arising from highly recombinant species such as Helicobacter pylori. Helicobacter pylori has accompanied modern-human migration out of Africa and is marked by a phylogeographic strain distribution, which has been exploited to add an extra layer of information about human migrations to that obtained from human sources. However, H. pylori's genome has high sequence heterogeneity combined with a very high rate of recombination, causing major allelic diversification across strains. On the other hand, recombination events that have become preserved in sub-populations are a useful source of phylogenetic information. This creates a potential problem in selecting representative strains for particular genetic or phylogeographic clusters and generally ameliorating the impact on analyses of extensive low-level recombination. To address this issue, we perform multiple population structure-based analyses on core genomes to select exemplar strains, called 'quintessents', which exhibit limited recombination. In essence, quintessent strains are representative of their specific phylogenetic clades and can be used to refine the current MLST concatenation-based population structure classification system. The use of quintessents reduces the noise due to local recombination events, while preserving recombination events that have become fixed in sub-populations. We illustrate the method with an analysis of core genome concatenations from 185 H. pylori strains, which reveals a recent speciation event resulting from the recombination of strains from phylogeographic clade hpSahul, carried by Aboriginal Australians, and hpEurope, carried by some of the people who arrived in Australia over the past 200 years. The signal is much clearer when based on quintessent strains, but absent from the analysis based on MLST concatenations.

AB - We present a novel method for taxon selection, the aim being to minimize problems arising from highly recombinant species such as Helicobacter pylori. Helicobacter pylori has accompanied modern-human migration out of Africa and is marked by a phylogeographic strain distribution, which has been exploited to add an extra layer of information about human migrations to that obtained from human sources. However, H. pylori's genome has high sequence heterogeneity combined with a very high rate of recombination, causing major allelic diversification across strains. On the other hand, recombination events that have become preserved in sub-populations are a useful source of phylogenetic information. This creates a potential problem in selecting representative strains for particular genetic or phylogeographic clusters and generally ameliorating the impact on analyses of extensive low-level recombination. To address this issue, we perform multiple population structure-based analyses on core genomes to select exemplar strains, called 'quintessents', which exhibit limited recombination. In essence, quintessent strains are representative of their specific phylogenetic clades and can be used to refine the current MLST concatenation-based population structure classification system. The use of quintessents reduces the noise due to local recombination events, while preserving recombination events that have become fixed in sub-populations. We illustrate the method with an analysis of core genome concatenations from 185 H. pylori strains, which reveals a recent speciation event resulting from the recombination of strains from phylogeographic clade hpSahul, carried by Aboriginal Australians, and hpEurope, carried by some of the people who arrived in Australia over the past 200 years. The signal is much clearer when based on quintessent strains, but absent from the analysis based on MLST concatenations.

KW - core genomes

KW - Helicobacter pylori

KW - phylogeography

KW - recombination

KW - taxon selection

KW - PHYLOGENETIC ANALYSIS

KW - POPULATION-STRUCTURE

KW - GENOME

KW - EVOLUTION

KW - PREVALENCE

KW - INFERENCE

KW - GENES

KW - ASSOCIATION

KW - ADAPTATION

KW - ALGORITHM

U2 - 10.1111/eva.12864

DO - 10.1111/eva.12864

M3 - Article

JO - Evolutionary Applications

JF - Evolutionary Applications

SN - 1752-4563

ER -