Mining for single nucleotide polymorphisms and insertions/deletions in maize expressed sequence tag data

Jacqueline Batley, G Barker, H O'Sullivan, KJ Edwards, D Edwards

    Research output: Contribution to journalArticlepeer-review

    202 Citations (Scopus)

    Abstract

    We have developed a computer based method to identify candidate single nucleotide polymorphisms (SNPs) and small insertions/deletions from expressed sequence tag data. Using a redundancy-based approach, valid SNPs are distinguished from erroneous sequence by their representation multiple times in an alignment of sequence reads. A second measure of validity was also calculated based on the cosegregation of the SNP pattern between multiple SNP loci in an alignment. The utility of this method was demonstrated by applying it to 102,551 maize (Zea mays) expressed sequence tag sequences. A total of 14,832 candidate polymorphisms were identified with an SNP redundancy score of two or greater. Segregation of these SNPs with haplotype indicates that candidate SNPs with high redundancy and cosegregation confidence scores are likely to represent true SNPs. This was confirmed by validation of 264 candidate SNPs from 27 loci, with a range of redundancy and cosegregation scores, in four inbred maize lines. The SNP transition/transversion ratio and insertion/deletion size frequencies correspond to those observed by direct sequencing methods of SNP discovery and suggest that the majority of predicted SNPs and insertion/deletions identified using this approach represent true genetic variation in maize.

    Original languageEnglish
    Pages (from-to)84-91
    JournalPlant Physiology
    Volume132
    Issue number1
    DOIs
    Publication statusPublished - 2003

    Fingerprint

    Dive into the research topics of 'Mining for single nucleotide polymorphisms and insertions/deletions in maize expressed sequence tag data'. Together they form a unique fingerprint.

    Cite this