Data from: High-throughput genotyping for species identification and diversity assessment in germplasm collections

  • Annaliese S. Mason (University of Queensland) (Creator)
  • Jingjing Zhang (Huazhong Agricultural University) (Creator)
  • Reece Tollenaere (Creator)
  • Paula Vasquez-Teuber (Creator)
  • Jessica Dalton-Morgan (Creator)
  • Liyong Hu (Creator)
  • Guijun Yan (Creator)
  • Dave Edwards (Creator)
  • Robert Redden (Creator)
  • Jacqueline Batley (Creator)



Germplasm collections provide an extremely valuable resource for breeders and researchers. However, misclassification of accessions by species often hinders the effective use of these collections. We propose that use of high-throughput genotyping tools can provide a fast, efficient and cost-effective way of confirming species in germplasm collections, as well as providing valuable genetic diversity data. We genotyped 180 Brassicaceae samples sourced from the Australian Grains Genebank across the recently released Illumina Infinium Brassica 60K SNP array. Of these, 76 were provided on the basis of suspected misclassification and another 104 were sourced independently from the germplasm collection. Presence of the A- and C-genomes combined with principle components analysis clearly separated Brassica rapa, B. oleracea, B. napus, B. carinata and B. juncea samples into distinct species groups. Several lines were further validated using chromosome counts. Overall, 18% of samples (32/180) were misclassified on the basis of species. Within these 180 samples, 23/76 (30%) supplied on the basis of suspected misclassification were misclassified, and 9/105 (9%) of the samples randomly sourced from the Australian Grains Genebank were misclassified. Surprisingly, several individuals were also found to be the product of interspecific hybridization events. The SNP (single nucleotide polymorphism) array proved effective at confirming species, and provided useful information related to genetic diversity. As similar genomic resources become available for different crops, high-throughput molecular genotyping will offer an efficient and cost-effective method to screen germplasm collections worldwide, facilitating more effective use of these valuable resources by breeders and researchers.,Brassica SNP array data for species identificationSNP molecular genotyping data and marker information for Brassica accessions sourced from the Australian Grains Genebank germplasm collection.Supporting Information3_Supplementary Table 2.xlsx,
Date made available27 Jan 2015
Date of data production27 Jan 2015

Cite this