Bias in resistance gene prediction due to repeat masking

Research output: Contribution to journal › Article › peer-review


Abstract

Several recently published Brassicaceae genome annotations show strong differences in resistance (R)-gene content. We believe that this is caused by different approaches to repeat masking. Here we show that some of the repeats stored in public databases used for repeat masking carry pieces of predicted R-gene-related domains, and demonstrate that at least some of the variance in R-gene content in recent genome annotations is caused by using these repeats for repeat masking. We also show that other classes of genes are less affected by this phenomenon, and estimate that 0 to 4.6% of predicted R genes are false positives, that is, transposons carrying R-gene domains. These results may partially explain why the number of published novel R genes has decreased in recent years, which has implications for plant breeding, especially as pathogens change in response to climate change.

Original language: English
Pages (from-to): 762-765
Number of pages: 4
Journal: Nature Plants
Volume: 4
Issue number: 10
Publication status: Published - 1 Oct 2018
