A Combinatorial Amino Acid Code for RNA Recognition by Pentatricopeptide Repeat Proteins

A Barkan, M Rojas, Sota Fujii, Aaron Yap, Y.S Chong, Charlie Bond, Ian Small

Research output: Contribution to journalArticle

322 Citations (Scopus)


The pentatricopeptide repeat (PPR) is a helical repeat motif found in an exceptionally large family of RNA–binding proteins that functions in mitochondrial and chloroplast gene expression. PPR proteins harbor between 2 and 30 repeats and typically bind single-stranded RNA in a sequence-specific fashion. However, the basis for sequence-specific RNA recognition by PPR tracts has been unknown. We used computational methods to infer a code for nucleotide recognition involving two amino acids in each repeat, and we validated this model by recoding a PPR protein to bind novel RNA sequences in vitro. Our results show that PPR tracts bind RNA via a modular recognition mechanism that differs from previously described RNA–protein recognition modes and that underpins a natural library of specific protein/RNA partners of unprecedented size and diversity. These findings provide a significant step toward the prediction of native binding sites of the enormous number of PPR proteins found in nature. Furthermore, the extraordinary evolutionary plasticity of the PPR family suggests that the PPR scaffold will be particularly amenable to redesign for new sequence specificities and functions.
Original languageEnglish
Pages (from-to)1-8
JournalPLoS Genetics
Issue number8
Publication statusPublished - 16 Aug 2012

Fingerprint Dive into the research topics of 'A Combinatorial Amino Acid Code for RNA Recognition by Pentatricopeptide Repeat Proteins'. Together they form a unique fingerprint.

Cite this