This Title All WIREs
How to cite this WIREs title:
Impact Factor: 4.928

Finding the target sites of RNA‐binding proteins

Full article on Wiley Online Library:   HTML PDF

Can't access this content? Tell your librarian.

Abstract RNA–protein interactions differ from DNA–protein interactions because of the central role of RNA secondary structure. Some RNA‐binding domains (RBDs) recognize their target sites mainly by their shape and geometry and others are sequence‐specific but are sensitive to secondary structure context. A number of small‐ and large‐scale experimental approaches have been developed to measure RNAs associated in vitro and in vivo with RNA‐binding proteins (RBPs). Generalizing outside of the experimental conditions tested by these assays requires computational motif finding. Often RBP motif finding is done by adapting DNA motif finding methods; but modeling secondary structure context leads to better recovery of RBP‐binding preferences. Genome‐wide assessment of mRNA secondary structure has recently become possible, but these data must be combined with computational predictions of secondary structure before they add value in predicting in vivo binding. There are two main approaches to incorporating structural information into motif models: supplementing primary sequence motif models with preferred secondary structure contexts (e.g., MEMERIS and RNAcontext) and directly modeling secondary structure recognized by the RBP using stochastic context‐free grammars (e.g., CMfinder and RNApromo). The former better reconstruct known binding preferences for sequence‐specific RBPs but are not suitable for modeling RBPs that recognize shape and geometry of RNAs. Future work in RBP motif finding should incorporate interactions between multiple RBDs and multiple RBPs in binding to RNA. WIREs RNA 2014, 5:111–130. doi: 10.1002/wrna.1201 This article is categorized under: RNA Evolution and Genomics > Computational Analyses of RNA RNA Interactions with Proteins and Other Molecules > Protein–RNA Recognition RNA Methods > RNA Analyses In Vitro and In Silico

This WIREs title offers downloadable PowerPoint presentations of figures for non-profit, educational use, provided the content is not modified and full credit is given to the author and publication.

Download a PowerPoint presentation of all images

Three‐dimensional structures of RNA‐binding domain (RBD)–RNA complexes. (a) Solution structure of polypyrimidine tract binding (PTB) protein RBD1 in complex with CUCUCU RNA [Protein Data Bank (PDB): 2AD9]. PTB RBD1 binds a YCU site (Y indicating pyrimidine) through β4, β1, and β2, respectively. (b) Co‐crystal structure of the PUM‐homology domain (PUM‐HD) in human Pum1 complexed with a 10‐nucleotide single‐stranded RNA, 5′‐AUUGUACAUA where the last eight nucleotides (UGUACAUA) are individually recognized by three conserved amino acids in Puf repeats 8 to 1, respectively (PDB: 1M8Y). (c) Solution structure of the Vts1p sterile‐α motif (specific affinity matrix, SAM) domain in complex with a 5′‐CUGGC‐3′ pentaloop as part of a 19nt hairpin (PDB: 2ESE). The specific interaction between the Vts1p SAM domain and the target RNA is stabilized by both the direct interaction to the third guanosine base in the RNA pentaloop and the contacts to the unique backbone structure. (d) Solution structure of dsRBD of yeast Rnt1p in complex with the 5′ terminal AGNN tetraloop of snR47 precursor RNA (PDB: 1T4l). Neither A nor G are recognized by specific hydrogen bonds; instead, the N‐terminal helix of the Rnt1p dsRBD interacts with the backbone and the two nonconserved tetraloop bases, by snugly fitting into the minor groove side of the RNA tetraloop and extending into the minor groove at the top of the stem.
[ Normal View | Magnified View ]
Three‐dimensional structures of multiple RNA‐binding domains (RBDs) in complex with RNA. (a) Solution structure of polypyrimidine tract binding (PTB), RBD3, and RBD4 in complex with CUCUCU RNA [Protein Data Bank (PDB): 2ADC]. RBD3 and RBD4 have different binding specificity: RBD3 binds YCUNN and RBD4 binds YCN (Y, pyrimidine; N, any nucleotide). RBD3 and RBD4 interact extensively, resulting in an antiparallel orientation of their bound RNAs, suggesting that the only way to make these two RBDs bind to a single RNA is to separate their sites by a linker sequence. (b) Solution structure of ADAR2 dsRBD1 and dsRBD2 in complex with GluR‐2 R/G RNA (PDB: 2L3J). The dsRBDs recognize their targets by the shape and by the primary sequence in the minor groove. Sequence‐specific recognition is achieved through a hydrogen bond to the amino group of G (in the GG mismatch for dsRBD1; in the GC pair for dsRBD2) via a β1‐β2 loop and via a hydrophobic contact to adenine H2 (in the AU pair for dsRBD1; in the AC mismatch for dsRBD2) via helix α1. The two dsRBDs bind one face of the RNA and cover about 120° of the turn of the RNA helix.
[ Normal View | Magnified View ]
RNAcontext‐predicted motifs. The figure shows motifs and their structural contexts predicted by RNAcontext using RNAcompete binding data. (Reprinted with permission from Ref . Copyright 2010, PLoS Computational Biology Creative Commons Attribution License.)
[ Normal View | Magnified View ]
Comparison of prediction accuracy for in vivo binding of nine yeast RNA‐binding proteins (RBPs) using parallel analysis of RNA structure (PARS) and RNAplfold to estimate the secondary structure of bound versus unbound transcripts. The results using PARS are shown on the y‐axis, those using RNAplfold on the x‐axis. (a) The analysis was performed on all consensus sites containing at least one nucleotide with a nonzero PARS score. (b) The analysis was performed only considering nucleotides with nonzero PARS score. (c) As for (b) but with the additional constraint that the transcript load (i.e., reads/nucleotide) was at least five. P‐values were calculated using the two‐tailed sign test.
[ Normal View | Magnified View ]
Target site accessibility predicts in vivo binding for a diverse range of RNA‐binding proteins (RBPs). Comparison of accuracy in predicting bound transcripts based on a given consensus, using either #ATS (i.e., the expected number of accessible target sites, y‐axis) or #TS (i.e., the number of target sites, x‐axis). Each dot represents the results of an RBP coupled with its previously defined consensus sequence. If there are multiple reported consensus sequences for a protein, the result for each is shown and is distinguished from others by a superscript. Cartoons indicate the species of origin (yeast, fly, or human). RBPs in bold have significantly improved AUROC for #ATS versus #TS (P < 0.05, Delong‐Delong‐Clarke‐Pearson test). The RBDs housed in the RBPs (using SMART domains) are summarized in the pie graph.
[ Normal View | Magnified View ]
Structural context of target sites improves prediction of target mRNAs bound in vivo by RNA‐binding proteins (RBPs). Bar graphs compare the accuracy of different methods that use the structural context of motif matches to predict in vivo binding of RBPs. The inset describes the different bars within the graph.
[ Normal View | Magnified View ]

Browse by Topic

RNA Evolution and Genomics > Computational Analyses of RNA
RNA Interactions with Proteins and Other Molecules > Protein–RNA Recognition
RNA Methods > RNA Analyses In Vitro and In Silico

Access to this WIREs title is by subscription only.

Recommend to Your
Librarian Now!

The latest WIREs articles in your inbox

Sign Up for Article Alerts