Abstract
Many non-protein-coding RNAs are being discovered each year. At first we know them only by their sequences in a few organisms, but to understand their function and interactions, we need to understand what 3D structures they may form, in whole or in part. Many hairpin and internal loops are known to form recurrent structured 3D motifs, for example, kink turn and sarcin-ricin internal loops, and GNRA and T-loop hairpin loops. A new non-protein-coding RNA may have one or more known structured 3D loop motifs. Here we introduce a tool which identifies loops in Rfam seed alignments that match known 3D loop motifs and makes those identifications easily accessible. JAR3D was developed to map sequences of hairpin and internal loops to known 3D motifs, and was extended for this work to three-way and four-way junction motifs. We applied JAR3D to 4166 Rfam seed alignments from Rfam 15.0 and made the results accessible on the JAR3D web page, making it easy to evaluate the possible matches for each loop in each Rfam family. We provide several examples which validate JAR3D's ability to identify the correct loop motif, using 3D structures of RNAs outside of the training set. We created a page to search for instances of a particular loop motif across all Rfam families, to study how widespread the occurrence of each motif is. We provide statistics on how many Rfam loops appear to match well to a known 3D motif. Match rates are much higher for internal loops than for hairpins or multihelix junctions.