Jump to content

Bioinformatics discovery of non-coding RNAs

From Wikipedia, the free encyclopedia
This is an old revision of this page, as edited by Zashaw (talk | contribs) at 08:30, 31 July 2018 (save intermediate text). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

Non-coding RNAs have been discovered using both experimental and bioinformatic approaches. Bioinformatic approaches can be divided into three main categories. The first involves homology search, although these techniques are by definition unable to find new classes of ncRNAs. The second category includes algorithms designed to discover specific types of ncRNAs that have similar properties. Finally, some discovery methods are based on very general properties of RNA, and are thus able to discovery entirely new kinds of ncRNAs.

Homology search refers to the process of searching a sequence database for RNAs that are similar to already known RNA sequences. Any algorithm that is designed for homology search of nucleic acid sequences can be used, e.g., BLAST[1]. However, such algorithms typically are not as sensitive or accurate as algorithms specifically designed for RNA.

Of particular importance for RNA is its conservation of a secondary structure, which can be modeled to achieve additional accuracy in searches. For example, Covariance models[2] can be viewed as an extension to a profile hidden Markov model that also reflects conserved secondary structure. Covariance models are implemented in the Infernal software package.[3]

Discovery of specific types of ncRNAs

Some types of RNAs have shared properties that algorithms can exploit. For example, tRNAscan-SE[4] is specialized to finding tRNAs. The heart of this program is a tRNA homology search based on covariance models, but other tRNA-specific search programs are used to accelerate searches.

The properties of snoRNAs have enabled the development of programs to detect new examples of snoRNAs, including those that might be only distantly related to previously known examples. Computer programs implementing such approaches include snoscan[5] and snoReport.[6]

Similarly, several algorithms have been developed to detect microRNAs. Examples include

Discovery by general properties

References

  1. ^ Altschul SF, Madden TL, Schäffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ (September 1997). "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs". Nucleic Acids Res. 25 (17): 3389–402. PMC 146917. PMID 9254694.
  2. ^ Eddy SR, Durbin R (June 1994). "RNA sequence analysis using covariance models". Nucleic Acids Res. 22 (11): 2079–88. PMC 308124. PMID 8029015.
  3. ^ Nawrocki EP, Eddy SR (November 2013). "Infernal 1.1: 100-fold faster RNA homology searches". Bioinformatics. 29 (22): 2933–5. doi:10.1093/bioinformatics/btt509. PMC 3810854. PMID 24008419.
  4. ^ Lowe TM, Eddy SR (March 1997). "tRNAscan-SE: a program for improved detection of transfer RNA genes in genomic sequence". Nucleic Acids Res. 25 (5): 955–64. PMC 146525. PMID 9023104.
  5. ^ Lowe TM, Eddy SR (February 1999). "A computational screen for methylation guide snoRNAs in yeast". Science. 283 (5405): 1168–71. PMID 10024243.
  6. ^ Hertel J, Hofacker IL, Stadler PF (January 2008). "SnoReport: computational identification of snoRNAs with unknown targets". Bioinformatics. 24 (2): 158–64. doi:10.1093/bioinformatics/btm464. PMID 17895272.