A. Ribosomal RNA analysis
StructRNAfinder - predicts and annotates RNA families in transcript or genome sequences. This single tool not only displays the sequence/structural consensus alignments for each RNA family, according to Rfam database but also provides a taxonomic overview for each assigned functional RNA. (Reference: Arias-Carrasco R et al. (2018) 19: 55).
Silva - rRNA database project (Max Planck Institute for Marine Microbiology, Bremen, Germany ) - provides comprehensive, quality checked and regularly updated datasets of aligned small (16S/18S, SSU) and large subunit (23S/28S, LSU) ribosomal RNA (rRNA) sequences for all three domains of life (Bacteria, Archaea and Eukarya).
B. Transfer RNAs (tRNA) - for additional information see the Genomic tRNA Database GtRNAdb (Reference: Chan PP & Lowe TM (2016) Nucleic Acids Res; 44(D1): D184-189).or Transfer RNA database tRNAdb (Reference: Jühling F et al. (2009) Nucleic Acids Res; 37(Database issue): D159-162); or tRNADB-CE (tRNA Gene DataBase Curated by Experts) - (Reference: Abe T et al. (2014) Front Genetics 5: 114).
tRNAs: tRNAscan-SE- (Univerisity of California, Santa Cruz, U.S.A,) is incredibly sensitive & also provides secondary structure diagrams of the tRNA molecules (Reference: Lowe, TM, & Eddy, SR. Nucleic Acids Res. 1997. 25: 955-964). tRNAscan-SE 2.0 can also be accessed here. Alternatively use ARAGORN (Reference: Laslett, D. & Canback. 2004. Nucleic Acids Research 32:11-16). Or tRNAfinder (Reference: Kinouchi M & Kurokawa K (2006) J Computer Aided Chem 7: 116-126).
Rfam - The Rfam database is a collection of RNA families, each represented by multiple sequence alignments, consensus secondary structures and covariance models (Reference: Gardner, P.P. et al. 2008. Nucl. Acids Res. 37, Database issue D136-D140)
C. Micro RNAs (miRNAs) are small, non-coding RNA (~20-22 nucleotides) that negatively regulate gene expression at post-transcriptional level. You might want to start with miRGator 3.0 (Reference: Cho S et al. (2013) Nucleic Acids Res 41(Database issue): D252-257).
mirTools 2.0 - is an updated version of mirTools 1.0, which includes the following new features. (1) From miRNA discovery in mirTools 1.0, mirTools 2.0 allows users to detect and profile various types of ncRNAs, such as miRNA, tRNA, snRNA, snoRNA, rRNA, and piRNA. (2) From miRNA profiling in mirTools 1.0, mirTools 2.0 allows users to identify miRNA-targeted genes and performs detailed functional annotation of miRNA targets, including Gene Ontology, KEGG pathway and protein-protein interaction. (3) From comparison of two samples for differentially expressed miRNAs in mirTools 1.0, mirTools 2.0 allows users to detect differentially expressed ncRNAs between two experimental groups or among multiple samples. (4) Other significant improvements include strategies used to detect novel miRNAs and piRNAs, more taxonomy categories to discover more known miRNAs and a stand-alone version of mirTools 2.0. (Reference: Wu J et al. (2013) RNA Biol; 10(7): 1087-92).
miRDB - employs an improved algorithm for miRNA target prediction, we now present updated transcriptome-wide target prediction data in miRDB, including 3.5 million predicted targets regulated by 7000 miRNAs in five species. Further, we have implemented the new prediction algorithm into a web server, allowing custom target prediction with user-provided sequences. Another new database feature is the prediction of cell-specific miRNA targets. miRDB now hosts the expression profiles of over 1000 cell lines and presents target prediction data that are tailored for specific cell models. (Reference: Chen Y & Wang X (2020) Nucleic Acids Res 48(D1): D127-D131).
miR-BAG predict miRNAs from the genomic sequences as well as from Next Generation Sequencing data. It applies a bootstrap aggregating approach to create an ensemble of three different approaches (naïve Bayes, Best First Decision tree and SVM) to achieve a high accuracy. At present miR-BAG includes 6 different species, 4 for animals (Homo sapiens, Canis familiaris, Mus musculus, Rattus norvegicus) alongwith one nematode (Caenorhabditis elegans) and one insect species (Drosophila melanogaster). miR-BAG was found to perform consistently with accuracy level higher than 90% for several species.(Reference: Jha, A. et al. 2012. PLoS ONE 7(9): e45782.)
Small nucleolar RNAs (snoRNAs) - can be detected with Snoscan for methylation-guide for snoRNAs and snoGPS for pseudouridylation-guide snoRNAs (Reference: P. Schattner et al. 2005. Nucl. Acids Res. 33: W686-W689). Test sequences.
sRNAtoolbox - is an integrated collection of small RNA research tools. Includes: sRNAbench: Expression profiling of small RNAs and prediction of novel microRNAs from deep sequencing data; sRNAde: Differential expression analysis; sRNAblast: Blast analysis of deep sequencing reads against a local nt/nr (NCBI link) database.(Reference: A. Rueda et al. 2015. Nucl. Acids Res. 43 (W1): W467-W473).
RNAstructure (webservers for RNA secondary structure prediction) is a software package that includes structure prediction by free energy minimization, prediction of base pairing probabilities, prediction of structures composed of highly probably base pairs, and prediction of structures with pseudoknots. (Reference: Xu ZZ & Mathews DH (2016) Methods Mol Biol; 1490: 15-34).
LocARNA - Multiple Alignment of RNAs - is a tool for multiple alignment of RNA molecules. LocARNA requires only RNA sequences as input and will simultaneously fold and align the input sequences. LocARNA outputs a multiple alignment together with a consensus structure. For the folding it makes use of a very realistic energy model for RNAs as it is by RNAfold of the Vienna RNA package (or Zuker's mfold). For the alignment it features RIBOSUM-like similarity scoring and realistic gap cost. (Reference: C. Smith et al. 2010. Nucl. Acids Res. 38: W373-377).
CARNA is a tool for multiple alignment of RNA molecules. CARNA requires only the RNA sequences as input and will compute base pair probability matrices and align the sequences based on their full ensembles of structures. Alternatively, you can also provide base pair probability matrices (dot plots in .ps format) or fixed structures (as annotation in the FASTA alignment) for your sequences. If you provide fixed structures, only those structures and not the entire ensemble of possible structures is aligned. In contrast to LocARNA, CARNA does not pick the most likely consensus structure, but computes the alignment that fits best to all likely structures simultaneously. Hence, CARNA is particularly useful when aligning RNAs like riboswitches, which have more than one stable structure. (Reference: A. Dragos et al. 2012. Nucleic Acids Reseach 40: W49-W53).
Foldalign - folds and aligns RNA structures (make a foldalignment) based on a lightweight energy model and sequence similarity. The current version makes pairwise fold alignments. (Reference: J. H. Havgaard et al. (2007) J. PLOS computational biology. 3: e193).
For RNA folding use MFold - N.B. The data can be presented in a number of graphic formats. This is my "go to" site if I'm interested in a secondardy structure for a fragment of RNA or DNA (Reference: M. Zuker. 2003. Nucleic Acids Res. 31: 3406-3415).
Vienna RNA secondary structure prediction (University of Vienna, Austria). I have found this site useful for drawing tRNAs in cloverleaf format.
CONTRAfold is a novel secondary structure prediction method based on conditional log-linear models, a flexible class of probabilistic models which generalize upon stochastic context-free grammars by using discriminative training and feature-rich scoring. By incorporating most of the features found in typical thermodynamic models, CONTRAfold achieves the highest single sequence prediction accuracies to date, outperforming currently available probabilistic and physics-based techniques. It provides MARNA-like output couples with hairpin structures (Reference: Do, C.B. et al. 2006. Bioinformatics 22: e90-e98).
Web-Beagle: a web server for the pairwise global or local alignment of RNA secondary structures. (Reference: E. Mattei et al. 2015. Nucl. Acids Res. 43 (W1): W493-W497).
Rclick - this web server that is capable of superimposing RNA 3D structures by using clique matching and 3D least-squares fitting. Rclick has been benchmarked and compared with other popular servers and methods for RNA structural alignments. In most cases, Rclick alignments were better in terms of structure overlap. It also recognizes conformational changes between structures. (References: Nguyen MN, & Verma C. 2015. Bioinformatics 31:966-968).
pKiss - is the successor of pknotsRG, the first pseudoknot class is the canonical simple recursive pseudoknot from pknotsRG. The new class are canonical simple recursive kissing hairpins. (Reference: Janssen, S. & Giegerich, R. Bioinformatics, 2015; 31(3):423-5).
vsfold5 - RNA Pseudoknot Prediction ServerGCGGCCAGCUCCAGGCCGCCAAACAAUAUGGAGCAC ((((((..[[[[[)))))).........]]]]]...
KineFold Web Server - RNA/DNA folding predictions including pseudoknots and entangled helices (Reference: A. Xayaphoummine et al. (2005) Nucleic Acid Res. 33: 605-610).
IPknot: IP-based prediction of RNA pseudoKNOTs - rovides services for predicting RNA secondary structures including a wide class of pseudoknots. IPknot can also predict the consensus secondary structure when a multiple alignment of RNA sequences is given. (Reference: K. Sato et al. (2011) Bioinformatics, 27: i85-i93).
HotKnots - Predict RNA secondary structures with pseudoknots prediction (Reference: Ren, J. et al. 2005. RNA 11: 1494-1504).
RNAstructure - Predict a Secondary Structure Web Server - combines many separate prediction and analysis algorithms: calculating a partition function, predicting a maximum free energy (MFE) structure, finding structures with maximum expected accuracy, and pseudoknot prediction. This server takes a sequence, either RNA or DNA, and creates a highly probable, probability annotated group of secondary structures, starting with the lowest free energy structure and including others with varied probabilities of correctness.
K2N: a service to get from knotted to nested RNA structures. This site provides access to a variety of methods for pseudoknot removal. (Reference: S. Smit et al. (2008) RNA 14(3):410-416).
Promoters, terminators and other regulatory elements:
Virtual Footprint - offers two types of analyses (a) Regulon Analysis - analysis of a whole prokaryotic genome with one regulator pattern and (b) Promoter analysis - Analysis of a promoter region with several regulator patterns (Reference: R.
Münch et al. 2005.Bioinformatics 21: 4187-4189).
ARNold - finds rho-independent terminators in nucleic acid sequences using two complementary programs, Erpin and RNAmotif. The program colors the terminator stem and loop (References: Gautheret D, Lambert A. 2001. J Mol Biol. 313:1003–11 & Macke T. et al. 2001. Nucleic Acids Res. 29:4724–4735 ).
FindTerm (Softberry Inc.) - is one of only two tools on the internet for mapping rho-independent terminators. You might consider using the advanced feature options and minimally increase the default energy threshold to -12.0.
Riboswitches are structured noncoding RNA domains used by many bacteria to monitor the concentrations of target ligands and regulate gene expression accordingly (See: Discovering riboswitches: the past and the future and Parallel Discovery Strategies Provide a Basis for Riboswitch Ligand Design). They can be discovered using Riboswitch Scanner (Reference: Mukherjee, S. and Sengupta, S. (2016) Bioinformatics 32 (5): 776-778) or Riboswitch Finder (Reference: Nucleic Acids Res. 32(Web Server issue): W154-W159) or RNAProbe (Reference: Wirecki TK et al. (2020) Nucleic Acids Res. 48(Issue W1): W292–W299).
SDS (siRNA Design Software) - compares existing design tools, including those listed above. They also attempt to improve the MPI principles and existing tools by an algorithm that can filter ineffective siRNAs. The algorithm is based on some new observations on the secondary structure. (Reference: S. M. Yiu et al. (2004) Bioinformatics 21: 144-151).
ARTS (Alignment of RNA Tertiary Structures) - aligns two nucleic acid structures (RNAs or DNAs) in pdb format and detecting apriori unknown common substructures. The identified common substructures can be either large global folds or small local tertiary motifs with at least two successive base pairs. (Reference: O. Dror et al. 2005. Bioinformatics 21 (Suppl 2):ii47-ii53)
CopraRNA is a tool for sRNA target prediction. It computes whole genome predictions by combination of distinct whole genome IntaRNA predictions (Reference: P.R. Wright et al. 2014. Nucl. Acids Res. 42 (W1), W119-W123).
OligoWalk - calculates thermodynamic features of sense-antisense hybidization. It predicts the free energy changes of oligonucleotides binding to a target RNA. It can be used to design efficient siRNA targeting a given mRNA sequence. (Reference: Lu, Z.J. & Mathews, D.H. 2008. Nucleic Acids Res.36: 640-647).
CRISPR sgRNA Design Tool - GenScript is proud to offer free online access to our gRNA sequence design tool, developed by the Broad Institute of Harvard and MIT. Our gRNA design tool will identify single guide RNAs for use with wild-type S. pyogenes Cas9 for any DNA sequence you input. Start your gRNA design project by entering a sequence up to 250bp in length below.
Rtools (Computational Biology Research Consortium , Japan) - is a bioinformatics web Server for RNA (single FASTA format, <= 400nt) - In order to visualize the whole picture of the distribution of the secondary structure, this web-server provides users with rich information of single RNA sequences using 7 tools: (a) CentroidFold based on a generalized centroid estimator is one of the most accurate tools for predicting RNA secondary structures. (b) CentroidHomfold predicts RNA secondary structures by employing automatically collected homologous sequences of the target. (c) IPknot predicts RNA secondary structures including a wide class of pseudoknots. (d) Rchange computes entropy and internal energy changes of secondary structures for single-point mutated sequences. (e) CapR calculates probabilities that each RNA base position is located within each secondary structural context for long RNA sequences. (f) Raccess computes the accessibility of segment [a, b] = [x, x+l-1] in the transcript for all the positions x with fixed length l (Acc.len) = 5, 10, 20. (g) RintD validates RNA secondary structures. Target secondary structures are predicted by CentroidFold (inference engine: McCaskill) and RNAfold (Minimum free energy structure).