PHYLOGENY

red_bullet.gif (914 bytes) T-Rex (Tree and reticulogram REConstruction) - is dedicated to the reconstruction of phylogenetic trees, reticulation networks and to the inference of horizontal gene transfer (HGT) events. T-REX includes several popular bioinformatics applications such as MUSCLE, MAFFT, Neighbor Joining, NINJA, BioNJ, PhyML, RAxML, random phylogenetic tree generator and some well-known sequence-to-distance transformation models. It also comprises fast and effective methods for inferring phylogenetic trees from complete and incomplete distance matrices as well as for reconstructing reticulograms and HGT networks (Reference: Alix, C. et al. 2012.  Nucl. Acids Res. 40 (W1): W573-W579).

red_bullet.gif (914 bytes) Phylogeny.fr - is a simple to use web service dedicated to reconstructing and analysing phylogenetic relationships between molecular sequences.It includes multiple alignment (MUSCLE, T-Coffee, ClustalW, ProbCons), phylogeny (PhyML, MrBayes, TNT, BioNJ), tree viewer (Drawgram, Drawtree, ATV) and utility programs (e.g. Gblocks to eliminate poorly aligned positions and divergent regions) (Reference: A. Dereeper  et al,. 2008. Nucl. Acids Res. 36 (Web Server Issue):W465-9).   Also available here.

red_bullet.gif (914 bytes) FastME provides distance algorithms to infer phylogenies. FastME is based on balanced minimum evolution, which is the very principle of NJ. FastME improves over NJ by performing topological moves using fast, sophisticated algorithms. The first version of FastME only included Nearest Neighbor Interchange (NNI). The new 2.0 version also includes Subtree Pruning and Regrafting (SPR), while remaining as fast as NJ and providing a number of facilities: distance estimation for DNA and proteins with various models and options, bootstrapping, and parallel computations. (Reference: Lefort V. et al. Molecular Biology & Evolution 32(10): 2798-800, 2015).

red_bullet.gif (914 bytes) PhyML - has been widely used because of its simplicity and a fair compromise between accuracy and speed. In the meantime research on PhyML has continued, and new algorithms and methods have been implemented in the program.   (Reference: V. Lefort et al. Molecular Biology and Evolution, msx149, 2017).

red_bullet.gif (914 bytes) RAxML (Randomized Axelerated Maximum Likelihood) is a program for sequential and parallel Maximum Likelihood based inference of large phylogenetic trees (Reference: Stamatakis, A. 2006. Bioinformatics 22:2688–2690).

ProtTest (David Posada, University of Vigo, Spain) - estimates the empirical model of aminoacid substitution that fits the data best among 64 candidate models. PROTTEST calculates AIC, AICc and BIC values, and obtain a rank of model fits, model-averaged parameter estimates, or measures of parameter importance.  Mac OSX, Windows and Linux versions are available for downloading.

 Phylemon2 - a suite of web-tools for molecular evolution, phylogenetics and phylogenomics (Reference:Sánchez, R. et al. 2011.Nucl. Acids Res. 39/suppl_2/W470)

POWER (PhylOgenetic Web Repeater) - allows users to carry out phylogenetic analysis on most programs of PHYLIP package repeatedly. POWER provide two pipelines to process the analysis. One of them includes multiple sequence alignment (MSA) at the beginning of the pipeline whereas the other begin phylogenetic analysis with aligned sequence. Very user friendly. (Reference: C.-Y. Lin. et al. 2005. Nucl. Acids Res. 33: W553-W556).
red_bullet.gif (914 bytes) Phylodendron - phylogenetic tree printer (D.G. Gilbert, Indiana Univ.) - very useful in visualizing *.dnd file from aligments and saving the results as .GIF, .PS or .PDF files.  N.B. The font style and size can be altered in the .PDF output format.
red_bullet.gif (914 bytes) Phylogenetic tree prediction - GeneBee service (Belozersky Institute of Physico-chemical Biology, Moscow State University, Russia)

red_bullet.gif (914 bytes) Phylogenetic Tree Plot (Laboratory of Bioinformatics, Wageningen UR, The Netherlands) - submit tree descriptions in PHYLIP (Newick) format only

red_bullet.gif (914 bytes) CVTree3 constructs whole-genome based phylogenetic trees without sequence alignment by using a Composition Vector (CV) approach. It was first developed to infer evolutionary relatedness of microbial organisms and then successfully applied to viruses, chloroplasts, and fungi. CVTree3 makes comparison with taxonomy and reports tree-branch monophyleticity from domain to species. (Reference: G. Zuo, & B. Hao (2015) Genomics Proteomics & Bioinformatics, 13: 321-331).

red_bullet.gif (914 bytes) webPRANK -  incorporates phylogeny-aware multiple sequence alignment, visualisation and post-processing in an easy-to-use web interface.(Reference: Löytynoja, A., & Goldman, N. 2010. BMC Bioinformatics. 11:579).

red_bullet.gif (914 bytes) eShadow Evolutionary phylogenetic SHADOWing of closely related species (Reference:  Ovcharenko, D. et al. (2004) Genome Research, 14(6): 1191-1198)

red_bullet.gif (914 bytes) PhyloGene server for identification and visualization of co-evolving proteins using normalized phylogenetic profiles - they used normalized phylogenetic profiling to predict protein function and identify new pathway members and disease genes. The phylogenetic profiles of tens of thousands conserved proteins in the human, mouse, Caenorhabditis elegans and Drosophila genomes can be queried on the new web server, PhyloGene. (Reference: I.R. Sadreyev et al. 2015.  Nucl. Acids Res. 43 (W1): W154-W159).

red_bullet.gif (914 bytes) SIFTER (Statistical Inference of Function Through Evolutionary Relationships) is a statistical approach to predicting protein function that uses a protein family's phylogenetic tree, as the natural structure for representing protein relationships. (Reference: S.M. Sahraeian et al. 2015.  Nucl. Acids Res. 43 (W1): W141-W147).

red_bullet.gif (914 bytes) PATH - is a novel method to infer distant homology relations of two proteins, that accounts for frameshift and point mutations that may have affected the coding sequences. We design a dynamic programming alignment algorithm over memory-efficient graph representations of the complete set of putative DNA sequences of each protein, with the goal of determining the two putative DNA sequences which have the best scoring alignment under a powerful scoring system designed to reflect the most probable evolutionary process. (Reference: Gîrdea, M. et al. Algorithms for Molecular Biology 5: 6 ; 2010).

red_bullet.gif (914 bytes) ReplacementMatrix - maximum-likelihood estimation of amino acid replacement rate matrices. (Reference: Dang, C.C. et al. 2011. Bioinformatics. 27(19):2758-2760).

red_bullet.gif (914 bytes) DIVEIN - starting with a set of aligned sequences, DIVEIN estimates evolutionary parameters and phylogenetic trees while allowing the user to choose from a variety of evolutionary models; it then reconstructs the consensus (CON), most recent common ancestor (MRCA), and center of tree (COT) sequences. DIVEIN also provides tools for further analyses. (Reference: Deng, W. et al. 2010. Biotechniques. 48(5):405-408).

red_bullet.gif (914 bytes) Ribosomal RNA Analysis - The Ribosomal Database Project II (Michigan State University Centre for Microbial Ecology, U.S.A.).
red_bullet.gif (914 bytes) Ridom - Ribosomal RNA analysis for clinically relevant bacteria - (University of Würzburg, Germany)
red_bullet.gif (914 bytes) Rifle - (Universitat Bielefeld, Germany) The RIFLE system compares restriction patterns of 16S rDNA amplicons against a database of theoretical restriction patterns generated from a 16S rDNA database

Molecular Taxonomy

red_bullet.gif (914 bytes) AmphoraNet - is capable of assigning a probability-weighted taxonomic group for each phylogenetic marker gene found in the input metagenomic sample; the webserver is based on the AMPHORA2 workflow.  It uses 31 bacterial and 104 archaeal protein coding marker genes for metagenomic phylotyping. Most of these are single copy genes, therefore AmphoraNet is suitable for estimating the taxonomic composition of bacterial and archaeal communities from metagenomic shotgun sequencing data. (Reference: Kerepesi, C. et al. 2014. Gene 533: 538–540)

red_bullet.gif (914 bytes) GGDC - Genome-to-Genome Distance Calculator - The species concept for Bacteria and Archaea is ultimately based on DNA-DNA hybridization (DDH). This web service can be used for genome-based species delineation with complete or incomplete genomes sequences. The server calculate intergenomic distances; and, these are converted into similarity values analogous to DDH and sent to you via e-mail. (Reference: Meier-Kolthoff, J.P. et al. 2013. BMC Bioinformatics 14:60).

red_bullet.gif (914 bytes) VICTOR (Virus Classification and Tree Building Online Resource; Leibniz-Institut DSMZ-Deutsche Sammlung von Mikroorganismen und Zellkulturen GmbH).  This web service compares bacterial and archaeal viruses ("phages") using their genome or proteome sequences. The results include phylogenomic trees inferred using the Genome-BLAST Distance Phylogeny method (GBDP), with branch support, as well as suggestions for the classification at the species, genus and family level. (The service can be applied to other kinds of viruses, too, but has not yet been tested in this respect.) Upload your FASTA files, GenBank files and/or GenBank accession IDs.  (Reference: JP Meier-Kolthoff & M Göker. 2017. Bioinformatics 33(21): 3396–3404).

red_bullet.gif (914 bytes) VIRFAM is dedicated to the recognition of head-neck-tail modules and of recombinase genes in phage genomes. You can use this server to search for remote homologs of specific protein families within protein sequences of bacteriophages. Input: protein sequences you’re your phage; output includesd a phylogenetic tree with the placement of your virus. (Reference: Lopes A et al. Nucleic Acids Res. (2010) 38(12): 3952-62).

red_bullet.gif (914 bytes) MyTaxa - represents a new algorithm that extends the Average Amino Acid Identity (AAI) concept to identify the taxonomic affiliation of a query genome sequence or a sequence of a contig assembled from a metagenome, including short sequences, and to classify sequences representing novel taxa at three levels  i.e., species, genus and phylum. MyTaxa can assign a larger number of sequences and with higher accuracy compared to other tools available for the same purposes. This is largely attributed to the fact that MyTaxa considers all genes present in an unknown (query) sequence as classifiers and quantifies the classifying power of each gene using predetermined weights, which are derived from the analysis of orthologs of the gene from all available complete genomes. (Reference: Luo, C. et al. 2014. Nucl. Acids Res).

red_bullet.gif (914 bytes) ANI calculator -  estimates the average nucleotide identity using both best hits (one-way ANI) and reciprocal best hits (two-way ANI) between two genomic datasets. Typically, the ANI values between genomes of the same species are above 95% while values below 75% are not to be trusted, and AAI should be used instead. This tool supports both complete and draft genomes (multi-fasta).(Reference: Goris J. et al. 2007. Int J Syst Evol Microbiol. 57:81-91).

red_bullet.gif (914 bytes) Average Nucleotide Identity (ANI) calculator -  their ANI Calculator uses the OrthoANIu algorithm, an improved iteration of the original OrthoANI algorithm, which uses USEARCH instead of BLAST (Reference: Yoon, S. H. et al. (2017).  Antonie van Leeuwenhoek. 110:1281–1286).

red_bullet.gif (914 bytes) TaxMan: Inspect your rRNA amplicons and taxa assignments - In microbiome analyses, often rRNA gene databases are used to assign taxonomic names to sequence reads. The TaxMan server facilitates the analysis of the taxonomic distribution of your reads in two ways. First, you can check what taxonomic names are assigned to the sequences produced by your primers and what taxa you will lose. Second, the produced amplicon sequences with lineages in the FASTA header can be downloaded. This can result in a much more efficient analysis with respect to run time and memory usage, since the amplicon sequences are considerably shorter than the full length rRNA gene sequences. In addition, you can download a lineage file that includes the counts of all taxa for your primers and for the used reference. (Reference: Brandt, B.W. et al. 2012. Nucleic Acids Research 40:W82-W87).