Phylogeny (AAI and ANI)

ANI calculator

ANI (Average Nucleotide Identity) calculator - estimates the average nucleotide identity using both best hits (one-way ANI) and reciprocal best hits (two-way ANI) between two genomic datasets. Typically, the ANI values between genomes of the same species are above 95% (e.g., Escherichia coli). Values below 75% are not to be trusted, and AAI should be used instead. This tool supports both complete and draft genomes (multi-fasta).
(Reference: Goris J et al. 2007. Int J Syst Evol Microbiol. 57(Pt 1): 81-91).
Also see ANI calculator and FastANI which is part of the KBase suite of programs (requires registration).


Average Nucleotide Identity

Average Nucleotide Identity (ANI) calculator - their ANI Calculator uses the OrthoANIu algorithm, an improved iteration of the original OrthoANI algorithm, which uses USEARCH instead of BLAST
(Reference: Yoon, S. H. et al. (2017). Antonie van Leeuwenhoek. 110:1281-1286).


VIRIDIC

VIRIDIC (Virus Intergenomic Distance Calculator; C. Moraru, Institute for Chemistry and Biology of the Marine Environment, Germany) - the first level of bacteriophage classification by ICTV involves computing the overall DNA sequence identity between two viruses. This new tool computes pairwise intergenomic distances/similarities amongst phage genomes. To run it, upload a single fasta file with all phage genomes of interest, create a project and press run. Save the project ID that will be displayed when the project is created. You will need it to access the data if the calculations take a long time
(Reference: Moraru C et al. 2020. Viruses. 12(11): 1268).


GGDC

GGDC (Genome-To-Genome Distance Calculator) - provides methods for inferring whole-genome distances which are well able to mimic DNA-DNA hybridization (DDH). Values calculated with GGDC yield a somewhat better correlation with wet-lab DDH values than alternative approaches such as "ANI". These distance functions can also cope with heavily reduced genomes and repetitive sequence regions. Some of them are also very robust against missing fractions of genomic information (due to incomplete genome sequencing). Thus, this web service can be used for genome-based species delineation.
(Reference: Meier-Kolthoff JP et al. 2013. BMC Bioinformatics 14: 60).


POGO-DB

POGO-DB - Based on computationally intensive whole-genome BLASTs, POGO-DB provides several metrics on pairwise genome: (a) Average Amino Acid Identity of all bi-directional best blast hits that covered at least 70% of the sequence and had 30% sequence identity; (b) Genomic Fluidity that estimates the similarity in gene content between two genomes; (c) Number of orthologs shared between two genomes (as defined by two criteria); (d) Pairwise identity of the most similar 16S rRNA genes; (e) Pairwise identity of 73 additional globally-conserved marker genes (which were determined by us to exist in at least 90% of all the genomes).
(Reference: Lan Y et al. 2014. Nucl. Acids Res. 42(D1): D625-D632).


VICTOR

VICTOR (Virus Classification and Tree Building Online Resource; Leibniz-Institut DSMZ-Deutsche Sammlung von Mikroorganismen und Zellkulturen GmbH). This web service compares bacterial and archaeal viruses ("phages") using their genome or proteome sequences. The results include phylogenomic trees inferred using the Genome-BLAST Distance Phylogeny method (GBDP), with branch support, as well as suggestions for the classification at the species, genus and family level. (The service can be applied to other kinds of viruses, too, but has not yet been tested in this respect.) Upload your FASTA files, GenBank files and/or GenBank accession IDs.
(Reference: JP Meier-Kolthoff & M Göker. 2017. Bioinformatics 33(21): 3396-3404).


VIRFAM

VIRFAM is dedicated to the recognition of head-neck-tail modules and of recombinase genes in phage genomes. You can use this server to search for remote homologs of specific protein families within protein sequences of bacteriophages. Input: protein sequences you're your phage; output includesd a phylogenetic tree with the placement of your virus.
(Reference: Lopes A et al. Nucleic Acids Res. (2010) 38(12): 3952-62).


VipTree

VipTree - generates a "proteomic tree" of viral genome sequences based on genome-wide sequence similarities computed by tBLASTx. The original proteomic tree concept (i.e., "the Phage Proteomic Tree") was developed by Rohwer and Edwards, 2002. A proteomic tree is a dendrogram that reveals global genomic similarity relationships between tens, hundreds, and thousands of viruses. It has been shown that viral groups identified in a proteomic tree well correspond to established viral taxonomies.
(Reference: Nishimura Y et al. (2017) Bioinformatics 33: 2379-2380).


MiGA

MiGA (Microbial Genomes Atlas) - a webserver that allows the classification of an unknown query genomic sequence, complete or partial, against all taxonomically classified taxa with available genome sequences, as well as comparisons to other related genomes including uncultivated ones, based on the genome-aggregate Average Nucleotide and Amino Acid Identity (ANI/AAI) concepts.
(Reference: Rodriguez-R et al (2018) Nucleic Acids Research 46(W1): W282-W288).

Updated: December, 2025