DNA Sequencing

DNA Sequence Quality

DNA Sequence Quality - Phred - provides base calling, chromatogram display and high quality sequence region evaluation and presentation for up to five sequences simultaneously.


Sequence assembly - you don't need your own contig assembly program when you can use:

Galaxy for Genome Assembly

Galaxy for Genome Assembly - The Genome Assembly Workbench is a comprehensive set of analysis tools and consolidated workflows to assist in Genome Assembly. The workbench is based on the Galaxy framework, which guarantees simple access, easy extension, flexible adaption to personal and security needs, and sophisticated analyses independent of command-line knowledge. requires free registration.


CAP3

CAP3 (PBIL, France ),
(Reference: Huang,X. & Madan A. 1999. Genome Res.9: 868-877).


MicroScope web site

MicroScope web site (hosted at Genoscope), provides an environment for expert annotation and comparative genomics. Genome project: Annotation and comparative analyses of finished or draft genome sequences. For pre-annotated sequences, they only integrate annotations from NCBI RefSeq complete genome section. Metagenome project: Annotation and comparative analyses of assembled metagenomic sequences. Currently, they are able to integrate datasets below 20 Mb of contigs per bin.


EGassember

EGassember - aligns and merges sequence fragments resulting from shotgun sequencing or gene transcripts (EST) fragments in order to reconstruct the original segment or gene
(Reference: A. Masoudi-Nejad et al. 2006. Nucl. Acids Res. 34: W459-462).


NanoPipe

NanoPipe - was developed in consideration of the specifics of the MinION sequencing technologies, providing accordingly adjusted alignment parameters. The range of the target species/sequences for the alignment is not limited, and the descriptive usage page of NanoPipe helps a user to succeed with NanoPipe analysis. The results contain alignment statistics, consensus sequence, polymorphisms data, and visualization of the alignment.
(Reference: Shabardina V et al. (2019) Gigascience 8(2). pii: giy169).


COV2HTML

COV2HTML: a visualization and analysis tool of bacterial next generation sequencing (NGS) data for postgenomics life scientists - allows performing both coverage visualization and analysis of NGS alignments performed on prokaryotic organisms (bacteria and phages). It combines two processes: a tool that converts the huge NGS mapping or coverage files into light specific coverage files containing information on genetic elements; and a visualization interface allowing a real-time analysis of data with optional integration of statistical results.
(Reference: Monot M. et al. 2014. OMICS 18(3): 184-95).


PhageTerm

PhageTerm - is a fast and user-friendly software package which can be used to determine bacteriophage termini and packaging mode from randomly fragmented NGS data. It is part of the Galaxy package, and can be found in the "NGS: Mapping" directory. Ideal is you want an automated answer.
(Reference: Garneau JR, et al. 2017. Sci Rep. 7(1):8292).


Sequencing errors:

if your DNA sequence doesn't match the expected protein sequence you can check for errors using BLASTx which compares a protein sequence to a genomic DNA sequence, allowing for introns and frameshifting errors. Other programs include:


FrameD

FrameD - is a program that predicts coding regions in prokaryotic and matured eukaryotic sequences. Initially targeted at gene prediction in bacterial GC rich genomes, the gene model used in FrameD also allows to predict genes in the presence of frameshifts and partially undetermined sequences which makes it also very suitable for gene prediction and frameshift correction in unfinished sequences such as EST and EST cluster sequences. Like recent eukaryotic gene prediction programs, FrameD also includes the ability to take into account protein similarity information both in its prediction and its graphical output.
(Reference: T. Schliex et al. 2003. Nucl. Acids Res. 31: 3738-3741)


PATH

PATH: protein back-translation and alignment - addresses the problem of finding distant protein homologies where the divergence is the result of frameshift mutations and substitutions. Given two input protein sequences, the method implicitly aligns all the possible pairs of DNA sequences that encode them, by manipulating memory-efficient graph representations of the complete set of putative DNA sequences for each protein.
(Reference: Gîrdea M et al. 2010. Algorithms for Molecular Biology 5:)


In-silico.com

In-silico.com (Dr. Joseba Bikandi & co-workers, Faculty of Pharmacy, in the University of the Basque Country) - allows in silico experiments including theoretical PCR amplification, AFLP-PCR , restriction analysis and pulsed field gel electrophoresis [PFGE] with bacterial & archael genomes found in the public database.

Updated: December, 2025