Email updates

Keep up to date with the latest news and content from BMC Bioinformatics and BioMed Central.

Open Access Software

DIALIGN P: Fast pair-wise and multiple sequence alignment using parallel processors

Martin Schmollinger1, Kay Nieselt2, Michael Kaufmann1 and Burkhard Morgenstern3*

Author Affiliations

1 Wilhelm-Schickard-Institut fur Informatik, Sand 14, 72076 Tübingen, Germany

2 Center for Bioinformatics Tübingen, Sand 14, 72076 Tübingen, Germany

3 University of Göttingen, Institute of Microbiology and Genetics, Goldschmidtstr. 1, 37077 Göttingen, Germany

For all author emails, please log on.

BMC Bioinformatics 2004, 5:128  doi:10.1186/1471-2105-5-128

Published: 9 September 2004

Abstract

Background

Parallel computing is frequently used to speed up computationally expensive tasks in Bioinformatics.

Results

Herein, a parallel version of the multi-alignment program DIALIGN is introduced. We propose two ways of dividing the program into independent sub-routines that can be run on different processors: (a) pair-wise sequence alignments that are used as a first step to multiple alignment account for most of the CPU time in DIALIGN. Since alignments of different sequence pairs are completely independent of each other, they can be distributed to multiple processors without any effect on the resulting output alignments. (b) For alignments of large genomic sequences, we use a heuristics by splitting up sequences into sub-sequences based on a previously introduced anchored alignment procedure. For our test sequences, this combined approach reduces the program running time of DIALIGN by up to 97%.

Conclusions

By distributing sub-routines to multiple processors, the running time of DIALIGN can be crucially improved. With these improvements, it is possible to apply the program in large-scale genomics and proteomics projects that were previously beyond its scope.