Abstract.
We developed a multithreaded parallel implementation of a sequence alignment algorithm that is able to align whole genomes with reliable output and reasonable cost. This paper presents a performance evaluation of the whole genome comparison tool called ATGC – Another Tool for Genome Comparison, on a Hyper-Threading multiprocessor. We use our application to determine the system scalability for this particular type of sequence comparison algorithm and the improvement due to Hyper-Threading technology. The experimental results show that despite of placing a great demand on the memory system, the multithreaded code generated by Intel compiler yields to a 3.3 absolute speedup on a quad-processor machine, with parallelization guided by OpenMP pragmas. Additionally, a relatively high 1st level cache miss rate of 7-8% and a lack of memory bandwidth prevent logical processors with hyper-threading technology enabled from achieving further improvement.
This work was partially supported by NSF and DOE: NSF through the NGS program, grant 0103723; DOE grant DE-FC02-01ER25503.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Arslan, N., et al.: A new approach to sequence comparison: Normalized sequence alignment. Bioinformatics 17(4), 327–337 (2001)
del Cuvillo, J.: Whole genome comparison using a multithreaded parallel implementation. Master’s thesis, U. of Delaware, Newark, Del. (July 2001)
Delcher, L., et al.: Alignment of whole genomes. Nucleic Acids Res. 27(11), 2369–2376 (1999)
Mar, D.T., et al.: Hyper-threading technology architecture and microarchitecture. Intel Tech. J. 6(1), 4–15 (2002)
Martins, W.S., et al.: Whole genome alignment using a multithreaded parallel implementation. In: Proc. of the 13th Symp. on Computer Architecture and High Performance Computing, Pirenópolis, Brazil, September 10–12 (2001)
Martins, W.S., et al.: A multithreaded parallel implementation of a dynamic programming algorithm for sequence comparison. In: Proc. of the Pacific Symp. on Biocomputing, Mauna Lani, Haw., January 3–7, pp. 311–322 (2001)
Schwartz, S., et al.: PipMaker — A web server for aligning two genomic DNA sequences. Genome Res. 10(4), 577–586 (2000)
Tian, X., et al.: Intel OpenMP C++/Fortran compiler for hyper-threading technology: Implementation and performance. Intel Tech. J. 6(1), 36–46 (2002)
Tullsen, D.M., et al.: Simultaneous multithreading: Maximizing on-chip parallelism. In: Proc. of the 22nd Ann. Intl. Symp. on Computer Architecture, Santa Margherita Ligure, Italy, pp. 392–403 (June 1995)
Waterman, M.S.: Introduction to Computational Biology: Maps, Sequences, and Genomes. Chapman and Hall, Boca Raton (1995)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2003 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
del Cuvillo, J., Tian, X., Gao, G.R., Girkar, M. (2003). Performance Study of a Whole Genome Comparison Tool on a Hyper-Threading Multiprocessor. In: Veidenbaum, A., Joe, K., Amano, H., Aiso, H. (eds) High Performance Computing. ISHPC 2003. Lecture Notes in Computer Science, vol 2858. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-39707-6_40
Download citation
DOI: https://doi.org/10.1007/978-3-540-39707-6_40
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-20359-9
Online ISBN: 978-3-540-39707-6
eBook Packages: Springer Book Archive