BLAST Service on Workstation Clusters. Future Gen-
eration Computer Systems, 17(6):745–754.
Dally, W. and Towles, B. (2003). Principles and Practices
of Interconnection Networks. Morgan Kaufmann Pub-
lishers Inc., San Francisco, CA, USA.
Darling, A., Carey, L., and Feng, W. (2003). The Design,
Implementation, and Evaluation of mpiBLAST. In
Proceedings of ClusterWorld 2003.
Emmert-Streib, F. and Dehmer, M. (2008). Information
Theory and Statistical Learning. Springer Publishing
Company, Incorporated.
Gardner, M., Feng, W., Archuleta, J., Lin, H., and Ma, X.
(2006). Parallel Genomic Sequence-Searching on an
Ad-Hoc Grid: Experiences, Lessons learned, and Im-
plications. In IEEE/ACM International Conference
for High-Performance Computing, Networking, Stor-
age and Analysis (SC’06).
Janssen, C., Adalsteinsson, H., Cranford, S., Kenny, J.,
Pinar, A., Evensky, D., and Mayo, J. (2010). A Simu-
lator for Large-Scale Parallel Computer Architectures.
Inter. Jour. of Distributed Systems and Technologies,
1(2):57–73.
Kn¨upfer, A., R.B., Brunst, H., Mix, H., and Nagel, W.
(2006). Introducing the Open Trace Format (OTF).
In Alexandrov, V., van Albada, G., Sloot, P., and Don-
garra, J., editors, Int. Conf. on Computational Science,
volume 3992 of Lecture Notes in Computer Science,
pages 526–533. Springer.
Lin, H., Balaji, P., Poole, R., Sosa, C., Ma, X., and Feng, W.
(2008). Massively Parallel Genomic Sequence Search
on the Blue Gene/P Architecture. In Proc. ACM/IEEE
conference on Supercomputing (SC’08), pages 33:1–
33:11, Piscataway, NJ, USA. IEEE Press.
Lin, H., Ma, X., Chandramohan, P., Geist, A., and Sam-
atova, N. (2005). Efficient Data Access for Par-
allel BLAST. In Proc. International Parallel and
Distributed Processing Symposium (IPDPS’05), page
72.2, Washington, DC, USA. IEEE Computer Society.
Lin, H., Ma, X., Feng, W., and Samatova, N. (2011). Co-
ordinating Computation and I/O in Massively Parallel
Sequence Search. IEEE Transactions on Parallel and
Distributed Systems, 22(4):529–543.
Mathog, D. (2003). Parallel BLAST on split databases.
Bioinformatics, 19(14):1865–1866.
Mintchev, S. and Getov, V. (1997). PMPI: High-Level Mes-
sage Passing in Fortran 77 and C. In Proc. Inter. Con-
ference and Exhibition on High-Performance Com-
puting and Networking (HPCN Europe ’97), pages
603–614, London, UK. Springer-Verlag.
MPI (2009). MPI (Message Passing Interface) standards
documents, errata, and archives of the MPI Forum.
http://www.mpi-forum.org.
Prakash, S., Deelman, E., and Bagrodia, R. (2000).
Asynchronous Parallel Simulation of Parallel Pro-
grams. IEEE Transactions on Software Engineering,
26(5):385–400.
Riesen, R. (2006). A Hybrid MPI Simulator. In IEEE Inter.
Conf. on Cluster Computing 2006, pages 1–9.
Schuster, S. (2007). Next-generation sequencing transforms
today’s biology. Nature Methods, 5(1):16–18.
Sherwood, T., Perelman, E., and Hamerly, G. (2002). Auto-
matically Characterizing Large Scale Program Behav-
ior. In 10th International Conference on Architectural
Support for Programming Languages and Operating
Systems (ASPLOS 2002), pages 45–57.
SST/Macro (2011). SST: The Structural Sim-
ulation Toolkit, SST/macro the Macroscale
Components, Open Source Release.
http://sst.sandia.gov/about sstmacro.html.
Stroustrup, B. (2000). The C++ Programming Lan-
guage. Addison-Wesley Longman Publishing Co.,
Inc., Boston, MA, USA.
Susukita, R., Ando, H., Aoyagi, M., Honda, H., Inadomi,
Y., Inoue, K., Ishizuki, S., Kimura, Y., Komatsu, H.,
Kurokawa, M., Murakami, K. J., Shibamura, H., Ya-
mamura, S., and Yu, Y. (2008). Performance predic-
tion of large-scale parallell system and application us-
ing macro-level simulation. In Proc. ACM/IEEE con-
ference on Supercomputing SC ’08, pages 20:1–20:9,
Piscataway, NJ, USA. IEEE Press.
Thorsen, O., Smith, B., Sosa, C., Jiang, K., Lin, H., Peters,
A., and Feng, W. (2007). Parallel genomic sequence-
search on a massively parallel system. In Proc. Int.
Conf. on Computing Frontiers (CF ’07), pages 59–68,
New York, NY, USA. ACM.
Top500 (2011). Top 500 SuperComputers Ranking at
March 2011. http://www.top500.org.
Underwood, K. D., Levenhagen, M., and Rodrigues, A.
(2007). Simulating Red Storm: Challenges and Suc-
cesses in Building a System Simulation. In Proc. In-
ternational Parallel and Distributed Processing Sym-
posium (IPDPS’07), pages 1–10, Los Alamitos, CA,
USA. IEEE Computer Society.
Vectorbase (2010). NIAID Bioinformatics Resource Cen-
ter for Invertebrate Vectors of Human Pathogens.
http://www.vectorbase.org.
Zheng, G., Wilmarth, T., Jagadishprasad, P., and Kal´e, L.
(2005). Simulation-based performance prediction for
large parallel machines. Int. Jour. Parallel Program.,
33(2):183–207.
EVALUATING PERFORMANCE OPTIMIZATIONS OF LARGE-SCALE GENOMIC SEQUENCE SEARCH
APPLICATIONS USING SST/MACRO
73