0
500
1000
1500
2000
GOL(Torus)
GOL(Grids)
FFT
LMK7
Mandelbrot
MatrixMultiply
Smooth
KIPS
Signature-based HLSim MGSim
Figure 9: Average IPS achieved by MGSim and Signature-
based HLSim.
6.5 IPS - Simulation speed
Instructions per second (IPS) is used to measure the
basic performance of an architecture, as we can mea-
sure the simulated instructions per second using a
known contemporary processor. The average IPS (av-
erage across all the cores) achieved by Signature-
based HLSim and MGSim is shown in fig. 9. We
can see that the IPS of MGSim is approximately 100
KIPS, and the IPS of Signature-based HLSim is ap-
proximately 1 MIPS. Different simulators used in in-
dustry and academia with their simulation speed in
terms of IPS are: COTSon (Argollo et al., 2009)
executes at 750KIPS, SimpleScalar (Austin et al.,
2002) executes at 150KIPS, Interval simulator (Carl-
son et al., 2011) executes at 350KIPS and Sesame (Er-
bas et al., 2007) executes at 300KIPS. MGSim (Bou-
sias et al., 2009) executes at 100KIPS. Compared to
the IPS of these simulators the IPS of HLSim is very
promising. It should be noted that the IPS of sim-
ulation frameworks given above are simulating only
few number of cores on the chip. In MGSim and
HLSim we have simulated 128 cores on a single chip.
Given this large number of simulated cores on a chip,
1 MIPS indicates a high simulation speed.
7 CONCLUSION
Signatures are introduced to estimate the number of
instructions in abstracted categories of basic blocks.
These signatures are then used to model the dynamic
adaptation of the program based on the currently ac-
tive threads per core. In this paper, we have simu-
lated load operation as a variable latency operation
and have treated store operation as single latency op-
eration. Also we have ignored the simulation of reg-
ister files in HLSim. In the future work we would
like to simulate store and register files in HLSim and
analyze if the accuracy can further be improved.
REFERENCES
Argollo, E., Falc
´
on, A., Faraboschi, P., Monchiero, M., and
Ortega, D. (2009). Cotson: infrastructure for full sys-
tem simulation. SIGOPS Oper. Syst. Rev., 43(1):52–
61.
Austin, T., Larson, E., and Ernst, D. (2002). SimpleScalar:
An Infrastructure for Computer System Modeling.
Computer, 35(2):59–67.
Bammi, J. R., Kruijtzer, W., Lavagno, L., Harcourt, E., and
Lazarescu, M. T. (2000). Software performance es-
timation strategies in a system-level design tool. In
Proceedings of the eighth international workshop on
Hardware/software codesign, CODES ’00, pages 82–
86, New York, NY, USA. ACM.
Bernard, T. A. M., Grelck, C., Hicks, M. A., Jesshope,
C. R., and Poss, R. (2011). Resource-agnostic pro-
gramming for many-core microgrids. In Proceed-
ings of the 2010 conference on Parallel processing,
Euro-Par 2010, pages 109–116, Berlin, Heidelberg.
Springer-Verlag.
Bousias, K., Guang, L., Jesshope, C. R., and Lankamp,
M. (2009). Implementation and evaluation of a mi-
crothread architecture. J. Syst. Archit., 55:149–161.
Carlson, T. E., Heirman, W., and Eeckhout, L. (2011).
Sniper: exploring the level of abstraction for scalable
and accurate parallel multi-core simulation. In Pro-
ceedings of 2011 International Conference for High
Performance Computing, Networking, Storage and
Analysis, SC ’11, pages 52:1–52:12, New York, NY,
USA. ACM.
Corporation, D. E. (1992). Alpha Architecture Handbook.
Eeckhout, L., Nussbaum, S., Smith, J. E., and Bosschere,
K. D. (2003). Statistical simulation: Adding effi-
ciency to the computer designer’s toolbox. IEEE Mi-
cro, 23:26–38.
Erbas, C., Pimentel, A. D., Thompson, M., and Polstra,
S. (2007). A framework for system-level modeling
and simulation of embedded systems architectures.
EURASIP J. Embedded Syst., 2007:2–2.
Giusto, P., Martin, G., and Harcourt, E. (2001). Reliable es-
timation of execution time of embedded software. In
Proceedings of the conference on Design, automation
and test in Europe, DATE ’01, pages 580–589, Piscat-
away, NJ, USA. IEEE Press.
Jesshope, C. (2008). A model for the design and pro-
gramming of multi-cores. Advances in Parallel Com-
puting, High Performance Computing and Grids in
Action(16):37–55.
Jesshope, C., Lankamp, M., and Zhang, L. (2009). The
implementation of an svp many-core processor and
the evaluation of its memory architecture. SIGARCH
Comput. Archit. News, 37:38–45.
Jesshope, C. R. (2004). Microgrids - the exploitation of
massive on-chip concurrency. In Grandinetti, L., ed-
itor, High Performance Computing Workshop, vol-
ume 14 of Advances in Parallel Computing, pages
203–223. Elsevier.
Lankamp, M., Poss, R., Yang, Q., Fu, J., Uddin, I., and
Jesshope, C. R. (2013). MGSim - Simulation tools
Signature-basedHigh-levelSimulationofMicrothreadedMany-coreArchitectures
515