Figure 4: Energy consumption normalized to the baseline design.
ence, and hence reducing energy and delay of lower-
level memory access. Experiments on a case study of
multitasking application reports an energy reduction
of 36.5% with the PLIC design. The reduction on I-
cache misses ranges from 6.0% to 18.3%, depending
on the frequency of context switch in the multitasking
system.
ACKNOWLEDGEMENTS
This work is supported by JSPS NEXT program un-
der grant number GR076.
REFERENCES
Burger, D. C. and Austin, T. M. (1997). The simplescalar
tool set, version 2.0. Technical Report CS-TR-1997-
1342, Department of Computer Science, University of
Wisconsin, Madison.
Dally, W. J., Balfour, J., Black-Shaffer, D., Chen, J., Hart-
ing, R. C., Parikh, V., Park, J., and Sheffield, D.
(2008). Efficient embedded computing. IEEE Com-
puter, 41(7):27–32.
Gauthier, L., Ishihara, T., Takase, H., Tomiyama, H.,
and Takada, H. (2010). Minimizing inter-task inter-
ferences in scratch-pad memory usage for reducing
the energy consumption of multi-task systems. In
Proceedings of the 2010 international conference on
Compilers, architectures and synthesis for embedded
systems (CASES'10), pages 157–166.
Gu, J. and Guo, H. (2010). Enabling large decoded instruc-
tion loop caching for energy-aware embedded proces-
sors. In Proceedings of the 2010 international con-
ference on Compilers, architectures and synthesis for
embedded systems (CASES'10), pages 247–256.
Guthaus, M. R., Ringenberg, J. S., Ernst, D., Austin, T. M.,
Mudge, T., and Brown, R. B. (2001). Mibench: A
free, commercially representative embedded bench-
mark suite. In IEEE 4th Annual Workshop on Work-
load Characterization, pages 83–94.
Ishihara, T. and Fallah, F. (2005). A non-uniform cache
architecture for low power system design. In Pro-
ceedings of the 2005 International Symposium on Low
Power Electronics and Design (ISLPED'05), pages
363–368.
Malik, A., Moyer, B., and Cermak, D. (2000). A low power
unified cache architecture providing power and per-
formance flexibility. In Proceedings of the 2000 In-
ternational Symposium on Low Power Electronics and
Design (ISLPED'00), pages 241–243.
Pan, D. Z. (2009). Low power design and challenges in
nanometer multicore era. In IEEE CAS Melbourne
and Victoria University, Invited Talks, August 20,
2009.
Paul, M. and Petrov, P. (2011). Dynamically adaptive i-
cache partitioning for energy-efficient embedded mul-
titasking. IEEE Transactions on Very Large Scale In-
tegration (VLSI) Systems, 19(11):2067–2080.
Reddy, R. and Petrov, P. (2010). Cache partitioning for
energy-efficient and interference-free embedded mul-
titasking. ACM Transactions on Embedded Comput-
ing Systems (TECS), 9(3):16:1–16:35.
Scott, J., Lee, L. H., Arends, J., and Moyer, B. (1998). De-
signing the low-power m-core architecture. In Inter-
national Sympsium on Computer Architecture Power
Driven Microarchitecture Workshop, pages 145–150.
Tang, W., Gupta, R., and Nicolau, A. (2002). Power savings
in embedded processors through decode filer cache. In
Proceedings of the Conference on Design, Automation
and Test in Europe (DATE'02), pages 443–448.
Thoziyoor, S., Muralimanohar, N., Ahn, J. H., and Jouppi,
N. P. (2008). CACTI: An integrated cache and mem-
ory access time, cycle time, area, leakage, and dy-
namic power model. Technical Report HPL-2008-20,
HP Laboratories.
Yang, C.-L. and Lee, C.-H. (2004). Hotspot cache: Joint
temporal and spatial locality exploitation for icache
energy reduction. In Proceedings of the International
Symposium on Low Power Electronics and Design,
pages 114–119.
Zhang, C., Vahid, F., Yang, J., and Najjar, W. (2005). A
way-halting cache for low-energy high-performance
systems. ACM Transactions on Architecture and Code
Optimization (TACO), 2(1):34–54.
SMARTGREENS2012-1stInternationalConferenceonSmartGridsandGreenITSystems
202