
 
that the average cycle count of interpolation opera-
tion is 1500. For IQ/IT, the skip mode has probabil-
ity 45% in average and 6% in worst case, which re-
sults the best case cycle count 200. The average cy-
cle count spent in IT/IQ is around 1500 cycles.  
Therefore, the implementation on the DSP core 
spends around 5000 cycles per MB processing in 
average, which is under the required cycle counts 
per MB of decoding an H.264 video sequence with 
D1 (720×480) resolution in real time (i.e., 6173 cy-
cles, derived from (250×10
6
) / (45×30) / 30 when the 
DSP is running at 250 MHz), therefore achieving 
real-time decoding in most cases. 
Table 5-2: Execution cycles of the procedures on the PAC 
DSP. 
Functions  Cycle counts  Code size(kB) 
DSP_main 150  1.5 
IQ/IT 200~2600  6.4 
Intra prediction  (300~1900) + (200~400)  18.7 
Inter prediction  (640~2800) + 200  11.2 
DF 1000~5000  8.9 
TOTAL 1850~10600  46.7 
Finally, the prototyping SoC platform with an 
ARM core and FPGA module (for the PAC DSP) 
reveals that, even in low profile specification, the 
whole decoding system still can process up to 26 fps 
at QCIF resolution, which can be expected at higher 
specification with a real-chip dual-core SoC (e.g., 
PAC DSP@250MHz with higher bus frequency) for 
decoding a video with D1 resolution in real-time. 
6  CONCLUSIONS 
In this paper, a software programming model for 
H.264/AVC decoder on an asymmetric dual-core 
SoC platform, equipped with a VLIW PAC DSP 
coprocessor is presented. The decoding throughput 
is achieved by well-organized software partitioning 
flow between two cores, efficient data movement 
from MPU to PAC DSP and vice versa, and program 
optimization both on the MPU and PAC DSP. The 
analysis shows that the implementation can achieve 
real-time decoding at D1 resolution, which provides 
a valuable experience for similar implementations. 
REFERENCES 
Chen, Y.-K. Li, E., Zhou, X., Ge, S., 2006. Implementa-
tion of H.264 Encoder and Decoder on Personal Com-
puters. In Journal of Visual Communications and Im-
age Representations, vol. 17, no. 2, pp. 509-532. 
DaVinci™ technology from TI.(n.d.). Retrieved March 2, 
2007,from:http://www.ti.com/corp/docs/landing/davin
ci/index.html 
Horowitz, M., Joch, A., Kossentini, F., Hallapuro, A.,  
2003. H.264/AVC baseline profile decoder complexity 
analysis. IEEE Transactions on Circuits and Systems 
for Video Technology, vol.13, issue 7, pp. 704-716. 
ITU-T Rec.H.264, ISO/IEC 14496-10, 2003. Advanced 
video coding, Final Draft International Standard, JVT-
G050r1, Geneva, Switzerland 
Kalva, H., Furht, B., 2005. Complexity Estimation of the 
H.264 Coded Video Bitstreams. Computer Journal, 
vol. 48, issue 5, pp. 504-513. 
Lin, H.-C., Wang, Y.-J., Cheng, K.-T., Yeh, S.-Y., Chen, 
W.-N., Tsai, C.-Y.,  Chang, T.-S., Hang, H.-M., 2006.  
Algorithm and DSP implementation of H.264/AVC. In 
ASP-DAC’06,  11-th Asia and South Pacific Design 
Automation Conference, Yokohama, Japan. 
Ostermann, J., Bormans, J., List, P., Marpe, D.,  Narro-
schke, M., Pereira, F., Stockhammer, T., Wedi, T., 
2004. Video coding with H. 264/AVC: tools, perform-
ance, and complexity. Circuits and Systems Magazine, 
IEEE, vol. 4, no. 1, pp. 728. 
Raja, G., Mirza, M. J., 2004. Performance comparison of 
advanced video coding H.264 standard with baseline 
H.263 and H.263+ standards. In ISCIT’04, IEEE In-
ternational Symposium on Communications and In-
formation Technology, vol.2, pp. 743-746. 
Schwarz, H., Marpe, D., Wiegand, T., 2006. Overview of 
the Scalable H.264/MPEG4-AVC Extension. In 
ICIP’06,  IEEE International Conference on Image 
Processing, pp. 161-164 
STC’s Multimedia SoC. (n.d.). Retrieved March, 2, 2007, 
from: http://int.stc.itri.org.tw/eng/research/multimedia-
soc.jsp?tree_idx=0200 
Suh, H.-L., Jeong, H.-K., Ji, H.-P., Seon, W.-K., Suki, K., 
2006. Implementation of H.264/AVC Decoder for 
Mobile Video Applications. In ISCAS’06, IEEE Inter-
national Symposium on Circuits and Systems . 
Tseng, S.-Y., Hsieh, T.-W., 2006. A Pattern-Search 
Method for H.264/AVC CAVLC Decoding. In 
ICME’06, IEEE International Conference on Multi-
media & Expo, pp. 1073-1076. 
Wang, S., Yang, Y., Li, C., Tung, Y., Wu, J., 2004. The 
optimization of H.264/AVC baseline decoder on low-
cost TriMedia DSP processor. In proceedings of SPIE, 
vol. 5558, p. 524. 
Wiegand, T., Schwarz, H., Joch, A., Kossentini, F.,  
Sullivan, G., 2003. Rate-constrained coder control and 
comparison of video coding standards. IEEE Transac-
tions on Circuits and Systems for Video Technology, 
vol. 13, pp. 688–703. 
SIGMAP 2007 - International Conference on Signal Processing and Multimedia Applications
316