that the average cycle count of interpolation opera-
tion is 1500. For IQ/IT, the skip mode has probabil-
ity 45% in average and 6% in worst case, which re-
sults the best case cycle count 200. The average cy-
cle count spent in IT/IQ is around 1500 cycles.
Therefore, the implementation on the DSP core
spends around 5000 cycles per MB processing in
average, which is under the required cycle counts
per MB of decoding an H.264 video sequence with
D1 (720×480) resolution in real time (i.e., 6173 cy-
cles, derived from (250×10
6
) / (45×30) / 30 when the
DSP is running at 250 MHz), therefore achieving
real-time decoding in most cases.
Table 5-2: Execution cycles of the procedures on the PAC
DSP.
Functions Cycle counts Code size(kB)
DSP_main 150 1.5
IQ/IT 200~2600 6.4
Intra prediction (300~1900) + (200~400) 18.7
Inter prediction (640~2800) + 200 11.2
DF 1000~5000 8.9
TOTAL 1850~10600 46.7
Finally, the prototyping SoC platform with an
ARM core and FPGA module (for the PAC DSP)
reveals that, even in low profile specification, the
whole decoding system still can process up to 26 fps
at QCIF resolution, which can be expected at higher
specification with a real-chip dual-core SoC (e.g.,
PAC DSP@250MHz with higher bus frequency) for
decoding a video with D1 resolution in real-time.
6 CONCLUSIONS
In this paper, a software programming model for
H.264/AVC decoder on an asymmetric dual-core
SoC platform, equipped with a VLIW PAC DSP
coprocessor is presented. The decoding throughput
is achieved by well-organized software partitioning
flow between two cores, efficient data movement
from MPU to PAC DSP and vice versa, and program
optimization both on the MPU and PAC DSP. The
analysis shows that the implementation can achieve
real-time decoding at D1 resolution, which provides
a valuable experience for similar implementations.
REFERENCES
Chen, Y.-K. Li, E., Zhou, X., Ge, S., 2006. Implementa-
tion of H.264 Encoder and Decoder on Personal Com-
puters. In Journal of Visual Communications and Im-
age Representations, vol. 17, no. 2, pp. 509-532.
DaVinci™ technology from TI.(n.d.). Retrieved March 2,
2007,from:http://www.ti.com/corp/docs/landing/davin
ci/index.html
Horowitz, M., Joch, A., Kossentini, F., Hallapuro, A.,
2003. H.264/AVC baseline profile decoder complexity
analysis. IEEE Transactions on Circuits and Systems
for Video Technology, vol.13, issue 7, pp. 704-716.
ITU-T Rec.H.264, ISO/IEC 14496-10, 2003. Advanced
video coding, Final Draft International Standard, JVT-
G050r1, Geneva, Switzerland
Kalva, H., Furht, B., 2005. Complexity Estimation of the
H.264 Coded Video Bitstreams. Computer Journal,
vol. 48, issue 5, pp. 504-513.
Lin, H.-C., Wang, Y.-J., Cheng, K.-T., Yeh, S.-Y., Chen,
W.-N., Tsai, C.-Y., Chang, T.-S., Hang, H.-M., 2006.
Algorithm and DSP implementation of H.264/AVC. In
ASP-DAC’06, 11-th Asia and South Pacific Design
Automation Conference, Yokohama, Japan.
Ostermann, J., Bormans, J., List, P., Marpe, D., Narro-
schke, M., Pereira, F., Stockhammer, T., Wedi, T.,
2004. Video coding with H. 264/AVC: tools, perform-
ance, and complexity. Circuits and Systems Magazine,
IEEE, vol. 4, no. 1, pp. 728.
Raja, G., Mirza, M. J., 2004. Performance comparison of
advanced video coding H.264 standard with baseline
H.263 and H.263+ standards. In ISCIT’04, IEEE In-
ternational Symposium on Communications and In-
formation Technology, vol.2, pp. 743-746.
Schwarz, H., Marpe, D., Wiegand, T., 2006. Overview of
the Scalable H.264/MPEG4-AVC Extension. In
ICIP’06, IEEE International Conference on Image
Processing, pp. 161-164
STC’s Multimedia SoC. (n.d.). Retrieved March, 2, 2007,
from: http://int.stc.itri.org.tw/eng/research/multimedia-
soc.jsp?tree_idx=0200
Suh, H.-L., Jeong, H.-K., Ji, H.-P., Seon, W.-K., Suki, K.,
2006. Implementation of H.264/AVC Decoder for
Mobile Video Applications. In ISCAS’06, IEEE Inter-
national Symposium on Circuits and Systems .
Tseng, S.-Y., Hsieh, T.-W., 2006. A Pattern-Search
Method for H.264/AVC CAVLC Decoding. In
ICME’06, IEEE International Conference on Multi-
media & Expo, pp. 1073-1076.
Wang, S., Yang, Y., Li, C., Tung, Y., Wu, J., 2004. The
optimization of H.264/AVC baseline decoder on low-
cost TriMedia DSP processor. In proceedings of SPIE,
vol. 5558, p. 524.
Wiegand, T., Schwarz, H., Joch, A., Kossentini, F.,
Sullivan, G., 2003. Rate-constrained coder control and
comparison of video coding standards. IEEE Transac-
tions on Circuits and Systems for Video Technology,
vol. 13, pp. 688–703.
SIGMAP 2007 - International Conference on Signal Processing and Multimedia Applications
316