ditions as reflected in the F. delay values of Table 1.
Table 1: Timing information of the multi-core system. F.
delay is the frame delay used to increase exposure time.
Sen. ROI is the rate at which ROI frames are captured by
the sensor per second. Sys. ROI is the rate at which ROI
images actually pass through the entire system per second.
LCD and PC are the rate, in frames per second, at which
320 by 240 pixel images and 40 by 48 motion representa-
tions reach the LCD and PC (via ethernet) respectfully.
F.delay (µs) Sen. ROI Sys. ROI LCD PC
285 436 305 10.2 8.5
162 1442 489 16.3 15.3
113 1700 643 21.3 18.5
The Sensor ROI and System ROI values of Table 1
show the potential rate of ROI sensor capture and ac-
tual rate of ROI processing for the system. Clearly
the system cannot process image data at the sensor
ROI grabbing rate leaving the sensor to free run until
the system is ready to grab the next ROI. ROI transfer
times from the sensor to the system are 46µs and sen-
sor ROI programming times are 26µs. The LCD val-
ues show the 320 by 240 pixel frames per second rate
of output to the LCD and the PC values show the rate
at which 40 by 48 motion representations are received
by the host PC via the ethernet connection. The gaus-
sian filtering is computed in a naive manner and by
using the algorithms of (Wells, W. M., 1986) system
performance and functionality could be increased. It
should be noted that there has been no explicit optimi-
sation applied to the system software which is written
in the XMOS XC language, an extension of C. The
above timings are given as an initial report of results
and much more analysis is required to fully under-
stand the true performance of the system.
4 CONCLUSIONS
In this paper we have shown that by leveraging the
programmability of an image sensor, motion detec-
tion can be carried out at near standard frame rates at
an effective resolution of 320 by 240 pixels using a
single-core four thread processor with just 64KBs of
RAM. Further we have shown that by using a multi-
core architecture motion detection and various addi-
tional image processing can be carried out at near real
time rates at an effective resolution of 320 by 240 pix-
els using a distributed system with no more than four
unshared blocks of 64KB of RAM. It is expected that
with further development the proposed system will be
able to compute higher-level computer vision algo-
rithms such as optical flow (Barron, J. L. and Fleet,
D. J. and Beauchemin, S., 1994), point tracking (Shi,
J. and Tomasi, C., 1994), gesture recognition (Shot-
ton, J. and Fitzgibbon, A. and Cook, M. and Sharp,
T. and Finocchio, M. and More, R. and Kipman, A.
and Blake, A., 2011) and face detection (Viola, P.
and Jones, M. J. and Snow, D., 2005). Key contribu-
tions of this paper include leveraging the programma-
bility of modern image sensors and the use of high
frequency low power XMOS processors.
ACKNOWLEDGEMENTS
This work was sponsored by an EPSRC Knowledge
Transfer Secondment help by the Research, Enter-
prise and Development department of the University
of Bristol.
REFERENCES
Barron, J. L. and Fleet, D. J. and Beauchemin, S. (1994).
Performance of optical flow techniques. In Interna-
tional Journal of Computer Vision, volume 12, pages
43–77.
Davis, J. and Bobick, A. (1997). The representation and
recognition of action using temporal templates. In In-
ternational Conference on Computer Vision and Pat-
tern Recognition.
Lichtsteiner, P. and Posch, C. and Delbruck, T. (2007). An
128x128 120db 15us-latency temporal contrast vision
sensor. In IEEE Journal Solid State Circuits.
Shi, J. and Tomasi, C. (1994). Good features to track. In In-
ternational Conference on Computer Vision and Pat-
tern Recognition.
Shotton, J. and Fitzgibbon, A. and Cook, M. and Sharp, T.
and Finocchio, M. and More, R. and Kipman, A. and
Blake, A. (2011). Real-time human pose recognition
in parts from single depth images. In International
Conference on Computer Vision and Pattern Recogni-
tion.
Shraml, S. and Belbachir, A. N. (2010). A spatio-temporal
clustering method using real-time motion analysis on
event-based 3d vision. In International Conference on
Computer Vision and Pattern Recognition.
Viola, P. and Jones, M. J. and Snow, D. (2005). Detect-
ing pedestrians using patterns of motion and appear-
ance. In International Journal of Computer Vision,
volume 63, pages 153–161.
Wells, W. M. (1986). Efficient synthesis of gaussian filters
by cascaded uniform filters. In IEEE Transactions on
Pattern Analysis and Machine Intelligence, volume 8,
pages 234–239.
PECCS 2012 - International Conference on Pervasive and Embedded Computing and Communication Systems
402