A FAULT-TOLERANT DISTRIBUTED DATA FLOW ARCHITECTURE FOR REAL-TIME DECENTRALIZED CONTROL

Salvador Fallorina, Paul Thienphrapa, Rodrigo Luna, Vu Khuong, Helen Boussalis, Charles Liu, Jane Dong, Khosrow Rad, Wing Ho

2005

Abstract

Complex control-oriented structures are inherently multiple input, multiple output systems whose complexities increase significantly with each additional parameter. When precision performance in both space and time is required, these types of applications can be described as real-time systems that demand substantial amounts of computational power in order to function properly. The failure of a subsystem can be viewed as the extreme case of a non-real-time response, so the ability of a system to recognize and recover from faults, and continue operating in at least some degraded mode, is of crucial importance. Furthermore, the issue of fault-tolerance naturally arises because real-time control systems are often placed in mission-critical contexts. Decentralized control techniques, in which multiple lower-order controllers replace a monolithic controller, provide a framework for embedded parallel computing to facilitate the fault-tolerance and high performance of a sophisticated control system. This paper introduces a fault-tolerant concept to the handling of data flows in multiprocessor environments that are reminiscent of control systems. The design is described in detail and compared against a typical master-slave configuration. A distributed data flow architecture embraces tolerance to processor failures while satisfying real-time constraints, justifying its use over conventional methods. Both master-slave and distributed data flow designs have been studied with regards to a physical control-intensive system; the conclusions indicate a sound design and encourage the further division of computational responsibilities in order to promote fault-tolerance in embedded control processing systems.

References

  1. Baratloo, A. et al. 1995, 'CALYPSO: a novel software system for fault-tolerant parallel processing on distributed platforms', Proc. IEEE HPDC, PC, VA.
  2. Blanton, R., Goldstein, S., & Schmidt, H. 1998, 'Tunable fault tolerance via test and reconfiguration', Proc. FTCS, Munich, Germany.
  3. Boussalis, H. 1979, Stability of Large Scale Systems. Ph.D. dissertation, New Mexico State University.
  4. Boussalis, H. 1994, 'Decentralization of large space-borne telescopes', Proc. SPIE Symposium on Astronomical Telescopes.
  5. Choudhary, A. et al. 1994, 'Optimal processor assignment for a class of pipelined computations', IEEE Transactions on Parallel and Distributed Systems, vol. 5, no. 4, pp. 439-445.
  6. DasGupta, B. et al. 1999, 'Generalized approach towards the fault diagnosis in any arbitrarily connected network', Proc. HiPC, Calcutta, India.
  7. Fallorina, S. et al. 2004, 'A generic pipelined task scheduling algorithm for fault-tolerant decentralized control of a segmented telescope testbed', Proc. ASME DETC/CIE, Salt Lake City, UT.
  8. Khan, G., & Wee, S. 2001, 'Fault-tolerant embedded computer system-on-chip for endoscope control', Proc. ISIC, Singapore.
  9. Reinhardt, S. & Mukherjee, S. 2000, 'Transient fault detection via simultaneous multithreading', Proc. ISCA, Vancouver, BC.
  10. Roberts, J. et al. 2004, 'Efficient real-time parallel signal processing for decentralized control using grouppipelined scheduling', Proc. ISNG, Las Vegas, NV.
  11. Siewiorek, D. et al. 2004, 'Experimental research in dependable computing at Carnegie Mellon University', Proc. WCC, Toulouse, France.
  12. Stockman, H. et al. 1997, The Next Generation Space Telescope: Visiting a Time When Galaxies Were Young, The Association of Universities for Research in Astronomy, Baltimore, MD.
  13. Thienphrapa, P. et al. 2004, 'A generalized fault-tolerant pipelined task scheduling for decentralized control of large segmented systems', Proc. CCCT, Austin, TX.
  14. Worden, K. & Dulieu-Barton, J.M. 2004, 'An overview of intelligent fault detection in systems and structures', Structural Health Monitoring, vol. 3, no. 1, pp. 85-98.
Download


Paper Citation


in Harvard Style

Fallorina S., Thienphrapa P., Luna R., Khuong V., Boussalis H., Liu C., Dong J., Rad K. and Ho W. (2005). A FAULT-TOLERANT DISTRIBUTED DATA FLOW ARCHITECTURE FOR REAL-TIME DECENTRALIZED CONTROL . In Proceedings of the Second International Conference on Informatics in Control, Automation and Robotics - Volume 3: ICINCO, ISBN 972-8865-31-7, pages 109-115. DOI: 10.5220/0001172101090115


in Bibtex Style

@conference{icinco05,
author={Salvador Fallorina and Paul Thienphrapa and Rodrigo Luna and Vu Khuong and Helen Boussalis and Charles Liu and Jane Dong and Khosrow Rad and Wing Ho},
title={A FAULT-TOLERANT DISTRIBUTED DATA FLOW ARCHITECTURE FOR REAL-TIME DECENTRALIZED CONTROL},
booktitle={Proceedings of the Second International Conference on Informatics in Control, Automation and Robotics - Volume 3: ICINCO,},
year={2005},
pages={109-115},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0001172101090115},
isbn={972-8865-31-7},
}


in EndNote Style

TY - CONF
JO - Proceedings of the Second International Conference on Informatics in Control, Automation and Robotics - Volume 3: ICINCO,
TI - A FAULT-TOLERANT DISTRIBUTED DATA FLOW ARCHITECTURE FOR REAL-TIME DECENTRALIZED CONTROL
SN - 972-8865-31-7
AU - Fallorina S.
AU - Thienphrapa P.
AU - Luna R.
AU - Khuong V.
AU - Boussalis H.
AU - Liu C.
AU - Dong J.
AU - Rad K.
AU - Ho W.
PY - 2005
SP - 109
EP - 115
DO - 10.5220/0001172101090115