carefully examined whether such technologies can be
used in an automotive application with safety-critical
software. In addition, the constant alignment of the
two machines requires a high communication band-
width, which, although not a problem in a server en-
vironment, is again subject to different requirements
(EMC) in the vehicle and cannot necessarily be gua-
ranteed. The strategy of machines running in lock-
step, however, offers a high degree of reliability under
certain conditions and time requirements. The deter-
mination of exact times and concrete measurements is
no longer part of this work, but will be carried out in
a follow-up examination.
Container-layer. There are also solutions at contai-
ner level, such as the open source orchestration tool
„Kubernetes“ developed by Google, which offers the
possibility of running applications in containers re-
dundantly. Containers can encapsulate the executa-
bles of the application and all dependencies without
sacrificing the performance of operating system vir-
tualization.
However, the disadvantage in terms of failover is
the lengthy detection of a failure, which is even lon-
ger than the VMware solution described above. In
general, the tool is more designed for scalability of
services and only offers good reliability mechanisms
at first glance. These are not applicable to future auto-
motive control units; however, the container concept
could well be used here in general, since they offer
a high degree of flexibility with simultaneous small
performance losses.
Application-layer. Concepts to ensure the reliabi-
lity of services at application level were also exa-
mined in the context of the work. Programming
languages (e.g. Erlang) or application frameworks
(e.g. Akka) with corresponding features are used. Ap-
plication developers use these features to distribute
the programs on a network of servers.
This offers the highest degree of flexibility and
fine granularity. Only those parts of a program that re-
ally need it can be designed redundantly. In addition,
the most reliable way to detect a failure is from the
application logic. However, the most capable develo-
pers are required for this. This type of programming
is perceived as particularly difficult, but is ultimately
the safest option.
4 CONCLUSION
In this paper, fail-safe mechanisms on different levels
of a distributed system were presented. The closer
these mechanisms are placed near the hardware, the
faster and safer a redundant configuration of several
processing units can be implemented. However, in
this case the flexibility decreases and the performance
overhead increases. The more application-specific a
redundancy mechanism is, the lighter and finer gra-
nular it can be.
However, redundancy alone is no guarantee for sa-
fety against failures, since failure detection cannot of-
fer sufficient coverage when not taking all system’s
layers into account. Therefore mechanisms to imple-
ment this safety measures within all layers of the fra-
mework have to be provided.
ACKNOWLEDGEMENT
The authors gratefully acknowledge the financial sup-
port by the Bavarian Ministry of Economic Affairs,
Energy and Technology, funding progamme “Infor-
mation and Communication Technology Bavaria” as
well as the support by project management organiza-
tion VDI/VDE Innovation + Technik GmbH.
REFERENCES
Conway, M. E. (1968). How do committees invent. Data-
mation, 14(4):28–31.
Doherty, P., Haslum, P., Heintz, F., Merz, T., Nyblom, P.,
Persson, T., and Wingman, B. (2004). A distributed
architecture for autonomous unmanned aerial vehicle
experimentation. In Distributed Autonomous Robotic
Systems 6, pages 233–242. Springer.
Mohallel, A. A., Bass, J. M., and Dehghantaha, A. (2016).
Experimenting with docker: Linux container and base
OS attack surfaces. In 2016 International Conference
on Information Society (i-Society), pages 17–21.
Reinhardt, D., Kühnhauser, W., Baumgarten, U., and Ku-
cera, M. (2016). Virtualisierung eingebetteter Echt-
zeitsysteme im Mehrkernbetrieb zur Partitionierung
sicherheitsrelevanter Fahrzeugsoftware. Universitäts-
verlag Ilmenau, Ilmenau. OCLC: 951392623.
VMWare (2013). Wie die Fault Tolerance funktionert.
Weckemann, K. (2014). Domänenübergreifende Anwen-
dungskommunikation im IP-basierten Fahrzeugbord-
netz. PhD thesis, lmu.
Xavier, M. G., Neves, M. V., Rossi, F. D., Ferreto, T. C.,
Lange, T., and De Rose, C. A. F. (2013). Performance
Evaluation of Container-Based Virtualization for High
Performance Computing Environments. pages 233–
240. IEEE.
PEC 2018 - International Conference on Pervasive and Embedded Computing
102