(1a) Unity3D Relay
(1b) Edge Relay
(2a) Unity3D Relay
(2b) Edge Relay
Figure 9: Examples of a moving person reconstructed using
(1a,2a) the Unity3D Relay (Unity3D Multiplayer Service,
2019) and (1b,2b) our Edge Relay. Case 1: When the object
is still we can see that results (i.e. density of triangulated 3D
points) using Unity3D Relay and Edge Relay are comparable.
Case 2: When the object moves, synchronisation is key to
achieve accurate 3D triangulation, and using the Unity3D
Relay leads to sparser reconstructions.
to receive capture frames via HTTP requests. Synchro-
nisation triggers are generated by a host, rather than
by a system timer, to enable a motion-based, adaptive
sampling-rate, fostering reduced data throughput. Al-
though the creation of high-quality FVVs was not the
scope of this work, we succeeded to show the benefit of
our decentralised data capturing system using a state-
of-the-art 3D reconstruction algorithm (i.e. COLMAP)
and by implementing the assessment of end-to-end
capture delays though OCR.
Future research directions include the integration
of a volumetric 4D reconstruction algorithm that can
be executed in real-time on the edge to providing tele-
presence functionality together with the integration
of temporal filtering of 3D reconstructed points to
provide more stable volumetric videos. We also aim
to improve reconstruction accuracy by postprocessing
ARCore’s pose estimates. By the end of this year we
will deploy our system on a 5G network and carry out
the first FVV production in uncontrolled environments
using off-the-shelf mobiles.
REFERENCES
Amazon Textract (Accessed: Dec 2019). https://aws.amazon.
com/textract/.
ARCore (Accessed: Dec 2019). https://developers.google.
com/ar.
ARCore Anchors (Accessed: Dec 2019). https://developers.
google.com/ar/develop/developer-guides/anchors.
Bastug, E., Bennis, M., Medard, M., and Debbah, M. (2017).
Toward Interconnected Virtual Reality: Opportunities,
Challenges, and Enablers. IEEE Communications Mag-
azine, 55(6):110–117.
Belshe, M., Peon, R., and Thomson, M. (2015). Hypertext
Transfer Protocol Version 2. RFC 7540.
Berners-Lee, T., Fielding, R., and Nielsen, H. (1996). Hy-
pertext Transfer Protocol Version 1. RFC 1945.
Cadena, C., Carlone, L., Carrillo, H., Latif, Y., Scaramuzza,
D., Neira, J., Reid, I., and Leonard, J. (2016). Past,
Present, and Future of Simultaneous Localization And
Mapping: Towards the Robust-Perception Age. IEEE
Trans. on Robotics, 32(6):1309–1332.
Chen, Y.-H., Balakrishnan, H., Ravindranath, L., and Bahl,
P. (2016). GLIMPSE: Continuous, Real-Time Object
Recognition on Mobile Devices. GetMobile: Mobile
Computing and Communications, 20(1):26–29.
Elbamby, M., Perfecto, C., Bennis, M., and Doppler, K.
(2018). Toward Low-Latency and Ultra-Reliable Vir-
tual Reality. IEEE Network, 32(2):78–84.
Garg, A., Yadav, A., Sikora, A., and Sairam, A. (2018). Wire-
less Precision Time Protocol. IEEE Communication
Letters, 22(4):812–815.
Guillemaut, J.-Y. and Hilton, A. (2011). Joint Multi-Layer
Segmentation and Reconstruction for Free-Viewpoint
Video Applications. International Journal on Com-
puter Vision, 93(1):73–100.
Hu, Y., Niu, D., and Li, Z. (2016). A Geometric Approach
to Server Selection for Interactive Video Streaming.
IEEE Trans. on Multimedia, 18(5):840–851.
Huang, C.-H., Boyer, E., Navab, N., and Ilic, S. (2014). Hu-
man Shape and Pose Tracking Using Keyframes. In
Proc. of IEEE Computer Vision and Pattern Recogni-
tion, Columbus, US.
Jiang, D. and Liu, G. (2017). An Overview of 5G Require-
ments. In Xiang, W., Zheng, K., and Shen, X., editors,
5G Mobile Communications. Springer.
Kim, H., Guillemaut, J.-Y., Takai, T., Sarim, M., and Hilton,
A. (2012). Outdoor Dynamic 3D Scene Reconstruc-
tion. IEEE Trans. on Circuits and Systems for Video
Technology, 22(11):1611–1622.
Knapitsch, A., Park, J., Zhou, Q.-Y., and Koltun, V. (2017).
Tanks and Temples: Benchmarking Large-Scale Scene
Reconstruction. ACM Transactions on Graphics,
36(4):1–13.
Latimer, R., Holloway, J., Veeraraghavan, A., and Sabharwal,
A. (2015). SocialSync: Sub-Frame Synchronization in
a Smartphone Camera Network. In Proc. of European
Conference on Computer Vision Workshops, Zurich,
CH.
MLAPI (Accessed: Dec 2019). https://midlevel.github.io/
MLAPI.
MLAPI Configuration (Accessed: Dec 2019). https://github.
com/MidLevel/MLAPI.Relay.
MLAPI Messaging System (Accessed: Dec 2019). https:
//mlapi.network/wiki/ways-to-syncronize/.
Mur-Artal, R., Montiel, J., and Tard
´
os, J. (2015). ORB-
SLAM: a versatile and accurate monocular SLAM sys-
tem. IEEE Trans. on Robotics, 31(5):1147–1163.
Mustafa, A. and Hilton, A. (2017). Semantically Coher-
ent Co-Segmentation and Reconstruction of Dynamic
Scenes. In Proc. of IEEE Computer Vision and Pattern
Recognition, Honolulu, US.
Multi-view Data Capture using Edge-synchronised Mobiles
739