tial of the presented methodology. The most impor-
tant step would consist of creating larger and more
diversified datasets. It would then be possible to
train two models with different architectures in
parallel: one specialized in multi-pose controlled
face photographs and a second for uncontrolled
images or for images coming from a predefined type of
capture device. It was shown here that network
training can be set up in such a way that the embeddings
in both models are forced to converge to the same set of proxy

This work was supported by NCBiR grant
DOB-BIO7/18/02/2015. The computations in this paper
would not have been possible without the support of NVIDIA
Corporation, which donated the GPU to the author.
