International conference on machine learning, pages
1989–1998. PMLR.
Ishihara, T., Vongkulbhisal, J., Kitani, K. M., and Asakawa,
C. (2017). Beacon-guided structure from motion for
smartphone-based navigation. In 2017 IEEE Win-
ter Conference on Applications of Computer Vision
(WACV), pages 769–777. IEEE.
Kendall, A., Grimes, M., and Cipolla, R. (2015). Posenet:
A convolutional network for real-time 6-dof camera
relocalization. In IEEE international conference on
computer vision, pages 2938–2946.
Long, M., Zhu, H., Wang, J., and Jordan, M. I. (2017). Deep
transfer learning with joint adaptation networks. In
International conference on machine learning, pages
2208–2217. PMLR.
Melekhov, I., Ylioinas, J., Kannala, J., and Rahtu, E.
(2017a). Image-based localization using hourglass
networks. In IEEE International Conference on Com-
puter Vision, pages 879–886.
Melekhov, I., Ylioinas, J., Kannala, J., and Rahtu, E.
(2017b). Relative camera pose estimation using con-
volutional neural networks. In International Confer-
ence on Advanced Concepts for Intelligent Vision Sys-
tems, pages 675–687. Springer.
Orlando, S. A., Furnari, A., and Farinella, G. M. (2020).
Egocentric visitor localization and artwork detection
in cultural sites using synthetic data. Pattern Recogni-
tion Letters, 133:17–24.
Ortis, A., Farinella, G. M., D’Amico, V., Addesso, L., Tor-
risi, G., and Battiato, S. (2017). Organizing egocentric
videos of daily living activities. Pattern Recognition,
72:207–218.
Pasqualino, G., Furnari, A., Signorello, G., and Farinella,
G. M. (2020). Synthetic to real unsupervised domain
adaptation for single-stage artwork recognition in cul-
tural sites. In 2020 25th International Conference on
Pattern Recognition (ICPR). IEEE.
Radwan, N., Valada, A., and Burgard, W. (2018). Vloc-
net++: Deep multitask learning for semantic visual
localization and odometry. IEEE Robotics and Au-
tomation Letters, 3(4):4407–4414.
Ragusa, F., Di Mauro, D., Palermo, A., Furnari, A., and
Farinella, G. M. (2020a). Semantic object segmen-
tation in cultural sites using real and synthetic data.
In 2020 25th International Conference on Pattern
Recognition (ICPR). IEEE.
Ragusa, F., Furnari, A., Battiato, S., Signorello, G., and
Farinella, G. M. (2020b). EGO-CH: Dataset and fun-
damental tasks for visitors behavioral understanding
using egocentric vision. Pattern Recognition Letters,
131:150–157.
Ros, G., Sellart, L., Materzynska, J., Vazquez, D., and
Lopez, A. M. (2016). The synthia dataset: A large
collection of synthetic images for semantic segmenta-
tion of urban scenes. In IEEE conference on computer
vision and pattern recognition, pages 3234–3243.
Rozantsev, A., Salzmann, M., and Fua, P. (2018). Beyond
sharing weights for deep domain adaptation. IEEE
transactions on pattern analysis and machine intelli-
gence, 41(4):801–814.
Saenko, K., Kulis, B., Fritz, M., and Darrell, T. (2010).
Adapting visual category models to new domains. In
European conference on computer vision, pages 213–
226.
Saha, S., Varma, G., and Jawahar, C. (2018). Improved
visual relocalization by discovering anchor points.
arXiv preprint arXiv:1811.04370.
Sattler, T., Havlena, M., Schindler, K., and Pollefeys, M.
(2016). Large-scale location recognition and the ge-
ometric burstiness problem. In IEEE Conference
on Computer Vision and Pattern Recognition, pages
1582–1590.
Savva, M., Kadian, A., Maksymets, O., Zhao, Y., Wijmans,
E., Jain, B., Straub, J., Liu, J., Koltun, V., Malik, J.,
Parikh, D., and Batra, D. (2019). Habitat: A platform
for embodied ai research. In IEEE/CVF International
Conference on Computer Vision (ICCV).
Sch
¨
onberger, J. L. and Frahm, J.-M. (2016). Structure-
from-motion revisited. In Conference on Computer
Vision and Pattern Recognition (CVPR).
Sch
¨
onberger, J. L., Price, T., Sattler, T., Frahm, J.-M., and
Pollefeys, M. (2016a). A vote-and-verify strategy for
fast spatial verification in image retrieval. In Asian
Conference on Computer Vision (ACCV).
Sch
¨
onberger, J. L., Zheng, E., Pollefeys, M., and Frahm,
J.-M. (2016b). Pixelwise view selection for unstruc-
tured multi-view stereo. In European Conference on
Computer Vision (ECCV).
Starner, T., Schiele, B., and Pentland, A. (1998). Visual
contextual awareness in wearable computing. In Di-
gest of Papers. Second International Symposium on
Wearable Computers (Cat. No. 98EX215), pages 50–
57. IEEE.
Taira, H., Okutomi, M., Sattler, T., Cimpoi, M., Pollefeys,
M., Sivic, J., Pajdla, T., and Torii, A. (2018). Inloc: In-
door visual localization with dense matching and view
synthesis. In IEEE Conference on Computer Vision
and Pattern Recognition, pages 7199–7209.
Torii, A., Arandjelovic, R., Sivic, J., Okutomi, M., and Pa-
jdla, T. (2015). 24/7 place recognition by view syn-
thesis. In IEEE Conference on Computer Vision and
Pattern Recognition, pages 1808–1817.
Tzeng, E., Hoffman, J., Saenko, K., and Darrell, T. (2017).
Adversarial discriminative domain adaptation. In
IEEE conference on computer vision and pattern
recognition, pages 7167–7176.
Weyand, T., Kostrikov, I., and Philbin, J. (2016). Planet-
photo geolocation with convolutional neural net-
works. In European Conference on Computer Vision,
pages 37–55. Springer.
Zeiler, M. D., Krishnan, D., Taylor, G. W., and Fergus, R.
(2010). Deconvolutional networks. In 2010 IEEE
Computer Society Conference on computer vision and
pattern recognition, pages 2528–2535. IEEE.
Zhu, J.-Y., Park, T., Isola, P., and Efros, A. A. (2017).
Unpaired image-to-image translation using cycle-
consistent adversarial networks. In IEEE international
conference on computer vision, pages 2223–2232.
Unsupervised Domain Adaptation for 6DOF Indoor Localization
961