
Figure 2: Comparison of Qualitative Results: Original In-
fraGAN algorithm vs. our enhanced approach. The rows of
the Figure showcase various scenes from the FLIR dataset,
differing in exposure and content.
hyperparameter optimization with the Optuna frame-
work, we refined the trade-off between the loss com-
ponents, significantly enhancing InfraGAN’s perfor-
mance and establishing a framework for further ex-
periments.
Our results indicate that these modifications im-
prove InfraGAN’s ability to generate high-fidelity
thermal images with more accurate detail and struc-
tural consistency. This approach demonstrates the ef-
fectiveness of advanced loss configurations in domain
transfer tasks, contributing valuable insights to the
field of image synthesis and domain translation. Fu-
ture work could extend this methodology to other do-
mains and explore additional optimization techniques
for further performance gains.
A primary direction for extending this work is to
test the methods on additional datasets, such as the
Vis-TH dataset for facial expressions (introduced in
(Mallat and Dugelay, 2018)). Evaluating the approach
on a broader range of data will enhance its generaliz-
ability and robustness. Another critical avenue is the
modification of the DFT loss. In its current state, the
DFT loss behaves similarly to the L
2
norm. Introduc-
ing a filter in the DFT loss into a more distinct and
potentially effective metric, warranting further explo-
ration. Hyperparameter optimization presents oppor-
tunities for deeper investigation. A key question is
whether the optimal hyperparameters differ signifi-
cantly between datasets or exhibit consistent patterns.
Additionally, iterative refinement of hyperparameters
should be performed by re-optimizing for each hyper-
parameter. Furthermore, adopting an analytical ap-
proach could further constrain the search space by
leveraging inherent relationships, such as the connec-
tion between the style loss and perceptual loss. A
detailed analysis of the importance of each hyperpa-
rameter is also recommended. Understanding param-
eter importance will inform more targeted and effi-
cient optimization strategies in the future. Finally, al-
ternative accuracy functions beyond LPIPS should be
tested to evaluate the model comprehensively. This
could provide additional insights into its strengths
and areas for improvement. Addressing these recom-
mendations will further refine the methodology and
broaden its applicability, leading to more robust and
versatile outcomes.
REFERENCES
Bulatov, D., Burkard, E., Ilehag, R., Kottler, B., and
Helmholz, P. (2020). From multi-sensor aerial data to
thermal and infrared simulation of semantic 3d mod-
els: Towards identification of urban heat islands. In-
frared Physics & Technology, 105:103233.
Fuoli, D., Gool, L. V., and Timofte, R. (2021). Fourier space
losses for efficient perceptual image super-resolution.
Goodfellow, I. J., Pouget-Abadie, J., Mirza, M., Xu, B.,
Warde-Farley, D., Ozair, S., Courville, A., and Ben-
gio, Y. (2014). Generative adversarial networks.
Isola, P., Zhu, J., Zhou, T., and Efros, A. A. (2016). Image-
to-image translation with conditional adversarial net-
works. CoRR, abs/1611.07004.
Johnson, J., Alahi, A., and Fei-Fei, L. (2016). Perceptual
losses for real-time style transfer and super-resolution.
Kottler, B., Fischer, S., Strauss, E., Bulatov, D., and
Helmholz, P. (2023). Parameter optimization for a
thermal simulation of an urban area. ISPRS Annals
of the Photogrammetry, Remote Sensing and Spatial
Information Sciences, 10:271–278.
Kottler, B., List, L., Bulatov, D., and Weinmann, M. (2022).
3gan: A three-gan-based approach for image inpaint-
ing applied to the reconstruction of occluded parts of
building walls. pages 427–435.
Liu, R., Ge, Y., Choi, C. L., Wang, X., and Li, H. (2021).
Divco: Diverse conditional image synthesis via con-
trastive generative adversarial network. In Proceed-
ings of the IEEE/CVF conference on computer vision
and pattern recognition, pages 16377–16386.
L
´
opez-Rey, A., Ram
´
on, A., and Ad
´
an, A. (2023). Hard-
ware/software solutions for an efficient thermal scan-
ning mobile robot. In ISARC. Proceedings of the In-
ternational Symposium on Automation and Robotics
Exploration and Validation of Specialized Loss Functions for Generative Visual-Thermal Image Domain Transfer
533