
However, the issue of robustness and generalization
ability to handle OOD entities deserves further in-
vestigation. Here we provide two examples for two
classification tasks, where the models fail to general-
ize well to OOD samples. We propose robust train-
ing techniques where we introduce OOD entities dur-
ing fitting, and train multi-task models. We find that
the performance with OOD examples increases. Fu-
ture directions include examining other OOD scenar-
ios and making comparisons with different adversar-
ial training settings.
ACKNOWLEDGEMENTS
The authors were supported by the
´
UNKP-23-4 New
National Excellence Program of the Ministry for Cul-
ture and Innovation from the source of the National
Research, Development and Innovation fund. Both
authors contributed equally. GP was also supported
by Project no TKP2021-NVA-09, implemented with
the support provided by the Ministry of Culture and
Innovation of Hungary from the National Research,
Development and Innovation Fund, financed under
the TKP2021-NVA funding scheme.
REFERENCES
Alipanahi, B., Delong, A., Weirauch, M. T., and Frey, B. J.
(2015). Predicting the sequence specificities of dna-
and rna-binding proteins by deep learning. Nature
Biotechnology, 33(8):831–838.
Chiu, T.-P., Rao, S., and Rohs, R. (2023). Physicochemi-
cal models of protein–dna binding with standard and
modified base pairs. Proceedings of the National
Academy of Sciences, 120(4):e2205796120.
Geirhos, R., Rubisch, P., Michaelis, C., Bethge, M., Wich-
mann, F. A., and Brendel, W. (2019). Imagenet-
trained CNNs are biased towards texture; increasing
shape bias improves accuracy and robustness. In In-
ternational Conference on Learning Representations.
Han, K., Shen, L.-C., Zhu, Y.-H., Xu, J., Song, J., and Yu,
D.-J. (2021). MAResNet: predicting transcription fac-
tor binding sites by combining multi-scale bottom-up
and top-down attention and residual network. Brief-
ings in Bioinformatics, 23(1):bbab445.
Hassanzadeh, H. and Wang, M. D. (2016). Deeperbind:
Enhancing prediction of sequence specificities of dna
binding proteins. In 2016 IEEE International Con-
ference on Bioinformatics and Biomedicine (BIBM),
pages 178–183, Los Alamitos, CA, USA. IEEE Com-
puter Society.
Karimzadeh, M. and Hoffman, M. M. (2022). Virtual
chip-seq: predicting transcription factor binding by
learning from the transcriptome. Genome Biology,
23(1):126.
Koo, P. K. and Eddy, S. R. (2019). Representation learning
of genomic sequence motifs with convolutional neural
networks. PLOS Computational Biology, 15(12):1–
17.
Lambert, M., Jambon, S., Depauw, S., and David-
Cordonnier, M.-H. (2018). Targeting transcription
factors for cancer treatment. Molecules, 23(6).
Lee, H., Ozbulak, U., Park, H., Depuydt, S., De Neve, W.,
and Vankerschaver, J. (2024). Assessing the reliabil-
ity of point mutation as data augmentation for deep
learning with genomic data. BMC Bioinformatics,
25(1):170.
Loshchilov, I. and Hutter, F. (2017). SGDR: Stochastic
gradient descent with warm restarts. In International
Conference on Learning Representations.
Pap., G. and Megyeri., I. (2022). Translational robustness of
neural networks trained for transcription factor bind-
ing site classification. In Proceedings of the 14th In-
ternational Conference on Agents and Artificial Intel-
ligence - Volume 3: ICAART, pages 39–45. INSTICC,
SciTePress.
Park, S., Koh, Y., Jeon, H., Kim, H., Yeo, Y., and Kang, J.
(2020). Enhancing the interpretability of transcription
factor binding site prediction using attention mecha-
nism. Scientific Reports, 10(1):13413.
Qin, Q. and Feng, J. (2017). Imputation for transcrip-
tion factor binding predictions based on deep learning.
PLOS Computational Biology, 13(2):1–20.
Schreiber, J., Singh, R., Bilmes, J., and Noble, W. S. (2020).
A pitfall for machine learning methods aiming to pre-
dict across cell types. Genome Biology, 21(1):282.
Shen, L.-C., Liu, Y., Song, J., and Yu, D.-J. (2021). SARes-
Net: self-attention residual network for predicting
DNA-protein binding. Briefings in Bioinformatics,
22(5):bbab101.
Smith, L. N. (2017). Cyclical learning rates for training
neural networks. In 2017 IEEE Winter Conference on
Applications of Computer Vision (WACV), pages 464–
472.
Vishnoi, K., Viswakarma, N., Rana, A., and Rana, B.
(2020). Transcription factors in cancer development
and therapy. Cancers (Basel), 12(8).
Wang, Z., Xiong, S., Yu, Y., Zhou, J., and Zhang, Y. (2023).
HAMPLE: deciphering TF-DNA binding mechanism
in different cellular environments by characterizing
higher-order nucleotide dependency. Bioinformatics,
39(5):btad299.
Wu, X., Hou, W., Zhao, Z., Huang, L., Sheng, N., Yang,
Q., Zhang, S., and Wang, Y. (2024). Mmgat: a graph
attention network framework for atac-seq motifs find-
ing. BMC Bioinformatics, 25(1):158.
Zagoruyko, S. and Komodakis, N. (2016). Wide residual
networks. In Wilson, R. C., Hancock, E. R., and
Smith, W. A. P., editors, Proceedings of the British
Machine Vision Conference 2016, BMVC 2016, York,
UK, September 19-22, 2016. BMVA Press.
Zeng, H., Edwards, M., Liu, G., and Gifford, D. (2016).
Convolutional neural network architectures for pre-
dicting dna–protein binding. Bioinformatics, 32:i121–
i127.
Zhou, J. and Troyanskaya, O. G. (2015). Predicting ef-
fects of noncoding variants with deep learning–based
sequence model. Nature Methods, 12(10):931–934.
Towards More Robust Transcription Factor Binding Site Classifiers Using Out-of-Distribution Data
47