
ness of our method was evaluated on various sizes of
STPP instances, demonstrating improved sample ef-
ficiency and generalization capability, outperforming
heuristic, mathematical optimization, and learning-
based algorithms designed for STPP.
ACKNOWLEDGEMENTS
This work was supported by the National Research
Foundation of Korea(NRF) grant funded by the Korea
government(MSIT) (RS-2024-00410082).
REFERENCES
Cappart, Q., Moisan, T., Rousseau, L.-M., Pr
´
emont-
Schwarz, I., and Cire, A. A. (2021). Combining
reinforcement learning and constraint programming
for combinatorial optimization. In Proceedings of
the AAAI Conference on Artificial Intelligence, vol-
ume 35, pages 3677–3687.
Espeholt, L., Soyer, H., Munos, R., Simonyan, K., Mnih,
V., Ward, T., Doron, Y., Firoiu, V., Harley, T., Dun-
ning, I., Legg, S., and Kavukcuoglu, K. (2018). IM-
PALA: Scalable distributed deep-RL with importance
weighted actor-learner architectures. In Dy, J. and
Krause, A., editors, Proceedings of the 35th Interna-
tional Conference on Machine Learning, volume 80
of Proceedings of Machine Learning Research, pages
1407–1416. PMLR.
Fu, Z.-H., Qiu, K.-B., and Zha, H. (2021). Generalize
a small pre-trained model to arbitrarily large tsp in-
stances. In Proceedings of the AAAI Conference on
Artificial Intelligence, volume 35, pages 7474–7482.
Gasse, M., Ch
´
etelat, D., Ferroni, N., Charlin, L., and Lodi,
A. (2019). Exact combinatorial optimization with
graph convolutional neural networks. Advances in
Neural Information Processing Systems, 32.
Hou, Q., Yang, J., Su, Y., Wang, X., and Deng, Y. (2022).
Generalize learned heuristics to solve large-scale vehi-
cle routing problems in real-time. In The Eleventh In-
ternational Conference on Learning Representations.
Karp, R. M. (1972). Reducibility among combinatorial
problems. In Complexity of computer computations,
pages 85–103. Springer.
Khalil, E., Dai, H., Zhang, Y., Dilkina, B., and Song, L.
(2017). Learning combinatorial optimization algo-
rithms over graphs. In Guyon, I., Luxburg, U. V.,
Bengio, S., Wallach, H., Fergus, R., Vishwanathan,
S., and Garnett, R., editors, Advances in Neural Infor-
mation Processing Systems, volume 30. Curran Asso-
ciates, Inc.
Konda, V. and Tsitsiklis, J. (1999). Actor-critic algorithms.
Advances in neural information processing systems,
12.
Kool, W., van Hoof, H., and Welling, M. (2019). Atten-
tion, learn to solve routing problems! In International
Conference on Learning Representations.
Lee, K., Park, Y., Jeong, H.-S., Yoon, D., Hong, S., Sohn,
S., Kim, M., Ko, H., Lee, M., Lee, H., Kim, K., Kim,
E., Cho, S., Min, J., and Lim, W. (2022). ReSPack: A
large-scale rectilinear steiner tree packing data gener-
ator and benchmark. In NeurIPS 2022 Workshop on
Synthetic Data for Empowering ML Research.
Li, S., Yan, Z., and Wu, C. (2021). Learning to delegate for
large-scale vehicle routing.
Lukes, J. A. (1974). Efficient algorithm for the partitioning
of trees. IBM Journal of Research and Development,
18(3):217–224.
Ma, Y., Hao, X., Hao, J., Lu, J., Liu, X., Xialiang, T., Yuan,
M., Li, Z., Tang, J., and Meng, Z. (2021). A hi-
erarchical reinforcement learning based optimization
framework for large-scale dynamic pickup and deliv-
ery problems. Advances in Neural Information Pro-
cessing Systems, 34:23609–23620.
McMurchie, L. and Ebeling, C. (1995). Pathfinder: A
negotiation-based performance-driven router for fp-
gas. In Proceedings of the 1995 ACM third inter-
national symposium on Field-programmable gate ar-
rays, pages 111–117.
Nowak-Vila, A., Folqu
´
e, D., and Bruna, J. (2018). Divide
and conquer networks.
Perron, L. and Furnon, V. (2022). Or-tools.
Song, J., Lanka, R., Yue, Y., and Dilkina, B. (2020). A gen-
eral large neighborhood search framework for solving
integer linear programs.
Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones,
L., Gomez, A. N., Kaiser, L. u., and Polosukhin,
I. (2017). Attention is all you need. In Guyon,
I., Luxburg, U. V., Bengio, S., Wallach, H., Fer-
gus, R., Vishwanathan, S., and Garnett, R., editors,
Advances in Neural Information Processing Systems,
volume 30. Curran Associates, Inc.
Veli
ˇ
ckovi
´
c, P., Cucurull, G., Casanova, A., Romero, A., Li
`
o,
P., and Bengio, Y. (2018). Graph attention networks.
In International Conference on Learning Representa-
tions.
Wang, R., Hua, Z., Liu, G., Zhang, J., Yan, J., Qi, F., Yang,
S., Zhou, J., and Yang, X. (2021). A bi-level frame-
work for learning to solve combinatorial optimization
on graphs. Advances in Neural Information Process-
ing Systems, 34:21453–21466.
Watts, D. J. and Strogatz, S. H. (1998). Collective dynamics
of ”small-world” networks.
Ye, H., Wang, J., Liang, H., Cao, Z., Li, Y., and Li, F.
(2024). Glop: Learning global partition and local
construction for solving large-scale routing problems
in real-time. In Proceedings of the AAAI Conference
on Artificial Intelligence, volume 38, pages 20284–
20292.
Hierarchical Decomposition Framework for Steiner Tree Packing Problem
173