
Cai, Z. and Vasconcelos, N. (2018). Cascade R-CNN: Delving
into high quality object detection. In Proceedings of
the IEEE conference on computer vision and pattern
recognition, pages 6154–6162.
Chakraborti, T., Isahagian, V., Khalaf, R., Khazaeni,
Y., Muthusamy, V., Rizk, Y., and Unuvar, M.
(2020). From robotic process automation to intelligent
process automation: Emerging trends. CoRR,
abs/2007.13257.
Chasins, S. E., Mueller, M., and Bodik, R. (2018). Rousillon:
Scraping distributed hierarchical web data. In
Proceedings of the 31st Annual ACM Symposium on
User Interface Software and Technology, pages
963–975.
Deka, B., Huang, Z., Franzen, C., Hibschman, J., Afergan,
D., Li, Y., Nichols, J., and Kumar, R. (2017). Rico:
A mobile app dataset for building data-driven design
applications. In Proceedings of the 30th annual ACM
symposium on user interface software and technology,
pages 845–854.
Dwyer, B. (2022). Website screenshots dataset.
https://universe.roboflow.com/roboflow-gw7yv/website-screenshots.
Han, X., Hu, L., Dang, Y., Agarwal, S., Mei, L., Li, S., and
Zhou, X. (2020). Automatic business process structure
discovery using ordered neurons LSTM: A preliminary
study. CoRR, abs/2001.01243.
He, K., Gkioxari, G., Dollár, P., and Girshick, R. (2017).
Mask R-CNN. In Proceedings of the IEEE international
conference on computer vision, pages 2961–2969.
Hofmann, P., Samp, C., and Urbach, N. (2020). Robotic
process automation. Electronic Markets, 30(1):99–106.
Institute for Robotic Process Automation (2015).
Introduction to robotic process automation: A primer.
https://irpaai.com/wp-content/uploads/2015/05/Robotic-Process-Automation-June2015.pdf.
Ito, N., Suzuki, Y., and Aizawa, A. (2020). From natural
language instructions to complex processes: Issues in
chaining trigger action rules. CoRR, abs/2001.02462.
Jocher, G., Chaurasia, A., and Qiu, J. (2023). YOLO by
Ultralytics. https://github.com/ultralytics/ultralytics.
Accessed: December 1, 2023.
Leiva, L. A., Hota, A., and Oulasvirta, A. (2020). Enrico:
A dataset for topic modeling of mobile UI designs. In
22nd International Conference on Human-Computer
Interaction with Mobile Devices and Services, pages
1–4.
Leno, V., Deviatykh, S., Polyvyanyy, A., Rosa, M. L.,
Dumas, M., and Maggi, F. M. (2020). Robidium:
Automated synthesis of robotic process automation scripts
from UI logs. In Proceedings of the Best Dissertation
Award, Doctoral Consortium, and Demonstration
& Resources Track at BPM 2020 co-located with
the 18th International Conference on Business
Process Management (BPM 2020), Sevilla, Spain, Sept.
13-18, 2020, volume 2673, pages 102–106.
CEUR-WS.org.
Leno, V., Polyvyanyy, A., Dumas, M., Rosa, M. L.,
and Maggi, F. M. (2021). Robotic process mining:
Vision and challenges. Business & Information
Systems Engineering: The International Journal of
WIRTSCHAFTSINFORMATIK, 63(3):301–314.
Leopold, H., van der Aa, H., and Reijers, H. (2018).
Identifying candidate tasks for robotic process automation
in textual process descriptions, pages 67–81.
Li, T. J.-J., Azaria, A., and Myers, B. A. (2017). Sugilite:
Creating multimodal smartphone automation by
demonstration. In Proceedings of the 2017 CHI
Conference on Human Factors in Computing Systems,
CHI ’17, pages 6038–6049, New York, NY, USA.
Association for Computing Machinery.
Lin, T.-Y., Goyal, P., Girshick, R., He, K., and Dollár, P.
(2017). Focal loss for dense object detection. In
Proceedings of the IEEE international conference on
computer vision, pages 2980–2988.
Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S.,
Fu, C.-Y., and Berg, A. C. (2016). SSD: Single shot
multibox detector. In Computer Vision–ECCV 2016:
14th European Conference, Amsterdam, The Netherlands,
October 11–14, 2016, Proceedings, Part I 14,
pages 21–37. Springer.
Rajawat, A. S., Rawat, R., Barhanpurkar, K., Shaw, R. N.,
and Ghosh, A. (2021). Chapter one - robotic process
automation with increasing productivity and improving
product quality using artificial intelligence and
machine learning. pages 1–13.
Redmon, J., Divvala, S., Girshick, R., and Farhadi, A.
(2016). You only look once: Unified, real-time object
detection. In Proceedings of the IEEE conference on
computer vision and pattern recognition, pages
779–788.
Ren, S., He, K., Girshick, R., and Sun, J. (2015). Faster
R-CNN: Towards real-time object detection with region
proposal networks. Advances in neural information
processing systems, 28.
Tkachenko, M., Malyuk, M., Holmanyuk, A., and
Liubimov, N. (2020-2022). Label Studio: Data labeling
software. Open source software available from
https://github.com/heartexlabs/label-studio.
van der Aalst, W. M. P., Bichler, M., and Heinzl, A. (2018).
Robotic process automation. Business & Information
Systems Engineering, 60:269–272.
Zou, Z., Chen, K., Shi, Z., Guo, Y., and Ye, J. (2023).
Object detection in 20 years: A survey. Proceedings of
the IEEE.
UICVD: A Computer Vision UI Dataset for Training RPA Agents