AI and NLP to optimize information management,
the use of data mining and text mining techniques to
improve data extraction and management, and the ap-
plication of advanced technologies such as deep neu-
ral networks to understand and analyse complex
online opinions. Furthermore, the temporal view
highlights a progression in research focus over the
years, with an increase in attention towards advanced
data analysis and the application of these innovative
technologies.
From an academic perspective, the identification
of emerging research trends is one of the most rele-
vant contributions. By analysing the temporal distri-
bution of keywords, the evolution of topics of interest
over time can be identified, providing academics with
a clear overview of emerging research directions.
Furthermore, the analysis provides insights into the
intersections between different disciplines and re-
search fields.
From a managerial perspective, this study pro-
vides greater clarity of key trends and themes in the
field of web scraping and AI-based models, which
can inform strategic decisions regarding investments
in technologies to understand the online data and op-
timise their extraction and analysis.
However, this research has some limitations. The
interpretation of the results could be affected by the
subjectivity of the observers and their previous
knowledge in the field, introducing potential biases
into the analysis. Finally, bibliometric analysis may
not fully capture the dynamism and complexity of the
context such as socio-cultural and political factors
that could influence research trends. Future research
should integrate theoretical and empirical approaches
to obtain a more complete and in-depth understanding
of the dynamics and challenges in the field of web
scraping combined with AI-based models, with the
aim of contributing to the development of innovative
and impactful solutions.
REFERENCES
Afandizadeh, S., Sharifi, D., Kalantari, N., & Mirzahossein,
H. (2023). Using machine learning methods to predict
electric vehicles penetration in the automotive market.
Scientific Reports, 13(1), 8345. https://doi.org/10.10
38/s41598-023-35366-3
Arjunan, T. (2022). Building Business Intelligence Data
Extractor using NLP and Python. International Journal
for Research in Applied Science & Engineering Tech-
nology (IJRASET), 10(X).
Arslan, M., & Cruz, C. (2022). Semantic taxonomy enrich-
ment to improve business text classification for dy-
namic environments. 2022 International Conference on
INnovations in Intelligent SysTems and Applications
(INISTA), 1–6. https://doi.org/10.1109/INISTA553
18.2022.9894173
Babu, S., Pragathi, B. S., Chinthala, U., & Maheshwaram,
S. (2020). Subject Tracking with Camera Movement
Using Single Board Computer. Proceedings of 2020
IEEE-HYDCON International Conference on Engi-
neering in the 4th Industrial Revolution, HYDCON
2020, 0–5. https://doi.org/10.1109/HYDCON48903.20
20.9242811
Barbera, G., Araujo, L., & Fernandes, S. (2023). The Value
of Web Data Scraping: An Application to TripAdvisor.
Big Data and Cognitive Computing, 7(3), 121.
https://doi.org/10.3390/bdcc7030121
Bisconti, C., Corallo, A., Fortunato, L., & Spennato, A.
(2019). Influence parameters correlation in a Twitter
event network. Entrepreneurship and Small Business.
Chan, L., Hogaboam, L., & Cao, R. (2022). Artificial Intel-
ligence for Business. https://doi.org/10.1007/978-3-
031-05740-3_1
Corallo, A., Del Vecchio, V., Lezzi, M., & Morciano, P.
(2021). Shop floor digital twin in smart manufacturing:
A systematic literature review. Sustainability
(Switzerland), 13(23). https://doi.org/10.3390/su132
312987
Corallo, A., Fortunato, L., Spennato, A., Errico, F., &
Pedone, A. (2020). Predicting the Consumer’s Purchase
Intention of Food Products. 2020 9th International Con-
ference on Industrial Technology and Management
(ICITM), 181–185. https://doi.org/10.1109/ICITM489
82.2020.9080404
De Mauro, A., Greco, M., Grimaldi, M., & Ritala, P.
(2018). Human resources for Big Data professions: A
systematic classification of job roles and required skill
sets. Information Processing and Management, 54(5),
807–817. https://doi.org/10.1016/j.ipm.2017.05.004
Del Vecchio, V., Lazoi, M., & Lezzi, M. (2023). Digital
Twin and Extended Reality in Industrial Contexts: A
Bibliometric Review. In Lecture Notes in Computer
Science (including subseries Lecture Notes in Artificial
Intelligence and Lecture Notes in Bioinformatics): Vol.
14218 LNCS. Springer Nature Switzerland.
https://doi.org/10.1007/978-3-031-43401-3_18
Donthu, N., Kumar, S., Mukherjee, D., Pandey, N., & Lim,
W. M. (2021). How to conduct a bibliometric analysis:
An overview and guidelines. Journal of Business Re-
search, 133(April), 285–296. https://doi.org/
10.1016/j.jbusres.2021.04.070
Eberendu, A. C. (2016). Unstructured Data: An overview
of the data of Big Data. International Journal of Com-
puter Trends and Technology, 38(1), 46–50.
https://doi.org/10.14445/22312803/ijctt-v38p109
Hao, Z., Zhang, B., Mao, D., Yen, J., Zhao, Z., Zuo, M., Li,
H., & Xu, C.-Z. (2023). A novel method using LSTM-
RNN to generate smart contracts code templates for im-
proved usability. Multimedia Tools and Applications,
82(27), 41669–41699. https://doi.org/ 10.1007/s11042-
023-14592-x
Jorge, O., Pons, A., Rius, J., Vintró, C., Mateo, J., & Vi-
laplana, J. (2020). Increasing online shop revenues with