ver, in our approach, we use a MLP neural network,
which was trained based on a metamodels elements
set, where all elements are well known and the en-
coding schem is simple. The work in (Zhang and
Chen, 2018) deal with the link prediction problem
in network-structured data, it presents link prediction
based on graph neural network, where it proposes a
new method to learn heuristics from local subgraphs
using a graph neural network (GNN). A document or
a model could be encoded as a graph, but there is no
specific treatment for the metamodel elements. An in-
tegration of these approaches with our solution could
improve the capabilities of the classifier.
5 CONCLUSIONS
We presented an approach for classifying JSON docu-
ments into existing metamodels. The solution enables
discovering the domain of the JSON documents and
to serve as an initial typing scheme. We present the
automated steps of the approach, consisting on meta-
model extraction into an MLP using a one-hot encod-
ing (OHE) of the elements, network training, transla-
tion and classification of the input JSON documents.
The extraction algorithm relies on the presence (or
not) of the elements in a given input document, since
it translated the elements into a binary classification
problem. The results have showed that the approach
is effective from classifying JSON documents, with
precision varying from 46 to 97 percent, depending
on the kinds of the elements. We achieved our main
goal to show that a domain-specific and simple ex-
traction algorithm can be useful for classifying docu-
ments, instead of trying to adapt more complex struc-
tured based classification approaches. The results are
publicly available for download, as well as the algo-
rithms implemented.
There are several open issues subject for future
work, such as testing the extraction algorithm output
with other classification algorithms. We also plan to
extend the algorithm to cover more complex relation-
ships between model elements and to test if the results
can be improved.
REFERENCES
Armbrust, M., Das, T., Davidson, A., Ghodsi, A., Or, A.,
Rosen, J., Stoica, I., Wendell, P., Xin, R., and Za-
haria, M. (2015a). Scaling spark in the real world:
Performance and usability. Proc. VLDB Endow.,
8(12):1840–1843.
Armbrust, M., Xin, R. S., Lian, C., Huai, Y., Liu, D.,
Bradley, J. K., Meng, X., Kaftan, T., Franklin, M. J.,
Ghodsi, A., and Zaharia, M. (2015b). Spark sql:
Relational data processing in spark. In Proceedings
of the 2015 ACM SIGMOD International Conference
on Management of Data, SIGMOD ’15, pages 1383–
1394, New York, NY, USA. ACM.
Basciani, F., Di Rocco, J., Di Ruscio, D., Iovino, L., and
Pierantonio, A. (2016). Automated clustering of meta-
model repositories. In Nurcan, S., Soffer, P., Bajec,
M., and Eder, J., editors, Advanced Information Sys-
tems Engineering, pages 342–358, Cham. Springer In-
ternational Publishing.
Burgue
˜
no, L. (2019). An lstm-based neural network archi-
tecture for model transformations. In IEEE/ACM 22nd
International Conference on Model Driven Engineer-
ing Languages and Systems (MODELS).
Cabot, J., Claris
´
o, R., Brambilla, M., and G
´
erard, S. (2017).
Cognifying model-driven software engineering. In
Seidl, M. and Zschaler, S., editors, STAF Workshops,
volume 10748 of Lecture Notes in Computer Science,
pages 154–160. Springer.
Chang, K.-W., Upadhyay, S., Kundu, G., and Roth, D.
(2015). Structural learning with amortized inference.
In Proceedings of the Twenty-Ninth AAAI Confer-
ence on Artificial Intelligence, AAAI’15, pages 2525–
2531. AAAI Press.
Chen, X., Liu, C., and Song, D. (2017). Learning neural
programs to parse programs.
Kumari, G. V., Rao, G. S., and Rao, B. P. (2018). Lm, rp
and gd based ann architecture models for biomedical
image compression. i-manager’s Journal on Image
Processing, 5(3).
Nguyen, P., Di Rocco, J., Di Ruscio, D., Pierantonio,
A., and Iovino, L. (2019). Automated classification
of metamodel repositories: A machine learning ap-
proach. In IEEE/ACM 22nd International Conference
on Model Driven Engineering Languages and Systems
(MODELS).
Perini, A., Susi, A., and Avesani, P. (2013). A machine
learning approach to software requirements prioritiza-
tion. IEEE Trans. Softw. Eng., 39(4):445–461.
Xie, T. (2018). Intelligent software engineering: Synergy
between ai and software engineering. In Feng, X.,
M
¨
uller-Olm, M., and Yang, Z., editors, Dependable
Software Engineering. Theories, Tools, and Applica-
tions, pages 3–7, Cham. Springer International Pub-
lishing.
Zhang, M. and Chen, Y. (2018). Link prediction based on
graph neural networks. In Bengio, S., Wallach, H.,
Larochelle, H., Grauman, K., Cesa-Bianchi, N., and
Garnett, R., editors, Advances in Neural Information
Processing Systems 31, pages 5165–5175. Curran As-
sociates, Inc.
MODELSWARD 2020 - 8th International Conference on Model-Driven Engineering and Software Development
278