Measuring UML Model Similarity

Jie Su, Junpeng Bao


Many user requirements and UML models are similar or even identical, although their application backgrounds differ. A straightforward and feasible way to improve software development efficiency is to mine similar UML models from a model warehouse and reuse them. The key to this idea is measuring the similarity of UML models. We present a Level Edit Distance (LED) method to solve this problem. The LED measures the similarity of XML structures rather than of UML models directly: UML models are converted to XML documents according to XMI, so that UML model similarity reduces to XML document similarity. Our method concentrates on the purely structural similarity of UML models in XMI format; that is, semantic information is ignored. The LED differs from the traditional Edit Distance (ED): the former needs only one primitive operation, whereas the latter needs three. Our preliminary experimental results show that the LED yields almost the same distance distribution as the traditional ED and is slightly faster. We plan to improve the capability of the LED and combine it with a semantics-aware method in order to evaluate the similarity of user requirements precisely.
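The structural comparison described above can be illustrated with a minimal sketch of the traditional baseline only. The abstract does not specify the LED's single primitive operation, so the code below is not the LED. It assumes the classic three-operation edit distance (insert, delete, substitute), applied level by level to the element tags of two XML documents. The function names, the level-wise summation, and the simplified tag names (real XMI uses namespaced tags) are all assumptions made for illustration.

```python
import xml.etree.ElementTree as ET


def tags_by_level(xml_text):
    """Return a list whose i-th item is the list of element tags at depth i."""
    root = ET.fromstring(xml_text)
    levels = []
    frontier = [root]
    while frontier:
        levels.append([node.tag for node in frontier])
        # Iterating an Element yields its direct children in document order.
        frontier = [child for node in frontier for child in node]
    return levels


def edit_distance(a, b):
    """Classic edit distance over two sequences (insert, delete, substitute)."""
    prev = list(range(len(b) + 1))
    for i, x in enumerate(a, start=1):
        curr = [i]
        for j, y in enumerate(b, start=1):
            curr.append(min(
                prev[j] + 1,             # delete x
                curr[j - 1] + 1,         # insert y
                prev[j - 1] + (x != y),  # substitute (free if tags match)
            ))
        prev = curr
    return prev[-1]


def levelwise_distance(xml_a, xml_b):
    """Sum per-level edit distances (an assumed aggregation, not the LED)."""
    la, lb = tags_by_level(xml_a), tags_by_level(xml_b)
    depth = max(len(la), len(lb))
    la = la + [[]] * (depth - len(la))  # pad the shallower tree with empty levels
    lb = lb + [[]] * (depth - len(lb))
    return sum(edit_distance(x, y) for x, y in zip(la, lb))


# Hypothetical, heavily simplified stand-ins for XMI exports of two UML models.
MODEL_A = "<Model><Class><Attribute/><Operation/></Class><Class/></Model>"
MODEL_B = "<Model><Class><Attribute/></Class><Interface/></Model>"

if __name__ == "__main__":
    print(levelwise_distance(MODEL_A, MODEL_B))
```

Only tag names and nesting depth enter the comparison, which mirrors the purely structural view taken in the abstract: attribute values and text content are ignored.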



Paper Citation

in Harvard Style

Su J. and Bao J. (2012). Measuring UML Model Similarity. In Proceedings of the 7th International Conference on Software Paradigm Trends - Volume 1: ICSOFT, ISBN 978-989-8565-19-8, pages 319-323. DOI: 10.5220/0004027303190323

in Bibtex Style

author={Jie Su and Junpeng Bao},
title={Measuring UML Model Similarity},
booktitle={Proceedings of the 7th International Conference on Software Paradigm Trends - Volume 1: ICSOFT},

in EndNote Style

JO - Proceedings of the 7th International Conference on Software Paradigm Trends - Volume 1: ICSOFT
TI - Measuring UML Model Similarity
SN - 978-989-8565-19-8
AU - Su J.
AU - Bao J.
PY - 2012
SP - 319
EP - 323
DO - 10.5220/0004027303190323