domain-independent data for Semantic Textual Sim-
ilarity task. We unify the task into four main layers
of processing to exploit the semantic similarity in-
formation from different presentation levels (lexical,
string, syntactic, alignment) to overcome the variance
of system’s performance on data derived from various
sources. Our framework is implemented and eval-
uated on all STS datasets and consistently achieves
either state of the art or near state-of-the-art perfor-
mance in regard to the top three best systems in every
STS competition from 2012 to 2015.
REFERENCES
Agirre, E., Banea, C., Cardie, C., Cer, D., Diab, M.,
Gonzalez-Agirre, A., Guo, W., Lopez-Gazpio, I.,
Maritxalar, M., Mihalcea, R., Rigau, G., Uria, L., and
Wiebe, J. (2015). SemEval-2015 Task 2: Semantic
Textual Similarity, English, Spanish and Pilot on In-
terpretability. In Proceedings of the 9th International
Workshop on Semantic Evaluation (SemEval 2015),
Denver, CO. Association for Computational Linguis-
tics.
Agirre, E., Baneab, C., Cardiec, C., Cerd, D., Diabe,
M., Gonzalez-Agirrea, A., Guof, W., Mihalceab, R.,
Rigaua, G., and Wiebeg, J. (2014). Semeval-2014 task
10: Multilingual semantic textual similarity. SemEval
2014, page 81.
Agirre, E., Cer, D., Diab, M., Gonzalez-Agirre, A., and
Guo, W. (2013). sem 2013 shared task: Semantic tex-
tual similarity, including a pilot on typed-similarity. In
In* SEM 2013: The Second Joint Conference on Lex-
ical and Computational Semantics. Association for
Computational Linguistics. Citeseer.
Agirre, E., Diab, M., Cer, D., and Gonzalez-Agirre, A.
(2012). Semeval-2012 task 6: A pilot on semantic
textual similarity. In Proceedings of the First Joint
Conference on Lexical and Computational Semantics-
Volume 1: Proceedings of the main conference and
the shared task, and Volume 2: Proceedings of the
Sixth International Workshop on Semantic Evalua-
tion, pages 385–393. Association for Computational
Linguistics.
Allison, L. and Dix, T. I. (1986). A bit-string longest-
common-subsequence algorithm. Information Pro-
cessing Letters, 23(5):305–310.
Banerjee, S. and Lavie, A. (2005). Meteor: An automatic
metric for mt evaluation with improved correlation
with human judgments. In Proceedings of the ACL
Workshop on Intrinsic and Extrinsic Evaluation Mea-
sures for Machine Translation and/or Summarization,
pages 65–72.
Bär, D., Biemann, C., Gurevych, I., and Zesch, T. (2012).
Ukp: Computing semantic textual similarity by com-
bining multiple content similarity measures. In Pro-
ceedings of the First Joint Conference on Lexical and
Computational Semantics-Volume 1: Proceedings of
the main conference and the shared task, and Volume
2: Proceedings of the Sixth International Workshop
on Semantic Evaluation, pages 435–440. Association
for Computational Linguistics.
Baroni, M., Dinu, G., and Kruszewski, G. (2014). Don’t
count, predict! a systematic comparison of context-
counting vs. context-predicting semantic vectors. In
ACL (1), pages 238–247.
Barrón-Cedeno, A., Rosso, P., Agirre, E., and Labaka, G.
(2010). Plagiarism detection across distant language
pairs. In Proceedings of the 23rd International Con-
ference on Computational Linguistics, pages 37–45.
Association for Computational Linguistics.
Berant, J., Dagan, I., and Goldberger, J. (2012). Learn-
ing entailment relations by global graph structure op-
timization. Computational Linguistics, 38(1):73–111.
Broder, A. Z. (1997). On the resemblance and containment
of documents. In Compression and Complexity of Se-
quences 1997. Proceedings, pages 21–29. IEEE.
Budanitsky, A. and Hirst, G. (2006). Evaluating wordnet-
based measures of lexical semantic relatedness. Com-
put. Linguist., 32(1):13–47.
Fellbaum, C. (1998). WordNet. Wiley Online Library.
Gabrilovich, E. and Markovitch, S. (2007). Computing se-
mantic relatedness using wikipedia-based explicit se-
mantic analysis. In IJCAI, volume 7, pages 1606–
1611.
Galitsky, B. (2013). Machine learning of syntactic parse
trees for search and classification of text. Engineering
Applications of Artificial Intelligence, 26(3):1072–
1091.
Glickman, O. and Dagan, I. (2004). Acquiring lexical para-
phrases from a single corpus. Recent Advances in Nat-
ural Language Processing III. John Benjamins Pub-
lishing, Amsterdam, Netherlands, pages 81–90.
Guo, W. and Diab, M. (2012). Modeling sentences in the
latent space. In Proceedings of the 50th Annual Meet-
ing of the Association for Computational Linguistics:
Long Papers-Volume 1, pages 864–872. Association
for Computational Linguistics.
Gusfield, D. (1997). Algorithms on strings, trees and se-
quences: computer science and computational biol-
ogy. Cambridge University Press.
Han, L., Kashyap, A., Finin, T., Mayfield, J., and Weese, J.
(2013). Umbc ebiquity-core: Semantic textual simi-
larity systems. In In* SEM 2013: The Second Joint
Conference on Lexical and Computational Semantics.
Association for Computational Linguistics.
Han, L., Martineau, J., Cheng, D., and Thomas, C. (2015).
Samsung: Align-and-differentiate approach to seman-
tic textual similarity. SemEval-2015, page 172.
Hänig, C., Remus, R., and De La Puente, X. (2015). Exb
themis: Extensive feature extraction from word align-
ments for semantic textual similarity. SemEval-2015,
page 264.
Harris, Z. S. (1968). Mathematical structures of language.
Interscience Publishers.
Hirst, G. and St-Onge, D. (1998). Lexical chains as rep-
resentations of context for the detection and correc-
tion of malapropisms. WordNet: An electronic lexical
database, 305:305–332.
A Multi-Layer System for Semantic Textual Similarity
65