it proved to be as successful as the available human
one when embedded into speech summarization. An-
other basic contribution comes from the estimation of
the ASR transcription error propagation into subse-
quent text processing, at least in terms of evaluating
the similarity of POS and dependency tag sequences
between human and ASR made transcriptions. Re-
sults showed that POS tags and selectively nouns are
less sensitive to ASR errors (POS tag error rate was
2/3 of WER, whereas nouns get confused by another
part-of-speech even less frequently). Given the high
degree of spontaneity of the speech and also the heavy
agglutinating property of Hungarian, we believe the
obtained results are promising as they are comparable
to results published for other languages. The over-
all best results were 62% recall and 79% precision
(F
1
= 0.68). Subjective rating of the summaries gave
3.2 mean opinion score.
ACKNOWLEDGEMENTS
The authors would like to thank the support of the
Hungarian National Innovation Office (NKFIH) un-
der contract IDs PD-112598 and PD-108762.
REFERENCES
Campr, M. and Jeˇzek, K. (2015). Comparing semantic mod-
els for evaluating automatic document summarization.
In Text, Speech, and Dialogue, pages 252–260.
Christensen, H., Kolluru, B., Gotoh, Y., and Renals, S.
(2004). From text summarisation to style-specific
summarisation for broadcast news. In Advances in In-
formation Retrieval, pages 223–237. Springer.
Green, N. (2011). Dependency parsing. In Proceedings
of the 20th Annual Conference of Doctoral Students:
Part I - Mathematics and Computer Sciences, pages
137–142.
Gurevych, I. and Strube, M. (2004). Semantic similarity ap-
plied to spoken dialogue summarization. In Proceed-
ings of the 20th international conference on Compu-
tational Linguistics, page 764.
Hakkani-T¨ur, D., Bechet, F., Riccardi, G., and T¨ur, G.
(2006). Beyond asr 1-best: using word confusion net-
works in spoken language understanding. Computer
Speech and Language, 20(4):495–514.
Landauer, T. K., Foltz, P. W., and Laham, D. (1998). An in-
troduction to latent semantic analysis. Discourse pro-
cesses, 25(2-3):259–284.
Lin, C.-Y. (2004). Rouge: A package for automatic evalu-
ation of summaries. In Text summarization branches
out: Proc. of the ACL-04 workshop, volume 8.
Liu, Y. and Xie, S. (2008). Impact of automatic sentence
segmentation on meeting summarization. In Proc.
Acoustics, Speech and Signal Processing, ICASSP
2008. IEEE International Conference on, pages 5009–
5012.
Maskey, S. and Hirschberg, J. (2005). Comparing lex-
ical, acoustic/prosodic, structural and discourse fea-
tures for speech summarization. In INTERSPEECH,
pages 621–624.
Maskey, S. and Hirschberg, J. (2006). Summarizing speech
without text using hidden markov models. In Pro-
ceedings of the Human Language Technology Confer-
ence of the NAACL, Companion Volume: Short Pa-
pers, pages 89–92.
Nenkova, A. (2006). Summarization evaluation for text and
speech: issues and approaches. In INTERSPEECH,
pages 1527–1530.
Neuberger, T., Gyarmathy, D., Gr´aczi, T. E., Horv´ath, V.,
G´osy, M., and Beke, A. (2014). Development of
a large spontaneous speech database of agglutinative
hungarian language. In Text, Speech and Dialogue,
pages 424–431.
Sarkar, K. (2012). Bengali text summarization by sen-
tence extraction. In Proc. of International Conference
on Business and Information Management ICBIM12,
pages 233–245.
Shriberg, E., Stolcke, A., Hakkani-T¨ur, D., and T¨ur, G.
(2000). Prosody-based automatic segmentation of
speech into sentences and topics. Speech Communi-
cation, 32(1):127–154.
Szarvas, M., Fegy´o, T., Mihajlik, P., and Tatai, P. (2000).
Automatic recognition of Hungarian: Theory and
practice. Int. Journal of Speech Technology, 3(3):237–
251.
Szasz´ak, G. and Beke, A. (2012). Exploiting prosody
for automatic syntactic phrase boundary detection in
speech. Journal of Language Modeling, 0(1):143–
172.
Tarj´an, B., Fegy´o, T., and Mihajlik, P. (2014). A bilingual
study on the prediction of morph-based improvement.
In Proceedings of the 4th International Workshop on
Spoken Languages Technologies for Under-Resourced
Languages, pages 131–138.
T¨undik, M. A. and Szasz´ak, G. (2016). Sz¨ovegalap´u nyelvi
elemz¨o ki´ert´ekel´ese g´epi besz´edfelismer¨o hib´akkal
terhelt kimenet´en. In Proc. 12th Hungarian Confer-
ence on Computational Linguistics (MSZNY), pages
111–120.
Zsibrita, J., Vincze, V., and Farkas, R. (2013). magyarlanc:
A toolkit for morphological and dependency parsing
of hungarian. In Proceedings of RANLP, pages 763–
771.