loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Paper Unlock

Authors: Aleš Pražák 1 ; Ludkě Müller 1 ; J. V. Psutka 2 and J. Psutka 2

Affiliations: 1 SpeechTech s.r.o., Czech Republic ; 2 University of West Bohemia, Czech Republic

Keyword(s): ASR, LVCSR, HMM, real-time, class-based language model, live TV, online subtitling.

Related Ontology Subjects/Areas/Topics: Applications ; Audio and Speech Processing ; Digital Signal Processing ; Multimedia ; Multimedia Signal Processing ; Pattern Recognition ; Software Engineering ; Telecommunications

Abstract: The paper describes a fast 2-pass large vocabulary continuous speech recognition (LVCSR) system for automatic online subtitling of live TV programs. The proposed system implementation can be used for direct recognition of TV program audio channel or recognition of a shadow speaker who re-speaks the original audio channel. The first part of this paper focuses on preparation of an adaptive language model for TV programs, where person names are specific for each subtitling session and have to be added to the recognition vocabulary. The second part outlines the recognition system conception for automatic online subtitling with vocabulary up to 150 000 words in real-time. The recognition system is based on Hidden Markov Models, lexical trees and bigram and quadgram language models in the first and second pass, respectively. Finally, experimental results from our project with the Czech Television are reported and discussed.

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 18.224.63.87

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Pražák, A.; Müller, L.; V. Psutka, J. and Psutka, J. (2007). LIVE TV SUBTITLING - Fast 2-pass LVCSR System for Online Subtitling. In Proceedings of the Second International Conference on Signal Processing and Multimedia Applications (ICETE 2007) - SIGMAP; ISBN 978-989-8111-13-5, SciTePress, pages 139-142. DOI: 10.5220/0002140301390142

@conference{sigmap07,
author={Aleš Pražák. and Ludkě Müller. and J. {V. Psutka}. and J. Psutka.},
title={LIVE TV SUBTITLING - Fast 2-pass LVCSR System for Online Subtitling},
booktitle={Proceedings of the Second International Conference on Signal Processing and Multimedia Applications (ICETE 2007) - SIGMAP},
year={2007},
pages={139-142},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0002140301390142},
isbn={978-989-8111-13-5},
}

TY - CONF

JO - Proceedings of the Second International Conference on Signal Processing and Multimedia Applications (ICETE 2007) - SIGMAP
TI - LIVE TV SUBTITLING - Fast 2-pass LVCSR System for Online Subtitling
SN - 978-989-8111-13-5
AU - Pražák, A.
AU - Müller, L.
AU - V. Psutka, J.
AU - Psutka, J.
PY - 2007
SP - 139
EP - 142
DO - 10.5220/0002140301390142
PB - SciTePress