loading
Papers Papers/2022 Papers Papers/2022

Research.Publish.Connect.

Paper

Paper Unlock

Authors: César Alonso Abad 1 ; Miguel Ángel Martín Fernández 2 and Carlos Alberola López 2

Affiliations: 1 Music Technology Group, Pompeu Fabra University, Spain ; 2 Image Processing Laboratory, University of Valladolid, Spain

Keyword(s): Fast perceptual quantization, perceptual audio coding, Huffman, histogram, scalability, low delay.

Related Ontology Subjects/Areas/Topics: Applications ; Audio and Speech Processing ; Digital Audio and Video Broadcasting ; Digital Signal Processing ; MPEG Standards and Related Issues ; Multimedia ; Multimedia and Communications ; Multimedia Signal Processing ; Pattern Recognition ; Software Engineering ; Telecommunications

Abstract: In this paper we present Fast Perceptual Quantization (FPQ), a novel procedure to quantize and code audio signals. It employs the same psychoacoustics principles used in the popular MPEG/Audio coders, but substantially simplifies the complexity and computational needs of the encoding process. FPQ is based on defining a hierarchy of privileged quantization values so that the masking threshold calculated through a psychoacoustic model is leveraged to quantize the real values to the privileged ones when possible. The computational cost of this process is very low compared to MP3’s or AAC’s quantization/coding loops. Experimental results show that it is possible to achieve nearly transparent coding using as few as approximately 100 quantization values. This leads to very efficient bit compaction using Huffman or arithmetic coding so that nearly state-of-the-art performance can be achieved in terms of quality/bit-rate trade-off. Since quantization and codification (bit compaction) procedu res are completely independent here, efficient scalable decoding can be achieved either by parsing and entropy re-encoding the original quantized values or by coding the bit-planes independently and sorting them in order of perceptual significance. Very low delay performance is also possible to achieve, which makes the proposed coding scheme suitable for real-time applications. (More)

CC BY-NC-ND 4.0

Sign In Guest: Register as new SciTePress user now for free.

Sign In SciTePress user: please login.

PDF ImageMy Papers

You are not signed in, therefore limits apply to your IP address 34.229.50.161

In the current month:
Recent papers: 100 available of 100 total
2+ years older papers: 200 available of 200 total

Paper citation in several formats:
Alonso Abad, C.; Ángel Martín Fernández, M. and Alberola López, C. (2007). LOW COMPLEXITY, LOW DELAY AND SCALABLE AUDIO CODING SCHEME BASED ON A NOVEL STATISTICAL PERCEPTUAL QUANTIZATION PROCEDURE. In Proceedings of the Second International Conference on Signal Processing and Multimedia Applications (ICETE 2007) - SIGMAP; ISBN 978-989-8111-13-5, SciTePress, pages 31-34. DOI: 10.5220/0002140100310034

@conference{sigmap07,
author={César {Alonso Abad}. and Miguel {Ángel Martín Fernández}. and Carlos {Alberola López}.},
title={LOW COMPLEXITY, LOW DELAY AND SCALABLE AUDIO CODING SCHEME BASED ON A NOVEL STATISTICAL PERCEPTUAL QUANTIZATION PROCEDURE},
booktitle={Proceedings of the Second International Conference on Signal Processing and Multimedia Applications (ICETE 2007) - SIGMAP},
year={2007},
pages={31-34},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0002140100310034},
isbn={978-989-8111-13-5},
}

TY - CONF

JO - Proceedings of the Second International Conference on Signal Processing and Multimedia Applications (ICETE 2007) - SIGMAP
TI - LOW COMPLEXITY, LOW DELAY AND SCALABLE AUDIO CODING SCHEME BASED ON A NOVEL STATISTICAL PERCEPTUAL QUANTIZATION PROCEDURE
SN - 978-989-8111-13-5
AU - Alonso Abad, C.
AU - Ángel Martín Fernández, M.
AU - Alberola López, C.
PY - 2007
SP - 31
EP - 34
DO - 10.5220/0002140100310034
PB - SciTePress