Auditory time-frequency masking: psychoacoustical data and application to audio representations

Authors: Necciari T., Balazs P., Kronland-Martinet R., Ystad S., Laback B., Savel S., Meunier S.
Publication Date: August 2012
Journal: Post-proceedings of CMMR 2011 - Speech, sound and music processing: embracing research in India (LNCS vol. 7172, pp. 146-171, Springer-Verlag Heidelberg, 2012)

Tags: Time-Frequency Masking

Abstract

In this paper, the results of psychoacoustical experiments on auditory time-frequency (TF) masking using stimuli (masker and target) with maximal concentration in the TF plane are presented. The target was shifted either along the time axis, the frequency axis, or both relative to the masker. The results show that a simple superposition of spectral and temporal masking functions does not provide an accurate representation of the measured TF masking function. This confirms the inaccuracy of simple models of TF masking currently implemented in some perceptual audio codecs. In the context of audio signal processing, the present results constitute a crucial basis for the prediction of auditory masking in the TF representations of sounds. An algorithm that removes the inaudible components in the wavelet transform of a sound while causing no audible difference to the original sound after re-synthesis is proposed. Preliminary results are promising, although further development is required.