This article has multiple issues. Please helpimprove it or discuss these issues on thetalk page.(Learn how and when to remove these messages) (Learn how and when to remove this message)
|
Masking threshold withinacoustics (a branch ofphysics that deals with topics such asvibration,sound,ultrasound, andinfrasound), refers to a process where if there are two concurrent sounds and one sound is louder than the other, a person may be unable to hear the soft sound because it is masked by the louder sound.[1]
So the masking threshold is thesound pressure level of a sound needed to make the sound audible in the presence of another noise called a "masker". This threshold depends upon thefrequency, the type of masker, and the kind of sound being masked. The effect is strongest between two sounds close in frequency.
In the context of audio transmission, there are some advantages to being unable to perceive a sound. Inaudio encoding, for example, bettercompression can be achieved by omitting the inaudible tones. This requires fewerbits to encode the sound and reduces the size of the final file.
It is uncommon to work with only onetone. Most sounds are composed of multiple tones. There can be many possible maskers at the same frequency. In this situation, it would be necessary to compute theglobal masking threshold using a high resolutionFast Fourier transform via 512 or 1024 points to determine the frequencies that comprise the sound. Because there arebandwidths that humans are not able to hear, it is necessary to know the signal level, masker type, and thefrequency band before computing the individual thresholds. To avoid having the masking threshold under the threshold in quiet, one adds the last one to the computation of partial thresholds.[clarification needed] This allows computation of the signal-to-mask ratio (SMR).

TheMPEG audio encoding process leverages the masking threshold. In this process, there is a block called "Psychoacoustic model". This is communicated with the band filter and the quantify block. The psychoacoustic model analyzes the samples sent to it by the filter band, computing the masking threshold in each frequency band using a Fast Fourier transform. The number of points used depends upon the MPEG layer. Using these thresholds, the signal-to-mask ratio is determined and sent to the quantifier. The quantifier assigns more or less bits in each block based upon the SMR. The block with the highest SMR will encode with the maximum number ofbits.