A Study on the Frequency-Domain Primary-Ambient Extraction For Stereo Audio Signals
Publications:
[1] J. He, W. S. Gan, and E. L. Tan, “A study on the frequency-domain primary-ambient extraction for stereo audio signals,” in Proc. ICASSP, Florence, Italy, 2014, pp. 2892-2896.
Primary-ambient extraction (PAE) has been playing an important role in spatial audio analysis-synthesis. Based on the spatial features, PAE decomposes a signal into primary and ambient components, which are then rendered separately. PAE is performed in subband domain for complex input signals having multiple point-like sound sources. However, the performance of PAE approaches and their key influences for such signals have not been well-studied so far. In this paper, we conducted a study on frequency-domain PAE using principal component analysis (PCA) in the case of multiple sources. We found that the partitioning of the frequency bins is very critical in PAE. Simulation results reveal that the proposed top-down adaptive partitioning method achieves superior performance as compared to the conventional partitioning methods.
Below are some test tracks.
Input
Test track
Primary: speech + music ;
Ambient:white noise;
PPR = 0.9