top of page

A Study on the Frequency-Domain Primary-Ambient Extraction For Stereo Audio Signals

 

Publications:

[1] J. He, W. S. Gan, and E. L. Tan, “A study on the frequency-domain primary-ambient extraction for stereo audio signals,” in Proc. ICASSP, Florence, Italy, 2014, pp. 2892-2896.

Primary-ambient extraction (PAE) has been playing an important role in spatial audio analysis-synthesis. Based on the spatial features, PAE decomposes a signal into primary and ambient components, which are then rendered separately. PAE is performed in subband domain for complex input signals having multiple point-like sound sources. However, the performance of PAE approaches and their key influences for such signals have not been well-studied so far. In this paper, we conducted a study on frequency-domain PAE using principal component analysis (PCA) in the case of multiple sources. We found that the partitioning of the frequency bins is very critical in PAE. Simulation results reveal that the proposed top-down adaptive partitioning method achieves superior performance as compared to the conventional partitioning methods.

 

 

 

 

 

 

 

 

 

 

 

Below are some test tracks.

Input 

Test track

Primary: speech + music ;

Ambient:white noise;

PPR = 0.9 

Full-band

8 Uniform

Top-down

Observations

The test track is a comprehensive track, where only a few frames both sources are dominant (that is the case where these approaches would perform differently).

But it can be perceived that the proposed Top-Down approach has less distortion and the directions of the two sources are clearer.

© 2013 by HE Jianjun. Proudly created with Wix.com

  • s-facebook
  • s-linkedin
bottom of page