top of page

Primary-Ambient Extraction using Ambient Phase Estimation with a Sparsity Constraint

 

Publications:

[1] J. He, W. S. Gan, and E. L. Tan, “Primary-ambient extraction using ambient phase estimation with a sparsity constraint,”  IEEE Signal Process. Letters, vol. 22, no. 8, pp. 1127-1131, Aug. 2015.

Spatial audio reproduction addresses the growing commercial need to recreate an immersive listening experience of digital media content, such as movies and games. Primary-ambient extraction (PAE) is one of the key approaches to facilitate flexible and optimal rendering in spatial audio reproduction. Existing approaches, such as principal component analysis and time-frequency masking, often suffer from severe extraction error. This problem is more evident when the sound scene contains a relatively strong ambient component, which is frequently encountered in digital media. In this Letter, we propose a novel PAE approach by estimating the ambient phase with a sparsity constraint (APES). This approach exploits the equal magnitude of the uncorrelated ambient components in the two channels of a stereo signal and reformulates the PAE problem as an ambient phase estimation problem, which is then solved using the criterion that the primary component is sparse. Our experimental results demonstrate that the proposed approach significantly outperforms existing approaches, especially when the ambient component is relatively strong.
 
Below are some test tracks to compare APES with PCA.

Input Signal

Track 1

(Music + Wave lapping sound, 8s)

PCA

APES

Primary

Ambient

Observation

Track 2

(Speech + Wave sound, 8s)

Primary

Ambient

Track 3

(Music + Shopping center sound, 4s)

Primary

Ambient

Note: APES is dentoed as "APEPS" in the codes

bottom of page