International Journal of Applied Engineering Research, Dindigul
  • Year: 2011
  • Volume: 2
  • Issue: 2

A new codebook design schema for VQ-based Monaural Speech-Music Segregation

  • Author:
  • Seyedhossein Alavinia, Farbod Razzazi
  • Total Page Count: 10
  • Page Number: 462 to 471

Department of Electrical Engineering, Science and Research Branch, Islamic Azad University, Tehran, Iran

*Email: h_alavinia@yahoo.com

Abstract

Several ideas have been introduced to improve monaural speechmusic segregation problem. Schemadriven approaches employ some statistical methods to model the underlying source signals. Although schemabased techniques present a high quality segregated speech and music outputs, the computational complexity is the main drawback of these methods. In this paper, we proposed an optimized version of hybrid PCA-VQ model based on K-means clustering to overcome this deficiency. K-means algorithm does not work well for high dimensional data in terms of computational complexity and curse of dimensionality issues. To overcome the computational complexity of K-means algorithm and obtain uncorrelated and the most descriptive variables, we used Principal Component Analysis (PCA) technique. This technique is a commonly used statistical approach for dimension reduction. The goal of PCA is a linear mapping which maps data to a lower dimensional space, so that variance of the data in new space is maximized. First, we employed PCA on 2-D framed STFT of the input signal to compute uncorrelated feature vectors efficiently and subsequently. Then we introduced and evaluate a modified version of kmeans method for clustering. The simulation results demonstrate that the proposed schema can improve the segregated procedure in terms of PESQ criterion and complexity measures.

Keywords

K-means clustering, monaural segregation, PESQ, principal component analysis, schemadriven, vector quantization