International Journal of Applied Research on Information Technology and Computing (IJARITAC)
  • Year: 2019
  • Volume: 2
  • Issue: 2

A Dimensionality reduced Text data clustering with prediction of optimal number of clusters

  • Author:
  • M. Ramakrishna Murty1,, JVR Murthy2,, Prasad Reddy3,, Suresh Chandra Satapathy4,
  • Total Page Count: 9
  • Published Online: Aug 1, 2019
  • DOI:
  • Page Number: 41 to 49

1GMRIT,Rajam, India

2JNTU, Kakinada, India

3P.V.G.D, AU, India

4Sr Member IEEE, ANITS, India

*E-mail id: ramkrishna.malla@gmail.com

**E-mail id:mjonnalagedda@gmail.com

***E-mail id: prasadreddy.vizag@gmail.com

****E-mail id:sureshsatapathy@ieee.org

Abstract

Grouping large number of text documents is a challenging task due to high dimensional representation of the vector space model. Higher dimensionality of text data leads to computational burden and inefficient cluster results. In this work we improve the quality of the text document clustering using Singular Value Decomposition technique along with dimensionality reduction. In this paper we have also proposed Singular Value Decomposition which helps in finding the appropriate number of clusters for k-means clustering technique according to the singular values (Eigen values) calculated in the Singular Value Decomposition method.

Keywords

Text Clustering, Dimensionality, Singular Value Decomposition, K-means