International Journal of Data Mining and Emerging Technologies

  • Year: 2017
  • Volume: 7
  • Issue: 1

An Analysis of Data Characteristics and Classifier Performance in the Context of Ordinal Classification

1Ph.D. Scholar, Graduate Student IEEE Member, Department of Computer Science, S.N.R. Sons College, Coimbatore, Tamil Nadu, India

2Head, Department of Computer Applications, S.N.R. Sons College, Coimbatore, Tamil Nadu, India

*(*Corresponding author) email id: dhanadurairaj@gmail.com

**saroviji@rediffmail.com

Abstract

The characteristics of data set play an important and foremost role in data mining and knowledge engineering. Classification is a foremost and necessary method in data mining. The motto of classification is to investigate the training data and to acquire an accurate description or model for each class using the characteristics present in the data. The characteristics of the data set are one of the leading performance affecting factors of the classifier. This paper performs an analysis of data set characteristics and its correlation with classifier performance. In particular, this analysis extends the study to the issues of ordinal classification with respect to the data set and recommendations proposed to address these issues.

Keywords

Classification, Ordinal classification, Dataset characteristics, Classifier performance, Imbalance, Issues of ordinal classification, With in class imbalance, Between class imbalance