International Journal of Engineering, Science and Mathematics

  • Year: 2017
  • Volume: 6
  • Issue: 6

Preliminary Investigations on the spam filtering using statistical classification techniques

  • Author:
  • A.V. Rajeswari
  • Total Page Count: 10
  • DOI:
  • Page Number: 336 to 345

M. Sc, M. Phil, Ph. D, Professor, Department of Physics, J.M.J College for Women, Tenali, Andhra Pradesh, India-522202

Online published on 19 April, 2019.

Abstract

A study on the thereotical and application of the statistical filtering techniques for the spam classification problems has been conducted and its results are presented. The research methodology applied in this study starts with building the dataset, correcting errors in the datasetand discusses the techniques to compute the probability of tokens in the dataset and the statistical application of the token values. The analysis shown is used to depict that statistical filtering is better than heuristic-based filtering because the former approach gives specific information about making a decision. All the four popular techniques in use today namely-Bayesian Combination by Paul Graham, Bayesian Combination by Brian Burton, Robinson's Geometric Mean Test and Fisher-Robinson's Inverse Chi-Square and their merits and demerits have been discussed in detail.

Keywords

Spam Classfication, Statistical Filtering, Bayes Classifier, Machine Learning