International Journal of Research in Engineering and Applied Sciences
  • Year: 2016
  • Volume: 6
  • Issue: 11

Predictive analytics in data mining with big data: A literature survey

  • Author:
  • V. Vignesh1, M. Mohanapriya2
  • Total Page Count: 13
  • Page Number: 10 to 22

1Research Scholar, Department of Computer Science, Karpagam University, Coimbatore-641021, Tamil Nadu, India

2Head, Department of Computer Science and Engineering, Karpagam University, Coimbatore-641021, Tamil Nadu, India

Online published on 8 May, 2017.

Abstract

Big data is a collection of massive and complex data sets that include the huge quantities of data, social media analytics, data management capabilities, real-time data. Big data analytics is the process of examining large amounts of data. Big Data is characterized by the dimensions volume, variety, and velocity, while there are some well-established methods for big data processing such as Hadoop which uses the map-reduce paradigm. Using Map Reduce programming paradigm the big data is processed. While there are some well-established methods for big data processing such as Hadoop which uses the map-reduce paradigm. Using Map Reduce programming paradigm the big data is processed. The technologies used by big data application to handle the massive data are Hadoop, Map Reduce, Apache Hive, No SQL and HPCC. These technologies handle massive amount of data in MB, PB, YB, ZB, KB and TB. In this research paper various technologies for handling big data along with the advantages and disadvantages of each technology for catering the problems in hand to deal the massive data has discussed.

Keywords

Big Data, Parameters, Evolution, Hadoop, HDFS