Journal of Innovation in Electronics and Communication Engineering
  • Year: 2015
  • Volume: 5
  • Issue: 2

Computational Geometry Leveraged by Apache Spark

  • Authors: Nagarjuna Rao Dustakar1,*, Surendra Rao Dustakar2,**
  • Total Page Count: 17
  • Page Number: 15 to 31

1Department of Electrical Engineering, Arizona State University, Arizona, USA

2Department of Electronics and Communication Engineering, Guru Nanak Institutions Technical Campus, Hyderabad, Telangana, India

*ndustaka@asu.edu

**sdustakar@gmail.com

Published online on 27 June 2017.

Abstract

Apache Spark, a cluster computing framework, is widely used for solving big data problems in distributed environments. However, its efficiency has not been fully analyzed across different numbers of nodes or for large-scale computational geometry operations such as Geometry Union, Convex Hull, Closest Pair and Farthest Pair, and Spatial Range, Join, and Aggregation queries on small, medium, and huge spatial datasets. In this paper, we implement these operations using functions built into the Apache Spark framework, such as Map and Map Partitions, and analyze the framework's performance and efficiency on single-node and multi-node configurations.
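
To illustrate how a Map Partitions style function can carry a geometry operation, the following is a minimal sketch of a partition-then-merge Convex Hull in plain Python. It is an assumption about the general pattern, not the implementation evaluated in this paper: the helper names (`convex_hull`, `local_hull`, `distributed_hull`) are hypothetical, the hull routine is Andrew's monotone chain algorithm, and partitions are emulated with ordinary lists. In PySpark, `local_hull` has the iterator-in, iterator-out shape that `RDD.mapPartitions` expects.

```python
from typing import Iterable, Iterator, List, Tuple

Point = Tuple[float, float]


def cross(o: Point, a: Point, b: Point) -> float:
    """Z-component of (a - o) x (b - o); positive for a counter-clockwise turn."""
    return (a[0] - o[0]) * (b[1] - o[1]) - (a[1] - o[1]) * (b[0] - o[0])


def convex_hull(points: Iterable[Point]) -> List[Point]:
    """Andrew's monotone chain; returns hull vertices in counter-clockwise order."""
    pts = sorted(set(points))
    if len(pts) <= 2:
        return pts
    lower: List[Point] = []
    for p in pts:
        # Drop points that would create a clockwise or collinear turn.
        while len(lower) >= 2 and cross(lower[-2], lower[-1], p) <= 0:
            lower.pop()
        lower.append(p)
    upper: List[Point] = []
    for p in reversed(pts):
        while len(upper) >= 2 and cross(upper[-2], upper[-1], p) <= 0:
            upper.pop()
        upper.append(p)
    # The last point of each chain is the first point of the other.
    return lower[:-1] + upper[:-1]


def local_hull(partition: Iterator[Point]) -> Iterator[Point]:
    """Per-partition step (hypothetical): keep only the partition's hull vertices."""
    return iter(convex_hull(partition))


def distributed_hull(partitions: List[List[Point]]) -> List[Point]:
    """Emulates the cluster flow: local hulls per partition, then one final merge."""
    candidates = [p for part in partitions for p in local_hull(iter(part))]
    return convex_hull(candidates)
```

The design rests on the fact that every vertex of the global hull is also a vertex of the local hull of the partition that contains it, so interior points can be discarded on each node before any data is gathered for the final merge.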

Keywords

Apache Spark, Hadoop Distributed File System (HDFS), Computational Geometry, Geometric Algorithms, Distributed Database