Arya Bhatta Journal of Mathematics and Informatics
  • Year: 2021
  • Volume: 13
  • Issue: 1

Evaluation of first and second dosage of COVID-19 vaccination using k-means clustering model and visualization of Indian states and union territories

  • Author:
  • R. Lakshmi Priya1, M. Salomi2, G. Manimannan3
  • Total Page Count: 10
  • Page Number: 115 to 124

1Assistant Professor, Department of Statistics, Dr. Ambedkar Government Arts College, Chennai, Tamilnadu, India

2Assistant Professor, Department of Statistics, Madras Christian College, Tambaram, Chennai, Tamilnadu, India

3Assistant Professor, Department of Mathematics, TMG College of Arts and Science, Chennai, Tamilnadu, India, E-mail: manimannang@gmail.com

Online published on 10 September, 2021.

Abstract

Application of Orange Data mining software determines the clusters and plots the graph of vaccination data for various states and union territories. The file widget open new vaccination data set and perform k-mean++ from 2 to 9 with silhouette distance. The silhouette scores and cluster information are achieved. The three zones are visualized and the zones are labeled as green, Blue and Red: The green zone indicates that states and union territories are high vaccinated, the blue zones indicates states and union territories are Moderatelyvaccinated, and the red zone are low vaccinated states and union territories of India. The states and union territories ’of Sikkim, Tripura, Ladakh and Lakshadweep have low population butfalls in high vaccinated states of first and second dose. The states and union territories of Goa, Mizoram, Delhi, Arunachal Pradesh, Chandigarh, Uttarakhand, Gujarat, Rajasthan, Kerala, Jammu and Kashmir, Dadra and Nagar Haveli, Damn and Diu, Himachal Pradesh, Chhattisgarh and Andaman Nicobar Islandshavediverse population and come in the category of low vaccinated states of first and second dose. The states and union territories of Manipur, Meghalaya, Nagaland, Odisha, West Bengal, Haryana, Karnataka, Andhra Pradesh, Maharashtra, Telengana, Jharkhand, Madhya Pradesh, Punjab, Assam, Uttar Pradesh, Tamil Nadu Puduchery and Bihar have high population and are moderately vaccinated states of first and second dose. The open source tools like Orange Data mining found useful for exploring appropriate and applicable functions in data science. Several partitions with different values of k-number of clusters or partitions are recommended to review along with cluster quality index for optimum solution. K-means can be adapted to micro-level demarcation of containment zones. The clusters formed based on COVID-19 patient's vaccination data using Data Science techniques specifically Kmeans will be active, unbiased, accurate, visible, economic and easy to apply.

Keywords

K-means++, Visualization, COVID-19, Vaccination of first and Second Dosage and Indian States and Union Territories