Revealing Insights Through Exploratory Data Analysis on Earthquake Dataset

Authors

  • Kiagus Muhammad Arsyad, Department of Computer Science, Universitas Pertamina, Jakarta 12220, Indonesia
  • Ariana Yunita, Department of Computer Science, Universitas Pertamina, Jakarta 12220, Indonesia, https://orcid.org/0000-0001-7883-5065
  • Haniifah Mas'uudah Krismartopo, Department of Geophysics Engineering, Universitas Pertamina, Jakarta 12220, Indonesia
  • Aghnia Syahputri Dimar, Department of Geophysics Engineering, Universitas Pertamina, Jakarta 12220, Indonesia
  • Kartika Dewi, Department of Geophysics Engineering, Universitas Pertamina, Jakarta 12220, Indonesia
  • Iktri Madrinovella, Department of Geophysics Engineering, Universitas Pertamina, Jakarta 12220, Indonesia

DOI:

https://doi.org/10.57102/jsis.v1i1.18

Keywords:

data visualization, earthquake dataset, exploratory data analysis, correlation analysis, geospatial analysis

Abstract

Exploratory Data Analysis (EDA) is a critical step in developing machine learning models: its goal is to summarize the main characteristics of the data, often with visual methods, before modeling. It is frequently used as a prerequisite for more advanced data analytics techniques. Earthquakes are among the natural disasters that occur most commonly worldwide, and they cause many casualties. Machine learning for earthquake prediction has been studied extensively in recent years. This preliminary study explores an earthquake dataset to reveal several insights. It performs EDA on a dataset available on Kaggle that covers earthquakes from 1965 to 2016. Using several Python libraries for data visualization and correlation analysis, the study finds that depth does not correlate with magnitude and that 2011 was the year with the most earthquakes. Recommendations for further research are to cluster the dataset using clustering algorithms, such as K-means and hierarchical clustering, and then to classify it using several classification algorithms.
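The two analyses named in the abstract, a depth-magnitude correlation and a count of events per year, can be sketched in a few lines of pandas. This is only an illustrative sketch, not the authors' code: the column names `Date`, `Depth`, and `Magnitude`, the `%m/%d/%Y` date format, and the helper `summarize_quakes` are assumptions, and the toy rows are made up rather than taken from the Kaggle dataset.

```python
import pandas as pd


def summarize_quakes(df: pd.DataFrame) -> tuple[float, pd.Series]:
    """Return the Pearson depth-magnitude correlation and event counts per year.

    Assumes (hypothetically) columns named Date, Depth, and Magnitude.
    """
    # Pearson correlation coefficient between the two numeric columns
    corr = df["Depth"].corr(df["Magnitude"])
    # Parse dates with an assumed month/day/year format;
    # entries that do not match become NaT and are excluded from the count
    years = pd.to_datetime(df["Date"], format="%m/%d/%Y", errors="coerce").dt.year
    per_year = years.dropna().astype(int).value_counts().sort_index()
    return corr, per_year


# Made-up toy rows for illustration only, NOT values from the real dataset
toy = pd.DataFrame({
    "Date": ["01/02/1965", "03/11/2011", "04/07/2011", "12/26/2004"],
    "Depth": [10.0, 29.0, 42.0, 30.0],
    "Magnitude": [6.0, 9.1, 7.1, 9.1],
})

corr, per_year = summarize_quakes(toy)
print(f"Pearson r (depth vs. magnitude): {corr:.3f}")
print("Year with the most events:", per_year.idxmax())
```

On the real data, the same function could be applied to a DataFrame loaded with `pd.read_csv`, provided the column names and date format match these assumptions.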

Published

2023-02-02

Section

Articles