Hydrochemical assessment of Semarang area using multivariate statistics: A sample based dataset
Dasapta Erwin Irawan1 and Thomas Triadi Putranto21Faculty of Earth Sciences and Technology, Institut Teknologi Bandung, Jalan Ganesa No. 10, Bandung – 40132, Indonesia 2Faculty of Engineering, Universitas Diponegoro, Jalan Prof. H. Soedarto, SH, Tembalang, Kota Semarang – 50275, Indonesia
Abstract. The following paper describes in brief the data set related to our project "Hydrochemical assessment of Semarang Groundwater Quality". All of 58 samples were taken in 1992, 1993, 2003, 2006, and 2007 using well point data from several reports from Ministry of Energy and Min- eral Resources and independent consultants. We provided 20 parameters in each samples (sample id, coord X, coord Y, well depth, water level, water elevation, TDS, pH, EC, K, Ca, Na, Mg, Cl, SO4, HCO3, year, ion balance, screen location, and chemical facies). The chemical composi- tion were tested in the Water Quality Laboratory, Universitas Diponegoro using mas spectrofotometer method.
The statistical treatment for the dataset (available on Zenodo doi:10.5281/zenodo.57293) were described as follows: (1) data preparation in to csv file format, load it in to R environment; (2) data treatment, including: correlation matrix, cluster analysis using kmeans and hierarchical cluster analysis, and principal component analysis. For anal- ysis and visualizations, We used the following R packages: ggplot2, dplyr, factomineR, factoExtra, cluster, ggcorrplot, and ape.
Irawan, D. E. and Putranto, T. T.: Hydrochemical assessment of Semarang area using multivariate statistics: A sample based dataset, Earth Syst. Sci. Data Discuss., doi:10.5194/essd-2016-29, 2016.