Open Access Open Access  Restricted Access Subscription or Fee Access

Application of Surface Water Quality Classification Models Using Principal Components Analysis and Cluster Analysis

mohamed Ahmed Reda hamed

Abstract


Water quality monitoring has one of the highest priorities in surface water protection policy. Many variety approaches are being used to interpret and analyze the concealed variables that determine the variance of observed water quality of various source points. A considerable proportion of these approaches are mainly based on statistical methods, multivariate statistical techniques in particular. In the present study, the use of multivariate techniques is required to reduce the large variables number of Nile River water quality upstream Cairo Drinking Water Plants (CDWPs) and determination of relationships among them for easy and robust evaluation. By means of multivariate statistics of principal components analysis (PCA), Fuzzy C-Means (FCM) and K-means algorithm for clustering analysis, this study attempted to determine the major dominant factors responsible for the variations of Nile River water quality upstream Cairo Drinking Water Plants (CDWPs). Furthermore, cluster analysis classified 21sampling stations into three clusters based on similarities of water quality features.
The result of PCA shows that 6 principal components contain the key variables and account for 75.82% of total variance of the study area surface water quality and the dominant water quality parameters were: Conductivity, Iron, Biological Oxygen Demand (BOD), Total Coliform (TC), Ammonia (NH3), and pH.
However, the results from both of FCM clustering and K-means algorithm , based on the dominant parameters concentrations, determined 3 cluster groups and produced cluster centers (prototypes). Based on clustering classification, a noted water quality deteriorating as the cluster number increased from 1 to 3, thus the cluster grouping can be used to identify the physical, chemical and biological processes creating the variations in the water quality parameters.
This study revealed that multivariate analysis techniques, as the extracted water quality dominant parameters and clustered information can be used in reducing the number of sampling parameters on the Nile River in a cost effective and efficient way instead of using a large set of parameters without missing much information. These techniques can be helpful for decision makers to obtain a global view on the water quality in any surface water or other water bodies when analyzing large data sets especially without a priori knowledge about relationships between them.

Full Text:

PDF

References


Abd El-Daiem S. Water quality management in Egypt. J Water Resour Dev. 2011; 27(1); 181–202p.

Adekunle L, Adetunji M, Gbadebo A. Assessment of ground water quality in a typical rural settlement in south Nigeria. Int J Environ Res Public Health. 2007; 4(4); 307–318p.

Akume D, Weber G.-W. 2002. Cluster algorithms; theory and methods. J Comput Technol. 2007; 7(1); 15–27p.

Cairo Drinking Water Company. Central Laboratory Annual Technical Report. 2018.

Cattel RD. The scree test for the number of factors. Multivar Behav Res. 1966; 1; 245–276p.

Chatfield C, Collin AJ. Introduction to Multivariate Analysis. New York; Chapman and Hall in Association with Methuen, Inc.; 1980.

Clare AA, Dianne FJ, Stuart LS. Effect of overlying water pH, dissolved oxygen, salinity and sediment disturbances on metal release and sequestration from metal contaminated marine sediments. Chemosphere. 2007; 69(9); 1428–1437p.

Davis JC. Statistics and Data Analysis in Geology, 2rd edition. New York; John Wiley and Sons, Inc; 2002.

Egyptian Governmental Law No. 48, 1982. The Implementer Regulations for law 48/1982 regarding the protection of the River Nile and water ways from pollution. Map Period Bull. 1982; 3–4; 12–35p.

EWQS (Egyptian Drinking Water Quality Standards). (2007). Ministry of Health, Population Decision number 458.

Goher ME, Hassan AM, Abdel-Moniem IA, Fahmy AH, El-Sayed SM. Evaluation of surface water quality and heavy metal indices of Ismailia Canal, Nile River, Egypt. Egypt J Aquat Res. 2014; 40; 225–233p.

Karavoltsos S, Sakellar A, Mihopoulos N, Dassenakis M, Scoullos MJ. Evaluation of the quality of drinking water in regions of Greece. Desalination. 2008; 224; 317–329p.

Jolliffe IT. Principal Component Analysis, 2nd edition. New York; Springer-Verlag; 2002.

Kaufman L, Rousseeuw PJ, Finding Groups in Data—An Introduction to Cluster Analysis. New York; John Wiley & Sons Inc; 1990.

Laurie K, Bryan F, Manly J. Multivariate Statistical Methods; A Primer. Chapman & Hall/Crc; 2005.

McKenna JE. An enhanced cluster analysis program with bootstrap significance testing for ecological community analysis. Environ Model Softw. 2003; 18(3); 205–220p.

Panda UC, Sundaray SK, Rath P, Nayak BB, Bhatta D. Application of factor and cluster analysis for characterization of river and estuarine water systems – a case study; Mahanadi River (India). J Hydrol. 2006; 331; 434–445p.

Rabeh SA. Ecological stufies on nitrogen cycle bacteria in Lake Manzalah, Egypt. Egypt J Aquat Biol Fish. 2001; 5(3); 263–282p.

Reghunath R, Murthy STR, Raghavan BR. The utility of multivariate statistical techniques in hydrogeochemical studies; an example from Karnataka, India. Water Res. 2002; 36; 2437–2442p.

Saleh AR. Bacteria and viruses in the Nile. Monogr Biol. 2009; 89; 407–429p.

Selim SZ. Soft clustering of multi-dimensional data; a semi-fuzzy approach. Patt Recogn. 1984; 17(5); 559–568p.

Suhr D. (2005). Principal component analysis vs. exploratory factor analysis. SUGI 30 Proceedings. Available from; http;//www2.sas.com

/proceedings/sugi30/Leadrs30.pdf.

Terceiro P, Lobo-Ferreira JP, Leitão TE. Análise da qualidade da água e questões de governân-ciana Albufeirado Alqueva. Comunicaçãoapresen-tadano9 Congressoda Água–Água; Desafiosdehoje, exigênciasdeamanhã. Cascais, Portugal. Available from; http;//www.aprh.pt/congressoagua2008/PDF/Lobo-FerreiraAlqueva.pdf. (Accessed 20 January 2009). (In Portuguese).

Tebbutt T. Principles of Water Quality Control. 5th edition. Hallam University; 1998.

Toufeek MA, Korium MA. Physico-chemical characteristics of water quality in Lake Nasser water. Glob J Environ Res. 2009; 3(3); 141–148p.

Trauwaert E, Kaufman L, Rousseeuw P. Fuzzy clustering algorithms based on the maximum likelihood principle. Fuzzy Sets Syst. 1991; 42; 213–227p.

Toufeek MA, Korium MA. Physico-chemical characteristics of water quality in Lake Nasser water. Global J Environ Res. 2005; 3(3); 141–148p.

World Health Organization. Health Guidelines for the Use of Wastewater in Agriculture and Aquaculture. Report of a WHO Scientific Group, Technical Report Series No. 778. Geneva; WHO; 1989.

Yu S, Shang J, Zhao J, Guo H. Factor analysis and dynamics of water quality of the Songhua River Northeast China. Water Air Soil Pollut. 2003; 144; 159–169p.

Zeng X, Rasmussen TC. Multivariate statistical characterization of water quality in lake Lanier, Georgia, USA. J Environ Qual. 2005; 34(6); 1980–1991p.

Zamxaka M, Pironcheva G, Muyima NYO. Microbiological and physico-chemical assessment of the quality of domestic water sources in selected rural communities of the Eastern Cape Province, South Africa.




DOI: https://doi.org/10.37628/jwre.v5i1.454

Refbacks

  • There are currently no refbacks.