Water Quality Classification

Water Quality Analysis

Dataset Information

Safe drinking water is essential to a healthy life and is a fundamental human right. Access to it is a health and development issue at the national, regional, and local levels. In some regions, investing in water supply and sanitation can yield a net economic benefit, since the reductions in adverse health effects and health care costs outweigh the costs of the interventions.

Attribute Information:

Input variables (based on physicochemical tests):
1 - ph-> pH of water
2 - Hardness-> Capacity of water to precipitate soap in mg/L
3 - Solids-> Total dissolved solids in ppm
4 - Chloramines-> Amount of Chloramines in ppm
5 - Sulfate-> Amount of Sulfates dissolved in mg/L
6 - Conductivity-> Electrical conductivity of water in μS/cm
7 - Organic_carbon-> Amount of organic carbon in ppm
8 - Trihalomethanes-> Amount of Trihalomethanes in μg/L
9 - Turbidity-> Measure of the cloudiness of water, based on how much light it scatters, in NTU (Nephelometric Turbidity Units)

Output variable (based on sensory data):
10 - Potability-> Indicates if water is safe for human consumption

Download link: https://www.kaggle.com/adityakadiwal/water-potability
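
The attribute list above maps directly onto a feature matrix and a target column. The following is a minimal sketch of loading the data with pandas and splitting it that way. The column names come from the attribute list; the helper names `load_water_data` and `split_features_target` and the file path you pass in are assumptions for illustration, not part of this repository.

```python
import pandas as pd

# Column names as given in the attribute list above.
COLUMNS = [
    "ph", "Hardness", "Solids", "Chloramines", "Sulfate",
    "Conductivity", "Organic_carbon", "Trihalomethanes",
    "Turbidity", "Potability",
]


def load_water_data(path):
    """Read the CSV (path or file-like object) and keep the expected columns.

    Hypothetical helper: raises if any expected column is absent.
    """
    df = pd.read_csv(path)
    missing = set(COLUMNS) - set(df.columns)
    if missing:
        raise ValueError(f"missing expected columns: {sorted(missing)}")
    return df[COLUMNS]


def split_features_target(df):
    """Separate the nine input variables from the Potability label."""
    X = df.drop(columns="Potability")
    y = df["Potability"]
    return X, y
```

After downloading the CSV from the link above, calling `df.isna().sum()` on the loaded frame is a quick way to see whether any columns contain missing values before modelling.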

Libraries

  • pandas
  • matplotlib
  • seaborn
  • plotly
  • scikit-learn
  • xgboost

Algorithms

  • Logistic Regression
  • K Nearest Neighbours
  • Support Vector Machine
  • Decision Tree
  • Random Forest
  • XGBoost

Best Model Accuracy: 70.00%
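
One way to compare the algorithms listed above is to score each with cross-validated accuracy, as in the sketch below. It is not the notebook's actual pipeline: the median imputation, the standard scaling, the default hyperparameters, the 5-fold setting, and the helper names `build_models` and `compare_models` are all assumptions chosen for illustration, so the numbers it produces need not match the accuracy reported above. XGBoost is included only if the `xgboost` package is installed.

```python
from sklearn.ensemble import RandomForestClassifier
from sklearn.impute import SimpleImputer
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score
from sklearn.neighbors import KNeighborsClassifier
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC
from sklearn.tree import DecisionTreeClassifier


def build_models(seed=0):
    """Return the listed classifiers with default settings (an assumption)."""
    models = {
        "Logistic Regression": LogisticRegression(max_iter=1000),
        "K Nearest Neighbours": KNeighborsClassifier(),
        "Support Vector Machine": SVC(),
        "Decision Tree": DecisionTreeClassifier(random_state=seed),
        "Random Forest": RandomForestClassifier(random_state=seed),
    }
    try:
        # Optional dependency: skipped when xgboost is not installed.
        from xgboost import XGBClassifier
        models["XGBoost"] = XGBClassifier(random_state=seed)
    except ImportError:
        pass
    return models


def compare_models(X, y, cv=5, seed=0):
    """Mean cross-validated accuracy per model, as a dict of name -> score."""
    scores = {}
    for name, model in build_models(seed).items():
        # Impute and scale inside the pipeline so each fold is fitted
        # only on its own training split.
        pipe = make_pipeline(
            SimpleImputer(strategy="median"), StandardScaler(), model
        )
        fold_scores = cross_val_score(pipe, X, y, cv=cv, scoring="accuracy")
        scores[name] = fold_scores.mean()
    return scores
```

Given a feature matrix `X` and a label vector `y`, `compare_models(X, y)` returns accuracies as fractions between 0 and 1, which can be sorted to pick the best-performing model.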