Water Quality Classification Project:

This project employs machine learning techniques to classify the water quality in Europe using a dataset obtained from the European Environment Agency (EEA). The EEA dataset contains a vast amount of information on the quality and status of Europe's water resources, including rivers, lakes, groundwater bodies, transitional, coastal, and marine waters.

Project Goal:

The goal of the project is to use the Water Quality Index (WQI) to classify the quality of Europe's water resources accurately. The project contains all the necessary code and documentation to replicate it.

Data Preparation:

The dataset is quite large (16GB), so a subset of it will be utilized for the project. The data preparation process includes data cleaning and preprocessing steps, such as selecting relevant data, dealing with missing values, outliers, and scaling the data. The Water Quality Index (WQI) is calculated for each water resource to classify its quality., which includes:

Selecting relevant data
Creating custom datasets
Dealing with missing values
Dealing with outliers
Calculating the Water Quality Index (WQI)
Scaling the data

Build Classificaiton Models:

After the data is processed we going to utilize 3 machine learning models to find the optimal results with the hyperparameter tuning using the RandomSearchCV and GridSearchCV:

Decision Tree Classifier
Support Vector Machine (SVC)
K-Nearest Neighbors (KNN)

Technologies Used:

Python 3.x
Scikit Learn
Pandas
NumPy
Matplotlib

Clone this repository by running:

git clone/~https://github.com/stefanshipinkoski/water_quality_classificaiton.git

Navigate to the project directory:

cd water_quality_classificaiton

Create a conda environment and install the required dependencies:

conda env create -f enviorment.yml

Activate the conda environment:

conda activate water-quality-env

Usage

Launch a Jupyter Notebook session within the conda environment:
Run the data_preprocessing.ipynb notebook to clean and preprocess the dataset.
Run the water_quality_classification.ipynb notebook to classify the quality of water resources in Europe.

Contributing:

Please feel free to contribute to this project by submitting pull requests or creating issues.

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
.gitingore		.gitingore
Chemical Elements Classes.md		Chemical Elements Classes.md
LICENSE		LICENSE
README.md		README.md
data_preprocessing.ipynb		data_preprocessing.ipynb
environment.yml		environment.yml
water_quality_classification.ipynb		water_quality_classification.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Water Quality Classification Project:

Project Goal:

Data Preparation:

Build Classificaiton Models:

Technologies Used:

Usage

Contributing:

About

Releases

Packages

Languages

License

stefanshipinkoski/water_quality_classificaiton

Folders and files

Latest commit

History

Repository files navigation

Water Quality Classification Project:

Project Goal:

Data Preparation:

Build Classificaiton Models:

Technologies Used:

Usage

Contributing:

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages