Sanushi-Salgado/Clustering

Clustering

Contents

  • Exploratory Data Analysis (EDA)
    • Exploring
      • the dataset
      • missing values
      • duplicate records
      • features
      • outliers
      • correlations
  • Data pre-processing
    • Data cleansing
      • Treating
        • missing data
        • duplicate records
        • outliers
    • Feature scaling
    • Dimensionality reduction
      • Principal Component Analysis (PCA)
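
The pre-processing steps listed above could be sketched roughly as follows. This is a minimal illustration, not the repository's actual code: the toy DataFrame, the column names `f1` to `f4`, median imputation, and the 1.5 × IQR clipping rule are all assumptions chosen for the example.

```python
import numpy as np
import pandas as pd
from sklearn.decomposition import PCA
from sklearn.preprocessing import StandardScaler

# Hypothetical toy data standing in for the repository's dataset.
rng = np.random.default_rng(0)
df = pd.DataFrame(rng.normal(size=(100, 4)), columns=["f1", "f2", "f3", "f4"])
df.loc[3, "f2"] = np.nan                                # inject a missing value
df.loc[7, "f1"] = 25.0                                  # inject an outlier
df = pd.concat([df, df.iloc[[0]]], ignore_index=True)   # inject a duplicate row

# Data cleansing: drop duplicate records, then impute missing values
# with the column median (an assumed strategy).
clean = df.drop_duplicates().copy()
clean = clean.fillna(clean.median())

# Treat outliers by clipping each feature to its 1.5 * IQR fences.
q1, q3 = clean.quantile(0.25), clean.quantile(0.75)
iqr = q3 - q1
clean = clean.clip(lower=q1 - 1.5 * iqr, upper=q3 + 1.5 * iqr, axis=1)

# Feature scaling (zero mean, unit variance), then PCA to two components.
scaled = StandardScaler().fit_transform(clean)
pca = PCA(n_components=2)
reduced = pca.fit_transform(scaled)
```

Scaling before PCA matters because PCA is variance-based, so unscaled features with large ranges would otherwise dominate the components.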

Implementation

  • k-Means clustering
    • Elbow method
  • Hierarchical clustering
    • Generating Dendrograms
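
As a rough sketch of the two approaches named above, assuming scikit-learn and SciPy are available: the synthetic blobs, the range of k values, and the choice of Ward linkage are illustrative assumptions, not details taken from the repository.

```python
import numpy as np
from scipy.cluster.hierarchy import dendrogram, fcluster, linkage
from sklearn.cluster import KMeans
from sklearn.datasets import make_blobs

# Hypothetical toy data: three blobs in two dimensions.
X, _ = make_blobs(n_samples=150, centers=3, cluster_std=0.6, random_state=42)

# Elbow method: record the within-cluster sum of squares (inertia) for each k.
ks = range(1, 8)
inertias = [
    KMeans(n_clusters=k, n_init=10, random_state=42).fit(X).inertia_
    for k in ks
]
# Plotting inertias against ks and looking for the bend suggests a value of k.

# Hierarchical clustering: build the linkage matrix with Ward's method.
Z = linkage(X, method="ward")

# Compute the dendrogram layout; drop no_plot=True to draw it with matplotlib.
tree = dendrogram(Z, no_plot=True)

# Cut the tree into at most three flat clusters.
labels = fcluster(Z, t=3, criterion="maxclust")
```

The elbow plot and the dendrogram serve the same purpose here: both give a visual cue for how many clusters the data supports before committing to a final model.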

Tools

  • Google Colab
  • Python

Source Code