Skip to content

Latest commit

 

History

History
6 lines (4 loc) · 309 Bytes

README.md

File metadata and controls

6 lines (4 loc) · 309 Bytes

Data Mining: Understanding the data

This is an attempt to learn how Unsupervised learning is done on random Text data.

We use CountVectorizer, Tf-IdfVectorizer Modules of Python to visualize Document to Document Similarity matrix. We use built-in clustering libraries for K-Means clustering and so on.