Unsupervised Learning

Clustering (Divide by similarity)

Models are trained on unlabeled data and must discover structure on their own.

The data set has no labels, so there is no “right answer” for each example to learn from.

Customer Segmentation

X:
- Purchase frequency
- Average order value
- Product categories
- Visit patterns
y (not given; the groups are discovered by the algorithm):
- Groups such as “high-value”, “occasional”, “price-sensitive”
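
A minimal sketch of how such groups could be discovered, using k-means in plain Python. The customer numbers, the two chosen features (purchases per month, average order value), the choice of k = 3, and the farthest-point initialization are all assumptions made for illustration; they are not part of the notes above.

```python
import math

def min_max_scale(rows):
    """Rescale every feature to [0, 1] so no feature dominates the distance."""
    cols = list(zip(*rows))
    lows = [min(c) for c in cols]
    spans = [(max(c) - min(c)) or 1.0 for c in cols]
    return [tuple((v - lo) / s for v, lo, s in zip(row, lows, spans))
            for row in rows]

def init_centroids(points, k):
    """Deterministic start: first point, then repeatedly the point that is
    farthest from all centroids chosen so far (an illustrative choice)."""
    centroids = [points[0]]
    while len(centroids) < k:
        farthest = max(points,
                       key=lambda p: min(math.dist(p, c) for c in centroids))
        centroids.append(farthest)
    return [list(c) for c in centroids]

def kmeans(points, k, max_iter=100):
    """Alternate assignment and update steps until labels stop changing."""
    centroids = init_centroids(points, k)
    labels = None
    for _ in range(max_iter):
        # Assignment step: each point joins its nearest centroid.
        new_labels = [min(range(k), key=lambda c: math.dist(p, centroids[c]))
                      for p in points]
        if new_labels == labels:
            break
        labels = new_labels
        # Update step: each centroid moves to the mean of its members.
        for c in range(k):
            members = [p for p, l in zip(points, labels) if l == c]
            if members:
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return centroids, labels

# Hypothetical customers: (purchases per month, average order value)
customers = [
    (12, 95.0), (10, 110.0), (11, 102.0),  # frequent, large orders
    (1, 40.0), (2, 35.0), (1, 45.0),       # rare visits
    (8, 15.0), (9, 12.0), (7, 18.0),       # frequent, small orders
]

centroids, labels = kmeans(min_max_scale(customers), k=3)
print(labels)  # [0, 0, 0, 1, 1, 1, 2, 2, 2]
```

The algorithm only returns cluster numbers. Names such as “high-value” or “price-sensitive” are attached afterwards by a person who inspects each group.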

Email categorization

X: A collection of emails. The algorithm groups similar messages together, and we can name the groups afterwards (e.g. work, friends, family, marketing, spam).
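
One way to sketch this grouping, assuming each email is reduced to its set of words and two emails count as similar when their Jaccard similarity reaches a threshold. The sample messages, the 0.3 threshold, and the function names are hypothetical; real systems typically use richer text features.

```python
def tokens(text):
    """Represent a message as the set of its lowercase words."""
    return set(text.lower().split())

def jaccard(a, b):
    """Shared words divided by all words; 0.0 when both sets are empty."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0

def group_by_similarity(texts, threshold=0.3):
    """Single-link grouping: messages end up in the same group when a chain
    of pairwise similarities >= threshold connects them."""
    bags = [tokens(t) for t in texts]
    group = list(range(len(texts)))  # every message starts in its own group
    for i in range(len(texts)):
        for j in range(i + 1, len(texts)):
            if jaccard(bags[i], bags[j]) >= threshold:
                # Merge j's group into i's group.
                old, new = group[j], group[i]
                group = [new if g == old else g for g in group]
    return group

# Hypothetical messages, no labels attached
emails = [
    "meeting agenda for the project review",
    "project review meeting moved to friday",
    "huge discount sale this weekend only",
    "weekend sale extra discount on shoes",
    "family dinner at grandma house sunday",
    "sunday dinner with family bring dessert",
]

groups = group_by_similarity(emails)
print(groups)  # [0, 0, 2, 2, 4, 4]
```

The group ids carry no meaning by themselves; reading a few messages from each group is what lets us name them “work”, “marketing”, or “family”.
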

Other applications

- News items (grouping related articles)
- DNA sequences (grouping by how strongly certain genes are expressed)
- Social network analysis
- Market segmentation
- Astronomical data analysis
- Cocktail party algorithm (separating two voices recorded by two microphones; also used for noise cancellation)