Keyboard shortcuts

Press or to navigate between chapters

Press S or / to search in the book

Press ? to show this help

Press Esc to hide this help

Unsupervised Learning

  • Clustering (Divide by similarity)

Models are trained on unlabeled data and must discover structure on their own.

  • We don't have the right answer for the data set.

Customer Segmentation

  • X:
    • Purchase frequency
    • Average order value
    • Product categories
    • Visit patterns
  • y:
    • Groups such as "high-value", "occasional", "price-sensitive"

Email categorization

  • X: Take a bunch of emails and create groups of messages. We can later name those groups (e.g. work, friends, family, marketing, spam, etc.)

  • News items

  • DNA sequences how much certains Genes are expressed

  • Social network analysis

  • Market segmentations

  • Astronomical data analysis

  • Cocktail party algorithm (separating two voices as recorded by two microphones) (Noise cancellation)