Table of Contents


πŸŽ“ Intended learning outcomes

At the end of this lesson, students are expected to:


πŸͺ£ Unsupervised learning β˜…β˜…β˜…

Until now, we have seen regression and classification problems. In these problems, you were provided some kind of labels with the data β€” each temperature point came with a location or biodiversity value, and each location on the cat came with a label pet/do not pet. These types of problems are called supervised learning because these labels in the training dataset serve as a kind of teacher, supervising the learning process by giving us the answers that we are supposed to learn to produce. What happens if we are given data without supervision β€” examples without the desired answers?

Oh no! Who left all this dirty laundry around the house? Your flatmate, again! And your parents are coming over in a few hours β€” what are you going to do?

Untitled

By your calculations, there is time to run $K = 3$ loads of laundry before your parents arrive. We need to sort these clothes into groups so we can clean them! We will denote the loads $k = \{1,2,3\}$, and our goal is to assign each article of clothing one of these $k$ values, as labels. Notice that I haven’t given you the labels (most people, including your flatmate, have their own system for sorting laundry β€” maybe it is color based, or maybe it is based on the fabric or outdoors vs. indoors). As such, this is an unsupervised learning problem.