By the end of this lesson, the student is expected to:
In this lesson, we discuss nonparametric methods for density estimation, i.e., methods that do not require you to assume a particular parametric family for the density being estimated. These are examples of a broader class of nonparametric methods, which may be probabilistic or non-probabilistic and are used throughout machine learning, statistics, and probability.
Suppose that you are a professor teaching a new machine-learning course at KTH, where the grading is composed of timed quizzes and coding assignments. Since the course is given for the first time, you are wondering whether you managed to strike a good balance in the first coding assignment. Let’s investigate!
At first, we have no idea what the distribution of these scores looks like. If the lecture notes sufficiently prepared students regardless of their backgrounds and the assessment tells students apart in a meaningful way, then the score distribution should be wide, with a single mode (i.e., peak). However, if (say) only students with a strong programming background could keep up, then they might form a peak at the higher end, distinct from the other students.
We might be tempted to fit a Gaussian to this data, or perhaps a simple GMM, but it is not clear whether such an approach will be able to offer a good description of the shape of the underlying distribution from which the data came.
Instead of assuming a specific parametric form for our model, we turn to nonparametric probabilistic approaches, which are designed to (at least in theory) adapt to any shape the data distribution may have. The nonparametric methods we will cover are typically used for exploratory analysis and visualisation, rather than for solving complex density-estimation problems.
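To make the contrast concrete, here is a minimal sketch comparing a single Gaussian fit against a kernel density estimate. The score data is synthetic and purely illustrative (a broad main group plus a small high-scoring cluster, mimicking the scenario above), and the bandwidth choice uses Silverman's rule of thumb; both are assumptions, not part of the course setup.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical assignment scores: a broad main group plus a smaller
# high-scoring cluster (e.g. students with a strong coding background).
scores = np.concatenate([
    rng.normal(55, 12, 80),
    rng.normal(88, 4, 20),
]).clip(0, 100)

def gaussian_pdf(x, mu, sigma):
    return np.exp(-0.5 * ((x - mu) / sigma) ** 2) / (sigma * np.sqrt(2 * np.pi))

# Parametric baseline: a single Gaussian fit by maximum likelihood.
mu, sigma = scores.mean(), scores.std()

# Nonparametric alternative: a Gaussian kernel density estimate,
# with bandwidth h chosen by Silverman's rule of thumb.
h = 1.06 * scores.std() * len(scores) ** (-1 / 5)

def kde(x, data, h):
    # Average one Gaussian bump centred on each observation.
    return gaussian_pdf(x[:, None], data[None, :], h).mean(axis=1)

grid = np.linspace(0, 100, 201)          # grid spacing of 0.5
parametric = gaussian_pdf(grid, mu, sigma)
nonparametric = kde(grid, scores, h)

# The KDE can reveal a second mode near the top of the scale,
# which the single Gaussian necessarily smooths away.
```

Plotting `parametric` and `nonparametric` against `grid` would show the Gaussian forcing a single symmetric peak, while the KDE is free to follow whatever shape the scores actually take.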