At the end of this lesson, the student is expected to:
<aside> 📢 Now would be a good time to quickly review the most common and basic distributions used in probability and statistics, including their multivariate versions. Wikipedia has a big list of probability distributions, many many more that you ever will need. You should be able to find a more compact list in your basic probability textbook.
</aside>
Suppose you toss a coin $n$ times and observe the outcomes $\mathcal{X} = \{\text x_1, \dots, \text x_n\}$, where $\textbf x_i=1$ means heads and $\textbf x_i=0$ means tails. How can we build a probabilistic ****model of this coin using the data? Formally, our goal is to construct a probability distribution represented by the probability density function $p_\mathbf{x}$ that assigns a probability to each of the two possibilities (heads or tails).
Distributions are defined by two things: their model family, and their parameter values.
We will assume that the coin flips follow a Bernoulli distribution: