STAT 1010 - Fall 2022
As the number of trials increases, the binomial pdf becomes well approximated by a normal distribution.
“Observed data often represent the accumulation of many small factors.”
\(Y \sim N(\mu = 4, \sigma^2 = 25)\)
The probability distribution of a sum of independent random variables of comparable variance approaches a normal distribution as the number of summed random variables increases.
Historically, it could be quite hard to find probabilities, so standardizing was important.
02:00
02:00
Let \(Y \sim N(\mu_Y = 5, \sigma_Y^2 = 4)\), standardize the following and find the probability:
\(P(Y < 3)\)
\[\begin{aligned} P(Y < 3) &= P(\frac{Y - \mu_Y}{\sigma_Y} < \frac{3-\mu_Y}{\sigma_Y}) \\ & = P(Z < \frac{3-5}{2} = -1) \end{aligned}\]
pnorm(-1)
\(0.1586553\)
pnorm(-1, mean = 5, sd = 2)
02:00
Percentiles are always to the left.
Let \(Y \sim N(\mu_Y = 5, \sigma_Y^2 = 4)\), standardize the following and find the probability:
\(P(Y > 7)\)
\[\begin{aligned} P(Y > 7) &= P(\frac{Y - \mu_Y}{\sigma_Y} > \frac{7-\mu_Y}{\sigma_Y}) \\ & = P(Z > \frac{7-5}{2} = 1) \end{aligned}\]
pnorm(1)
\(0.8413447\)
pnorm(1, mean = 5, sd = 2)
02:00
1 - pnorm(1)
\(0.1586553\)1 - pnorm(1, mean = 5, sd = 2)
02:00
qnorm(.1711)
\(-0.9498273\)qnorm(.01, mean = 10, sd = 2.5)
\(4.18413\)02:00
Looking at the data
R
If you see departures from normality (large or small kurtosis, QQ plots that deviate from a straight line) PLOT the data and check.