
STAT 7 - Statistical Methods for the Biological, Environmental & Health Sciences
10 Mar 2026
This is where probability meets inference!
The most important continuous distribution in statistics!
Properties:
History: The name comes from early statisticians who thought this distribution was the “norm” or typical pattern in nature.
Reality: It’s common but not universal!
\[X \sim N(\mu, \sigma)\]

\(\mu = 0, \sigma = 1\)

\(\mu = 20, \sigma = 3\)
Same shape, different location and spread!
Empirical Rule: For any normal distribution:

This is incredibly useful for quick estimates!
Heights in this class are approximately \(N(168, 7)\) (cm)
Using the 68-95-99.7 rule:
Z-score = number of standard deviations from the mean
\[z = \frac{x - \mu}{\sigma}\]
Properties:
Purpose: Allows comparison across different normal distributions!
SAT: \(N(1500, 300)\), ACT: \(N(21, 5)\)
Who did better relative to their test?
Pam’s z-score: \[z = \frac{1800 - 1500}{300} = 1.0\]
Jim’s z-score: \[z = \frac{24 - 21}{5} = 0.6\]
Pam scored better (1 SD vs 0.6 SD above mean)
NORM.DIST(x, mean, sd, cumulative)
NORM.INV(probability, mean, sd)
Example: For \(N(168, 7)\), what’s P(X ≤ 175)?
=NORM.DIST(175, 168, 7, TRUE) = 0.841
Heights: \(N(168, 7)\) cm
Calculate using Google Sheets:
=1 - NORM.DIST(168, 168, 7, TRUE) = 0.50=NORM.DIST(175, 168, 7, TRUE) - NORM.DIST(161, 168, 7, TRUE)=NORM.INV(0.70, 168, 7) = 171.7 cmLet’s explore with the Normal Distribution App:
https://istats.shinyapps.io/NormalDist/
Try different values of \(\mu\) and \(\sigma\)!
Example: Test scores are \(N(100, 15)\)
Using Google Sheets:
=1 - NORM.DIST(130, 100, 15, TRUE) = 0.0228 ≈ 2.3%=NORM.INV(0.90, 100, 15) = 119.2=NORM.DIST(115, 100, 15, TRUE) - NORM.DIST(85, 100, 15, TRUE)How do we know if data are approximately normal?
Remember: Real data are never exactly normal, we look for “close enough”
Examples of NON-normal distributions:
Right-skewed:
Left-skewed:
Bimodal, uniform, etc.
Don’t force normality where it doesn’t exist!
Stand up, stretch, take a deep breath!
\(X \sim Bin(n, p)\) counts successes in n trials
Question: Can we use the normal distribution to approximate?
When n is large, the binomial distribution starts to look normal!
Rule of thumb: Normal approximation is good when:
\[np \geq 10 \quad \text{AND} \quad n(1-p) \geq 10\]
Why these conditions?
If conditions met: Use \(N(\mu, \sigma)\) where:
\[\mu = np \qquad \sigma = \sqrt{np(1-p)}\]
Check if normal approximation is appropriate:
Problem: Binomial is discrete, Normal is continuous
Solution: Continuity correction
Why? The bar for X = k extends from k - 0.5 to k + 0.5
Setup: Flip a fair coin 100 times. Find P(X ≥ 60) where X = # heads.
Check conditions:
Using normal approximation:
\[\mu = np = 50, \quad \sigma = \sqrt{np(1-p)} = \sqrt{25} = 5\]
With continuity correction: P(X ≥ 60) ≈ P(Z > 59.5)
In Google Sheets: =1 - NORM.DIST(59.5, 50, 5, TRUE) = 0.0287
About 2.9% chance of 60+ heads
For the coin example (n=100, p=0.5):
Exact (using Binomial):
=1 - BINOM.DIST(59, 100, 0.5, TRUE)
Result: 0.0284
Approximate (using Normal):
=1 - NORM.DIST(59.5, 50, 5, TRUE)
Result: 0.0287
Very close! Normal approximation works well here.
So far (Weeks 1-4):
Descriptive statistics & Probability
Coming up (Weeks 5-10):
Inferential statistics
Bridge: Sampling distributions!
Examples:
Examples:
Key distinction:
Parameter → Population → Unknown (usually)
Statistic → Sample → Known (calculated)
Goal of inference: Use statistics to estimate parameters!
Scenario: Study average sleep hours of UCSC students
Question: Is \(\bar{x} = 6.8\) exactly equal to μ?
Answer: Probably not! There’s sampling variability
Key insight: Different samples give different statistics!
This is natural and expected!
What we’ve built:
What’s next (Week 5+):
Before you leave:
Name one new thing you learned today
Check approximation: n=50, p=0.15. Can we use normal approximation?
Office hours: I’ll be in the classroom nextdoor after class if you need me. If you can’t stay, check our office hours on Ed Discussion and look for a spot that works with your schedule.
Practice is essential: Don’t forget to think deeply when solving your assignments. Don’t do it as a check list, but rather as an opportunity to think about the concepts that we have learned.
See you next week!
Definition: The probability distribution of a statistic across all possible samples of size n
Thought experiment:
That histogram approximates the sampling distribution of \(\bar{x}\)
Notice: It’s approximately normal and centered at μ!
For samples of size n from a population with mean μ and SD σ:
1. Center: \[E(\bar{x}) = \mu\]
The mean of the sampling distribution equals the population mean
2. Spread: \[SD(\bar{x}) = \frac{\sigma}{\sqrt{n}}\]
Called the standard error (SE)
3. Shape: Depends on population and sample size (more soon!)
Important distinction!
\[SE = \frac{\sigma}{\sqrt{n}}\]
Intuition: Larger samples give more stable estimates
One of the most important theorems in statistics!
For samples of size n from ANY population with mean μ and SD σ:
As n increases, the sampling distribution of \(\bar{x}\) becomes approximately normal with:
- Mean = μ
- Standard deviation = \(\frac{\sigma}{\sqrt{n}}\)
In notation:
\[\bar{x} \sim N\left(\mu, \frac{\sigma}{\sqrt{n}}\right) \quad \text{approximately, for large } n\]
Original population: Right-skewed (NOT normal)
Notice: As n increases, sampling distribution becomes normal!
General rule: n ≥ 30 is usually sufficient
BUT this depends on population shape:
Key point: More skewed population → need larger n
Problem: Average adult height μ = 170 cm, σ = 10 cm.
For a random sample of n = 36 adults, find P(\(\bar{x}\) > 172).
Solution:
By CLT, \(\bar{x} \sim N(170, \frac{10}{\sqrt{36}}) = N(170, 1.67)\)
In Google Sheets:
=1 - NORM.DIST(172, 170, 1.67, TRUE)
Result: 0.115
About 11.5% chance the sample mean exceeds 172 cm
Scenario: Light bulb manufacturer
Solution:
\(\bar{x} \sim N(1000, \frac{100}{\sqrt{25}}) = N(1000, 20)\)
=NORM.DIST(1010, 1000, 20, TRUE) - NORM.DIST(990, 1000, 20, TRUE)
Result: 0.383
38.3% of samples will have mean between 990-1010 hours
| Distribution | Type | When to Use | Parameters |
|---|---|---|---|
| Binomial | Discrete | Count successes in n trials | n, p |
| Normal | Continuous | Symmetric, bell-shaped data | μ, σ |
| Sampling Dist of \(\bar{x}\) | Theoretical | Distribution of sample means | μ, σ/√n |
Remember: CLT tells us the third one is approximately Normal!
This is the foundation for all of inferential statistics!
![]()
STAT 7 – Winter 2026