Lecture 24
Duke University
STA 199 - Fall 2025
November 25, 2025
Given that HW 6 will be assigned on Monday, Dec 1, what day should HW 5 be due? It will be due at 11:59 pm on the deadline we decide on.

Scan the QR code or go to app.wooclap.com/sta199. Log in with your Duke NetID.
Find range of plausible values for the slope using bootstrap confidence intervals.
openintro::duke_forest

Goal: Use the area (in square feet) to understand variability in the price of houses in Duke Forest.
Which of the following is the correct interpretation of the intercept?
| term | estimate | std.error | statistic | p.value |
|---|---|---|---|---|
| (Intercept) | 116652 | 53302 | 2 | 0 |
| area | 159 | 18 | 9 | 0 |

Scan the QR code or go to app.wooclap.com/sta199. Log in with your Duke NetID.
Which of the following is the correct interpretation of the slope?
| term | estimate | std.error | statistic | p.value |
|---|---|---|---|---|
| (Intercept) | 116652 | 53302 | 2 | 0 |
| area | 159 | 18 | 9 | 0 |

Scan the QR code or go to app.wooclap.com/sta199. Log in with your Duke NetID.
For each additional square foot, we expect the sale price of Duke Forest houses to be higher by $159, on average.
Statistical inference provide methods and tools so we can use the single observed sample to make valid statements (inferences) about the population it comes from
For our inferences to be valid, the sample should be random and representative of the population we’re interested in
Calculate a confidence interval for the slope, \(\beta_1\) (today)
Conduct a hypothesis test for the slope,\(\beta_1\) (next week)
A confidence interval will allow us to make a statement like “For each additional square foot, the model predicts the sale price of Duke Forest houses to be higher, on average, by $159, plus or minus X dollars.”
Should X be $10? $100? $1000?
If we were to take another sample of 98 would we expect the slope calculated based on that sample to be exactly $159? Off by $10? $100? $1000?
The answer depends on how variable (from one sample to another sample) the sample statistic (the slope) is
We need a way to quantify the variability of the sample statistic
for estimation










so on and so forth…




Look familiar?


Fill in the blank: For each additional square foot, the model predicts the sale price of Duke Forest houses to be higher, on average, by $159, plus or minus ___ dollars.
Fill in the blank: For each additional square foot, we expect the sale price of Duke Forest houses to be higher, on average, by $159, plus or minus ___ dollars.
How confident are you that the true slope is between $0 and $250? How about $150 and $170? How about $90 and $210?
Go to your ae project in RStudio.
If you haven’t yet done so, make sure all of your changes up to this point are committed and pushed, i.e., there’s nothing left in your Git pane.
If you haven’t yet done so, click Pull to get today’s application exercise file: ae-15-duke-forest-bootstrap.qmd.
Work through the application exercise in class, and render, commit, and push your edits.