📈 Statistics Chapter 1 · deep practice

1.1 A – Identify population & sample (Pew Research)

Context: 1500 U.S. adults were asked: “do you think there is solid evidence for global warming?” 855 said yes.

(a) Identify the population. (b) Identify the sample. (c) Is the value “855” a parameter or statistic? Explain.

→ Population: The collection of all U.S. adults (their responses regarding global warming). [ref: p.8-9] → Sample: The 1500 U.S. adults who actually responded. [ref: p.9] → 855 yes’s: This is a statistic because it describes the sample (855/1500 = 57%). If it were the proportion of all U.S. adults, it would be a parameter. [ref: p.10-11]

1.1 B – Parameter or statistic? (petroleum engineers / Northwestern)

1) A survey of a sample of college career centers reported average starting salary for petroleum engineering majors = $83,121.
2) The 2182 students who accepted admission to Northwestern in 2009 have an average SAT score of 1442.

#1 salary $83,121: based on a sample → sample statistic. [p.11] #2 SAT 1442: based on all 2009 admitted students → describes the population (that specific cohort) → population parameter. [p.12] 💡 Even if the population is “all 2009 admits,” it’s still a parameter because it uses every individual of interest.

1.1 C – descriptive & inferential (longevity study)

A sample of men aged 48 was followed 18 years: 70% unmarried men alive at 65, 90% married men alive at 65. (Source: Journal of Family Issues)

a) Which part is descriptive? b) Give a possible inference.

Descriptive: the reported percentages “70% of unmarried in sample alive at 65” and “90% of married” — they summarize the observed data. [p.15] Inferential: “Being married is associated with longer life for men” (a conclusion that extends beyond the sample to the population). Also possible: “marriage may have a protective effect.” [p.15]

1.1 D – classify each value

(i) In a survey of 500 students, 62% prefer online learning. (ii) The registrar reports that exactly 12,840 students are enrolled at your university (all records).

(i) 62% based on 500 → statistic (sample). (ii) 12,840 is from entire enrollment → parameter (population).

1.2 A – Ford vehicles: classify data types

Model	Suggested retail price
Focus Sedan	$15,995
Fusion	$19,270
Mustang	$20,995
Edge	$26,920

Which are qualitative / quantitative?

Model names: categories (Focus, Fusion … ) → qualitative (non‑numerical entries). [p.21-22] Prices: numerical measurements → quantitative (discrete/ratio).

1.2 B – Zip code, height, telephone number

Classify: (a) Zip code (b) Height in cm (c) Telephone area code

(a) Zip code: usually qualitative (label, even if digits). (b) Height: quantitative. (c) Area code: qualitative (categorical).

1.3 A – identify collection method (flight patterns, oatmeal, puzzle, approval)

1) Effect of changing flight patterns on accident numbers.
2) Effect of oatmeal on lowering blood pressure.
3) How 4th graders solve a puzzle.
4) U.S. residents’ approval of the president.

1) Flight/accidents: Simulation (impractical/dangerous to create real changes). [p.31] 2) Oatmeal & BP: Experiment – apply treatment (eating oatmeal) vs control. [p.32] 3) Puzzle solving: Observational study – watch & measure without intervention. [p.33] 4) Approval rating: Survey – ask individuals. [p.34]

1.3 B – sampling: majors vs random numbers

You study opinion about stem cell research on campus.
(A) Divide by major, randomly pick some from each major.
(B) Assign each student a number, generate random numbers, question those selected.

(A) groups = majors → random from each group → stratified sampling. [p.42] (B) every student equally likely, random numbers → simple random sample. [p.43]

1.3 C – more sampling names

i) Choose every 50th household from a list.
ii) Divide city into city blocks, randomly pick 5 blocks and survey all households in those blocks.
iii) Stand outside library and ask people.

i) every kth → systematic. [p.41] ii) blocks = clusters, all members in selected clusters → cluster sample. [p.40] iii) library intercept → convenience (non‑probability).

1.3 D – classify study design

a) A medical researcher gives a new drug to some patients and a placebo to others, then measures cholesterol.
b) A sociologist examines public records to determine the correlation between crime rate and unemployment.

a) treatment assigned → experiment. b) no manipulation, just observes existing data → observational study.

1.3 E – Stratified or cluster?

A school has 4 grades (9‑12). To estimate average screen time, you randomly select 15 students from each grade.

Grades are natural strata (homogeneous within? maybe). You sample from every stratum → stratified. If you instead selected two entire grades (all students) it would be cluster.

1.3 F – identify from study

A researcher wants to know the average height of women aged 30 in Ohio. She measures 250 women and finds a mean of 164 cm. Is 164 cm a parameter or statistic?

164 cm describes the sample of 250 women, so it’s a statistic. If she measured every woman in Ohio aged 30, it would be a parameter.

1.3 G – design a experiment (control group)

You want to test whether a new teaching method improves exam scores. Describe a basic experimental design.

Take a random sample of students, randomly assign them to two groups: treatment (new method) and control (traditional). Compare final exam means. Control for teacher effect, time of day, etc. Ideally double‑blind.

1.3 H – simulation scenario

Why would a car manufacturer use crash test dummies and computer models instead of real humans?

Ethical and practical reasons: simulation reproduces conditions without risk. Also cheaper and repeatable.

1.3 I – identify possible errors

In a telephone survey about internet usage, only landline numbers are called. What bias might occur?

Undercoverage: households with only cell phones (often younger, lower income) are excluded. So sample not representative → nonresponse/coverage bias.

1.3 J – simple random sample procedure

731 stats students, you need a sample of 8. Using random digits, you start at row 3 col 2: 719, 662, 650, 004, 053, 589, 403, 129. Explain why 004 is included and why numbers > 731 are ignored.

Assign numbers 001–731. Read three‑digit groups. Ignore 000 and 732–999 because they don’t correspond to any student. 004 represents student #4. This gives each student an equal chance (simple random sample). [p.38]

Mixed – classify each statement

1) “52% of the sampled likely voters approve.” → parameter/statistic?
2) “The median household income in the US (from census) is $70,784.” → parameter/statistic?
3) Descriptive or inferential: “The correlation in the sample was -0.4, suggesting a negative relationship in the population.”

1) statistic (sample). 2) parameter (full census). 3) “correlation -0.4” = descriptive; “suggesting in population” = inferential part.

🧪 STATISTICS (Chapter 1 )

🧪 STATISTICS
(Chapter 1 )