Up Learn – A Level maths (edexcel) – Hypothesis Testing
Hypothesis Testing: Summary
Here’s a summary of everything you need to know about hypothesis testing for a population proportion at A Level.
A*/A guaranteed or your money back
More informationWant to see the whole course?
No payment info required!
More videos on Hypothesis Testing:
Introduction to Hypothesis Testing (free trial)
Statistics vs Parameters (free trial)
The World is Unknowable (free trial)
Don’t Treat Parameter Estimates as Final (free trial)
Population Proportions (free trial)
When Should We Question Our Parameter Estimates? (free trial)
Critical Regions With Two Tails (free trial)
One or Two Tails? (free trial)
Hypothesis Tests
2. Statistics vs Parameters (free trial)
3. The World is Unknowable (free trial)
4. Don’t Treat Parameter Estimates as Final (free trial)
5. Population Proportions (free trial)
6. When Should We Question Our Parameter Estimates? (free trial)
7. Critical Regions (free trial)
8. Critical Regions With Two Tails (free trial)
9. One or Two Tails? (free trial)
10. Critical Values (free trial)
11. Acceptance Region (free trial)
12. Type 2 Errors (free trial)
13. Type 1 Errors (free trial)
14. Remembering Which is Which (free trial)
15. Significance (free trial)
16. Significance Levels (free trial)
17. Significance Levels with Two Tails (free trial)
18. Actual Significance Levels (free trial)
19. Common Significance Levels (free trial)
20. Two Ways of Setting a Significance Level (free trial)
21. Finding Critical Regions (free trial)
22. The Term ‘Hypothesis Testing’ (free trial)
23. Population Parameters and Test Statistics (free trial)
24. What are Hypotheses? (free trial)
25. Null and Alternative Hypotheses (free trial)
26. Statistical Hypothesis Testing (free trial)
27. One-Tailed Tests and Two-Tailed Tests (free trial)
28. Representing the Null and Alternative Hypotheses (free trial)
29. Performing a One-Tailed Test (free trial)
30. A Second Way of Performing a One-Tailed Test (free trial)
31. Performing a Two-Tailed Test (free trial)
32. The Miraculous Dead Salmon (free trial)
2. Sample Means and Population Means (free trial)
3. How Sample Size Affects Sample Means (free trial)
4. Probabilities of Sample Means (free trial)
5. Modelling the Sample Mean (free trial)
6. The Standard Deviation of the Sample Mean’s Distribution (free trial)
7. When Should We Question Our Population Mean? (free trial)
8. Finding Critical Regions and Critical Values (free trial)
9. What Are the Hypotheses? (free trial)
10. Performing a One-Tailed Test for a Population Mean (free trial)
11. Performing a Two-Tailed Test for a Population Mean (free trial)
12. Testing a Population Mean with the Standard Normal Distribution (free trial)
13. Coding the Sample Means (free trial)
14. Hypothesis Testing With a Coded Sample Mean (free trial)
2. Correlation Coefficients from Samples and Populations (free trial)
3. Probabilities of Sample Correlation Coefficients (free trial)
4. Sample Size Matters (free trial)
5. Testing for Zero Correlation (free trial)
6. The Null and Alternative Hypotheses in PMCC Testing (free trial)
7. The Percentage Points Table (free trial)
8. Performing a One-Tailed Test for a Population PMCC (free trial)
9. Performing a Two-Tailed Test for a Population PMCC (free trial)
10. Where Were All the Distributions? (free trial)
11. Are Older People All Liars? (free trial)
Here’s a reminder of the key points you should know about hypothesis testing.
‘Statistics’ describe or summarise data from samples…
Whereas ‘parameters’ describe or summarise data from populations.
In the real world, we normally need to use statistics to estimate the values of parameters.
But because they’re estimates, it’s important that we never ‘accept’ those parameter values.
Instead, good statisticians keep testing their estimates, to make sure they’re good estimates.
For example, suppose we estimate that 22% of people in the UK have brown eyes.
To test that estimate, we could take a new sample of 150 people and look at their eye colour.
It’s theoretically possible we’d see anywhere between 0 and 150 people with brown eyes.
But the probability of some of those outcomes is so low that, if we observe them, we should assume that our parameter estimate is incorrect.
The question is: at what point do these outcomes, and therefore our parameter estimate, become too unlikely to believe?
Well, it varies depending on the context and the statistician.
But wherever they draw the line, statisticians call this region the ‘critical region’.
Any outcome in the critical region is so unlikely that, if that’s what we observe, we should reject the parameter estimate we started with.
There are some very unlikely outcomes on the left of the distribution, too.
So we can also create critical regions with two ‘tails’.
…where the probability of getting an outcome in one tail is always about equal to the probability of getting an outcome from the other tail.
Whether we create one or two tails depends on the context.
If we suspect the true parameter value might be lower than our estimate, we use a critical region like this.
If we suspect the true parameter value might be higher than our estimate, we use a critical region like this.
And if we suspect the real parameter value might be different to their estimate, but we’re not sure in what way, we use a critical region like this.
The first value inside a critical region is called a ‘critical value’.
And critical values allow us to describe critical regions with inequalities.
This region is called the ‘acceptance region’, even though we never actually ‘accept’ parameter estimates as the correct value.
Next, since we can’t know the true parameter value, we often make errors without realising.
If we fail to reject an estimate that’s secretly incorrect, that’s called a ‘type 2’ error.
If we do reject an estimate that’s secretly correct, that’s called a ‘type 1’ error.
And the probability of making a type 1 error is equal to the probability represented by the critical region.
Here’s a mnemonic which may help you remember which is which.
Making a type 1 error involves rejecting the parameter estimate.
Making a type 2 error involves accepting the parameter estimate.
‘Reject’ has 1 consonant in the middle.
‘Accept’ has 2.
Any result we observe in the critical region is called a ‘significant’ result.
The significance level of an experiment determines how big our critical region is and is equal to the probability of making a type 1 error.
When we’re working with two tails, we need to split the probability given by the significance level evenly between the two tails.
And actually, there’s a difference between the stated significance level, and the exact probability of finding an outcome in the critical region, since those rarely line up.
The stated version is just called the ‘significance level’, whilst the actual probability is usually referred to as the ‘actual significance level’.
A statistician can choose any significance level they like.
But 1%, 5% and 10% are especially common.
There are two slightly different ways of creating the critical region.
We could ensure that our actual significance level is no greater than the stated one.
Or, we could ensure that our actual significance level is as close as possible to the stated significance level – even if that means going over a bit.
We find critical regions by using the binomial cumulative distribution function on our calculators. And we speed up the process if we create a table of values.
Now, the process we’ve just seen…
That is…
Taking parameter estimates…
Taking a sample…
Seeing how the sample statistic compares with the parameter estimate…
And depending on how significant the statistic is, potentially rejecting the parameter estimate…
That’s collectively called ‘hypothesis testing’.
Although, there’s a bit more technical language to bring in.
First, another term for ‘parameter’ is ‘population parameter’.
And the statistic we get from our new sample is usually referred to as a ‘test statistic’.
Second, whenever we have a statement that we can test in some way, it’s called a hypothesis.
In our example, we’re testing the hypothesis: “22% of people in the UK have brown eyes.”
Although actually, when we’re hypothesis testing in the conventional way, we need two hypotheses: the null hypothesis, and the alternative hypothesis.
The null hypothesis states what we currently believe to be true.
And the alternative hypothesis states some alternative.
Third, any hypothesis that can be tested by using a probability distribution can be referred to as a ‘statistical hypothesis’.
So you’ll sometimes see hypothesis testing referred to as statistical hypothesis testing instead.
Fourth, hypothesis tests in which the critical region has one tail are called one-tailed tests.
And hypothesis tests in which the critical region has two tails are called two-tailed tests.
Fifth, we represent the null and alternative hypothesis using mathematical notation.
What goes on the right of the colon depends on whether we’re running a one-tailed test…
Or a two-tailed test.
Finally then, the exam can’t ask you to conduct an entire experiment yourself.
So instead, they may tell you a sample was taken, tell you the significance level, and ask you to perform the rest of the hypothesis test.
Here’s what that looks like for a one-tailed test.
First, write down the null and alternative hypotheses.
Second, write down the distribution we’re using.
Third, find and state the critical region.
Fourth, compare the test statistic to the critical region.
And fifth, write a conclusion.
An alternative method is to find the cumulative probability associated with the test statistic instead.
Then, compare that to the significance level and conclude.
In the exam you can generally do it either way.
That is, unless they specifically ask you to do it one way.
The working looks very similar for a two-tailed test.
First, write down the null and alternative hypotheses.
Second, write down the distribution we’re using.
Third, find and state the critical region, now with two tails.
Fourth, compare the test statistic to the critical region.
And fifth, write a conclusion.
Alternatively, find the cumulative probability associated with the test statistic instead.
69 out of 80 is 86.25%, so if it falls in either tail it would be the one on the left.
So, calculate this probability
Then, compare that to half the significance level, since this is a two-tailed test, and conclude.
WHAT YOU GET
Every course includes
Interactive Video Lessons
Video content that keeps you engaged and regular activities that keep you from losing focus
Detailed Quizzes
Technical, Memorisation and Mastery quizzes gradually build up your knowledge and understanding
Exclusive Practice Papers
Written by real examiners exclusively for Up Learn in order to give you additional confidence when preparing for exams
Progress Tracker to A*
Bespoke assessment and practice questions to chart your grade gains as you progress through your A Level Maths revision