Wage Differentials Among Occupations: Why we need ANOVA (Part II)
- Due May 17, 2024 by 11:59pm
- Points 0
- Available until May 17, 2024 at 11:59pm
See the note about this table from Part I:
Business & Financial Occupations (sample)
|
|
Community & Social Service Occupations (sample) |
|
Healthcare Support Occupations (sample) |
|
---|---|---|---|---|---|
13-1071 |
29.77 |
21-1012 |
27.42 |
31-1131 |
14.26 |
13-1111 |
40.99 |
21-1015 |
17.38 |
31-9091 |
19.27 |
13-1198 |
35.37 |
21-1018 |
22.23 |
31-9092 |
16.73 |
13-2011 |
34.40 |
21-1021 |
22.78 |
31-9011 |
20.59 |
|
|
21-1093 |
16.85 |
31-2021 |
28.26 |
(Reminder: these are not truly random samples, but we will ignore that for this exercise)
We will use a method called ANalysis Of VAriance, or ANOVA. This type of hypothesis tests uses the F distribution (which you may have seen before in conducting a hypothesis test on two variances or standard deviations). But the F sample statistic is calculated differently for ANOVA.
The data values within each group vary. The data values between the groups also vary. In ANOVA, . If F is "large" then we can conclude that the mean values for the groups really differ. The null hypothesis in an ANOVA test is always that "all the population means are equal." The alternative hypothesis is that "at least two population means are different." (ANOVA does not tell us which ones.) It is always a right-tail test.
The OpenStax e-text (Ch.13) explains the theory and the computations in more detail. Most calculators and apps with statistical functions will automate the calculations. The important thing is how to interpret the results.
- Conduct an ANOVA hypothesis test to investigate whether different occupational categories have different average hourly wages (among the detailed occupations). Remember to include all steps and use a 1% level of significance.
NOTE: The format of your output might vary, but it should include the following:
F = 13.3398 p = .0011 (What does p tell you?)
Factor (relating to the numerator of the F statistic)
df = 2 ( k - 1 where k is the number of groups being compared)
SS = 611.02 (technical calculation - see text)
MS = 305.51 (Variance between groups - "mean square between")
Error (not a mistake! relating to the denominator of the F statistic)
df = 11 (N - k where N is the total sample size in all categories and k is the number of categories)
SS = 251.92 (technical calculation - see text)
MS = 22.90 (Variance within groups - "mean square within")
Sxp = 4.786 (pooled standard deviation)
- Since we have found that at least two means are different, we probably want to know which ones. One method is to create side-by-side box plots for the groups. Use technology to do this. What do you find? Look back at the table and see why this is reasonable.
- The table below gives the actual jobs instead of the numerical codes. Why do you think our society rewards some categories of jobs more than others? What question(s) do the data raise for you? Post a comment in the Wage Differentials Discussion.
Business & Financial Occupations (sample)
|
|
Community & Social Service Occupations (sample) |
|
Healthcare Support Occupations (sample) |
|
---|---|---|---|---|---|
HR specialists |
29.77 |
Educational, guidance and career counselors |
27.42 |
Nursing assistants |
14.26 |
Management analysts |
40.99 |
Rehabilitation counselors |
17.38 |
Dental assistants |
19.27 |
Project management |
35.37 |
Substance abuse, behavioral disorder and mental health counselors |
22.23 |
Medical assistants |
16.73 |
Accountants and auditors |
34.40 |
Child, family and school social workers |
22.78 |
Massage therapists |
20.59 |
|
Social and human service assistants |
16.85 |
Physical therapy assistants |
28.26 |