|
|
|
||||
|
|
|||||
Cronbach's alpha can be written as a function of the number of test items AND the average inter-correlation among the items. Below, for conceptual purposes, we show the formula for the standardized Cronbach's alpha:
Here N is equal to the number of items, c-bar is the average inter-item covariance among the items and v-bar equals the average variance.
One can see from this formula that if you increase the number of items, you increase Cronbach's alpha. Additionally, if the average inter-item correlation is low, alpha will be low. As the average inter-item correlation increases, Cronbach's alpha increases as well.
This makes sense intuitively - if the inter-item correlations are high, then there is evidence that the items are measuring the same underlying construct. This is really what is meant when someone says they have "high" or "good" reliability. They are referring to how well their items measure a single unidimensional latent construct.
Thus, if you have multi-dimensional data, Cronbach's alpha will generally be low for all items. In this case, run a factor analysis to see which items load highest on which dimensions, and then take the alpha of each subset of items separately.
I. An example
To compute Cronbach's alpha for all four items - q1, q2, q3, q4 - use the RELIABILITY command:DATA LIST FREE
- Let's work through an example of how to compute Cronbach's alpha using SPSS, and how to check the dimensionality of the data using factor analysis.
- First, let's start with a dataset consisting of four test items - q1, q2, q3, and q4:
/q1 q2 q3 q4.
BEGIN DATA.
2 3 5 5
5 5 4 4
4 5 5 5
4 3 4 4
3 3 5 5
3 3 4 5
3 4 4 4
4 4 5 5
4 5 5 5
4 4 3 3
4 4 5 5
5 5 4 4
4 4 4 4
4 3 5 5
4 4 5 5
3 3 4 5
4 5 4 4
5 5 5 5
5 5 4 4
4 4 4 4
4 4 4 4
4 4 4 4
3 4 5 5
5 3 5 5
5 5 3 3
3 3 4 4
4 4 4 4
3 3 5 5
4 4 3 3
4 4 5 5
4 4 5 5
4 5 5 5
4 4 5 5
4 5 5 5
4 4 5 5
3 3 4 4
4 3 5 4
3 4 5 5
4 4 5 4
3 4 4 4
4 5 5 5
5 5 5 5
4 4 5 5
4 4 4 4
4 4 5 5
3 4 4 4
5 5 5 5
4 5 4 4
3 4 4 4
5 3 4 4
5 3 4 4
4 5 4 4
2 5 5 5
3 4 5 5
4 3 5 5
4 4 4 4
4 4 5 5
3 4 4 4
4 4 5 4
4 4 5 5
END DATA.
RELIABILITY
/VARIABLES=q1 q2 q3 q4.
Here is the resulting output from the above syntax:
R E L I A B I L I T Y A N A L Y S I S - S C A L E (A L L)Reliability Coefficients
N of Cases = 60.0 N of Items = 4
Alpha = .3924
Here, the reliability is shown to be low using all four items because alpha is .3924. (Note that a reliability coefficient of .70 or higher is considered "acceptable" in most Social Science research situations). Perhaps the data are multidimensional? To check the dimensionality of the data, use the factor command:
FACTORHere is the resulting output from the above syntax:
/VARIABLES q1 q2 q3 q4
/FORMAT SORT BLANK(.35).
| Initial Eigenvalues | Extraction Sums of Squared Loadings | Rotation Sums of Squared Loadings | |||||||
|---|---|---|---|---|---|---|---|---|---|
| Component | Total | % of Variance | Cumulative % | Total | % of Variance | Cumulative % | Total | % of Variance | Cumulative % |
| 1 | 1.926 | 48.139 | 48.139 | 1.926 | 48.139 | 48.139 | 1.890 | 47.247 | 47.247 |
| 2 | 1.306 | 32.642 | 80.781 | 1.306 | 32.642 | 80.781 | 1.341 | 33.534 | 80.781 |
| 3 | .654 | 16.349 | 97.130 | ||||||
| 4 | .115 | 2.870 | 100.000 | ||||||
| Extraction Method: Principal Component Analysis. | |||||||||
| Factor | ||
|---|---|---|
| 1 | 2 | |
| Q3 | .968 | |
| Q4 | .967 | |
| Q2 | .827 | |
| Q1 | .807 | |
| Extraction
Method: Principal Component Analysis.
Rotation Method: Varimax with Kaiser Normalization. |
||
| a Rotation converged in 3 iterations. | ||
Notice that the data are not unidimensional. That is, q3 and q4 do not seemingly measure the same latent construct as q1 and q2.
Now, let's estimate the reliability of these two "subsets" of items separately:
RELIABILITYHere is the resulting output from the above syntax:
/VARIABLES=q1 q2 q3 q4
/SCALE(Q1_Q2)=q1 q2
/SCALE(Q3_Q4)=q3 q4.
R E L I A B I L I T Y A N A L Y S I S - S C A L E (Q 1 _ Q 2)Here we see that the reliability for items q3 and q4 is very high, while the reliability for q1 and q2 is lower. Both estimates of reliability, however, are higher than when using all four items for measuring the same construct. This result also implies that the correlation between items q3 and q4 is higher than the correlation between q1 and q2. To check that this is indeed true, use the correlations command:Reliability Coefficients
N of Cases = 60.0 N of Items = 2
Alpha = .5045
R E L I A B I L I T Y A N A L Y S I S - S C A L E (Q 3 _ Q 4)
Reliability Coefficients
N of Cases = 60.0 N of Items = 2
Alpha = .9368
CORRELATIONS VARIABLES=q1 q2 q3 q4.
| Q1 | Q2 | Q3 | Q4 | ||
|---|---|---|---|---|---|
| Q1 | Pearson Correlation | 1.000 | .337 | -.084 | -.168 |
| Sig. (2-tailed) | . | .008 | .525 | .200 | |
| N | 60 | 60 | 60 | 60 | |
| Q2 | Pearson Correlation | .337 | 1.000 | .001 | -.037 |
| Sig. (2-tailed) | .008 | . | .992 | .777 | |
| N | 60 | 60 | 60 | 60 | |
| Q3 | Pearson Correlation | -.084 | .001 | 1.000 | .881 |
| Sig. (2-tailed) | .525 | .992 | . | .000 | |
| N | 60 | 60 | 60 | 60 | |
| Q4 | Pearson Correlation | -.168 | -.037 | .881 | 1.000 |
| Sig. (2-tailed) | .200 | .777 | .000 | . | |
| N | 60 | 60 | 60 | 60 | |
Thus, since the data are not unidimensional, all four items SHOULD NOT be combined to create one single scale. Instead, q1 and q2 should be combined to create one scale, and q3 and q4 should be combined to create a second scale.
For more information...
UCLA Researchers are invited to our Statistical Consulting Services
We recommend others to our list of Other Resources for Statistical Computing Help
These pages are Copyrighted (c) by UCLA Academic Technology Services