UCLA Academic Technology Services HomeServicesClassesContactJobs
Search

SPSS Data Analysis Examples
Probit Regression

Examples

Example 1:  Suppose that we are interested in factors that influence whether or not a political candidate wins an election.  Our outcome variable has only two possible values:  win or not win.  We believe that factors such as the amount of money spent on the campaign, the amount of time spent campaigning negatively and whether the candidate is an incumbent affect whether the candidate wins the election.  Because our outcome variable is binary (either the candidate wins or does not win), we need to use a model that handles this feature correctly. 

Example 2:  Some people have heart attacks and others don't.  We would like to see if exercise, age and gender influences whether or not someone has a heart attack.  Again, we have a binary outcome:  have heart attack or not. 

Example 3:  Many undergraduates wish to continue their education in graduate school.  In their application to any given graduate program, they include their GRE scores and their GPA from their undergraduate institution.  Some students are graduating from very prestigious institutions, while others are graduating from not-so-prestigious institutions.  Many months after sending in their applications, students receive either a thick or a thin envelope from the graduate program to which they applied:  some were admitted and others were not.

Description of the Data

For our data analysis below, we are going to expand on our third example about getting into graduate school.  We have generated hypothetical data, which can be obtained by clicking on probit.sav. You can store this anywhere you like, but our examples will assume it has been stored in c:\data. First, we read the data file into SPSS.
get file = "c:\data\probit.sav".

This hypothetical data set has a 0/1 variable called admit that we will use as our response (i.e., outcome, dependent) variable.  We also have three variables that we will use as predictors:  gre, which is the student's Graduate Record Exam score; gpa, which is the student's grade point average; and topnotch, which is a 0/1 variable where 1 indicates that the undergraduate institution was "top notch" and 0 indicates that it is not. 

descriptives /variables=gre gpa.
frequencies /variables=topnotch.

Some Strategies You Might Try

Using the Probit Model

Before we run our probit model, we will see if any cells (created by the crosstab of our categorical and response variables) are empty or particularly small.  If any are, we may have difficulty running our model. 

crosstabs /tables=admit by topnotch.
 

None of the cells is too small or empty (has no cases), so we will run our model.

PLUM
admit WITH gre topnotch gpa
/LINK = PROBIT
/PRINT = PARAMETER SUMMARY .

 

In the output the first thing we see is a warning that about 48% of cells have zero frequencies. There are empty cells because there are continuous variables (gre and gpa) in our model. However, we checked to be sure there were no empty cells when we did the crosstab of the categorical predictor variable (topnotch) and our outcome variable (admit). Because we did this, and everything looked okay, we will continue with the analysis.

 This shows the number of observations and the coding for the outcome variable, admit. We see that all 400 observations have been used. Fewer observations would have been used if any of our variables had missing values.  By default, SPSS does a listwise deletion of cases with missing values. 

The Model Fitting Information table contains information on a model containing an intercept only (labeled "Intercept Only" in the left column) and the model we asked SPSS to estimate (labeled "Final").  Looking at the column labeled "-2 Log Likelihood" we can compare the values for the intercept only model and the final model, to determine if including gre, topnotch, and gpa in the model improves the fit of the model. To do this, we could subtract the -2 log likelihood for the final model from the same for the intercept only model, but SPSS has done this for us and placed it in the next column over, labeled "Chi-Square" because this value is distributed chi-square with degrees of freedom equal to the number of independent variables in the model (in this case, 3). The likelihood ratio chi-square with a p-value of less than 0.0004 tells us that our model as a whole fits significantly better than the model with no predictors (the "intercept only" model).

Three different pseudo-R-squared are given, there are many different types of pseudo-R-squareds.  It is a pseudo-R-squared because there is no direct equivalent of an R-squared (from OLS regression) in non-linear models.  Because it is an approximation to the R-squared values you see in OLS, each type of pseudo-R-squared will give a different estimate.

 

 In the table we see the coefficients, their standard errors, the Wald tests, df (degrees of freedom), the associated p-values (labeled "Sig."), and the 95% confidence interval of the coefficients.  Both gre and gpa are statistically significant; topnotch is not.  A discussion of the interpretation of the coefficients can be found in the sample write up section below.

Sample Write-up of the Analysis

Below is one way of describing the results.  Please note that the coefficients can be discussed in terms of either Z-scores or probit index.  These are equivalent terms. Note that they can not be interpreted the way OLS coefficients are interpreted.

The Z-score of a person with a zero GRE score and zero GPA at a non-topnotch school is about -2.8.  For each point of increase in GRE score, the Z-score is increased by .0015244; for each point of increase in GPA, the probit index increases by .4.

Similarities and differences between logit and probit models

Neither the logit model nor the probit model are linear, which makes things difficult.  To make the model linear, a transformation is done on the dependent variable.  In logit regression, the transformation is the logit function which is the natural log of the odds.  In probit models, the function used is the inverse of the standard normal cumulative distribution (a.k.a. a z-score).  In reality, this difference isn't too important:  both transformations are equally good at linearizing the model; which one you use is a matter of personal preference.  Both models need to have diagnostics done afterwards to check that the assumptions of the model have not been violated.  Both methods use maximum likelihood, and so require more cases than a similar OLS model.  Unlike logit models, you don't get odds ratios with probit models.  In general, the logit coefficients are larger than the probit coefficients by a factor of 1.7.  However, this rule often does not apply when an independent variable has a high standard error (lots of variability).

Cautions, Flies in the Ointment

See Also


How to cite this page

Report an error on this page

UCLA Researchers are invited to our Statistical Consulting Services
We recommend others to our list of Other Resources for Statistical Computing Help
These pages are Copyrighted (c) by UCLA Academic Technology Services


The content of this web site should not be construed as an endorsement of any particular web site, book, or software product by the University of California