UCLA Academic Technology Services HomeServicesClassesContactJobs

Stata Data Analysis Examples
Zero-Truncated Poisson

Examples of Zero-Truncated Poisson

Example 1. A study of length of hospital stay, in days, as a function of age, kind of health insurance and whether or not the patient died while in the hospital. Length of hospital stay is recorded as a minimum of at least one day.

Example 2. A study of the number of journal articles published by tenured faculty as a function of discipline (fine arts, science, social science, humanities, medical, etc). To get tenure faculty must publish, i.e., there are no tenured faculty with zero publications.

Example 3. A study by the county traffic court on the number of tickets received by teenagers as predicted by school performance, amount of driver training and gender. Only individuals who have received at least one citation are in the traffic court files.

Description of the Data

Let's pursue Example 1 from above.

We have a hypothetical data file, ztp.dta with 1,493 observations. The length of hospital stay variable is stay. The variable age gives the age group from 1 to 9 which will be treated as interval in this example. The variables hmo and died are binary indicator variables for HMO insured patients and patients who died while in the hospital, respectively.

Let's look at the data.

Some Strategies You Might Be Tempted To Try

Before we show how you can analyze this with a zero-truncated Poisson analysis, let's consider some other methods that you might use.

Stata Zero-Truncated Poisson Analysis

The output looks very much like the output from an OLS regression. The output begins the iteration log giving the values of the log likelihoods starting with a model that has no predictors. The last value in the log is the final value of the log likelihood for the full model and is repeated below.

Next comes the header information. On the right-hand side the number of observations used (1493) is given along with the likelihood ratio chi-squared with three degrees of freedom for the full model, followed by the p-value for the chi-square. The model, as a whole, is statistically significant. The header also includes a pseudo-R2 which is very low in this example (0.0129).

Below the header you will find the zero-truncated poisson coefficients for each of the variables along with standard errors, z-scores, p-values and 95% confidence intervals for the coefficients.

Now, just to be on the safe side, let's rerun the ztp command with the robust option in order to obtain robust standard errors for the zero-truncated Poisson coefficients.

Using the robust option has resulted in a fairly large change in the model chi-square, which is now a Wald chi-square, based on log pseudolikelihoods, instead of a likelihood ratio chi-square.

In the main body of the output are the zero-truncated poisson coefficients, robust standard errors, z-scores, p-values and 95% confidence intervals for the coefficients. The variable age was significant without the robust option and is not significant with it. The robust standard errors attempt to adjust for heterogeneity in the model.

Zero-truncated poisson models can also display results as incidence rate ratios using the irr option.

Sample Write-Up of the Analysis

Before we begin the sample write-up we need to get the output into a form more acceptable for publication. In order to go back to the non-exponentiated version of the coefficients we will quietly rerun the ztp command. The estout command (findit estout by Ben Jann of ETH Zurich), will get us close to what we want. With a little bit of manual editing we can produce an acceptable table of the output. The zero-truncated Poisson regression model predicting length of hospital stay from age, hmo membership and death during the hospital stay was statistically significant (chi-squared = 25.65, df = 3, p<.001). The predictors hmo and died were each statically significant. The effect of age was not significant at the .05 level. For these data the expected log count for those enrolled in an hmo was -0.14 that of those not so enrolled. This amounts to a difference of about 1.25 days. Patients who died during the hospital stay had an expected log count difference of -0.20 or about 1.9 days.

Cautions, Flies in the Ointment

  • It is not recommended that zero-truncated poisson models be applied to small samples. What constitutes a small sample does not seem to be clearly defined in the literature.
  • See Also

     

    How to cite this page

    Report an error on this page

    UCLA Researchers are invited to our Statistical Consulting Services
    We recommend others to our list of Other Resources for Statistical Computing Help
    These pages are Copyrighted (c) by UCLA Academic Technology Services


    The content of this web site should not be construed as an endorsement of any particular web site, book, or software product by the University of California.