Previous PageTable Of Contents

Annex I. Description of Gibbs-sampling data-augmentation algorism used in the estimation of the models


Estimation of the probit model
Estimation of the Tobit model

Estimation of the probit–Tobit model


Estimation of the probit model

More formally, with the usual, non-informative prior over the unknown quantities,  p(z, a) µ constant, the full conditional distributions of the two vector blocks z and a are easily established. Respectively, the conditional distribution of z given a is normal with mean the prediction vector Ez º xa and covariance the N × N identity matrix Vz º IN. The conditional distribution of a given z is also normal, but with mean the k-vector Ea º (x¢ x)–1x¢ z and covariance Va º   (x¢ x)–1. Accordingly, a Gibbs-sampling, data-augmentation algorism can be constructed to simulate draws from the joint posterior distribution p( z, a | y ). There are five sequential steps:

Step 1: Select starting values a(s).

Step 2: Draw z(s) from the multivariate normal distribution N(xa(s), IN), where a(s) implies conditioning on a(s) from Step 1 and the draws are made such that zi £ 0 when y = 1 is observed, and zi < 0 otherwise.

Step 3: Draw a(s + 1) from the multivariate normal distribution 
N((x¢ x)–1x¢z(s), (x¢x)–1), where z(s) denotes conditioning on z(s) from step 2                                           (4)

Step 4: Repeat steps 1–3 many times, S1, until convergence is attained.

Step 5: Repeat steps 1–3 many times, S2, and collect samples {z(s) s = 1, 2, ..., S} and {a(s) s = 1, 2, ..., S}.

The draws in the last step can be used to compute means, standard errors or, indeed, to plot histograms of any characteristic of interest in the posterior (Gelman et al. 1995). In the reports of the empirical results that follow, the algorism is run for a burn-in phase of S1 = 5000 observations followed by a collection phase of S2 = 5000 and means and implied standard errors are used to infer confidence intervals. The procedures are implemented on a DELLTM LATITUDETM laptop machine running a PentiumTM III processor at 600 megahertz with commands executed in MATLABTM version 5.1.0.421. All computer codes are available upon request.

Estimation of the Tobit model

In comparison with the algorism in (4) above, Tobit estimation is executed through the following steps:

Step 1: Select starting values a(s) and z(s).

Step 2: Draw s(s), a scalar, from the inverse-gamma distribution ig(s2, v) (Zellner 1971, pp. 371–373), with s2 º (z(s) – x a(s) )¢(z(s) – x a(s))/v, v  º N – k and where z(s) and a(s) denote conditioning on z(s) and a(s)  from Step 1.

Step 3: Construct z(s) by combining the observed, positive sales quantities 
with a draw  for the latent quantities zc from the normal distribution 
N(xca(s), s(s)IN), where a(s) and s(s) denote conditioning on a(s)  
from Step 1 and s(s) from step 2 and the draws z(s) are made 
such that zi < 0 when y = 0 is observed                                                                                       (5)

Step 4: Draw a(s + 1) from the multivariate normal distribution N((x¢ x)–1x¢ z(s), s(s) (x¢ x)–1), where z(s) denotes conditioning on z(s) from step 3 and s(s) implies conditioning on s(s) from step 2.

Step 5: Repeat steps 1–4 many times, S1, until convergence is attained.

Step 6: Repeat steps 1–4 many times, S2, and collect samples {z(s) s = 1, 2, ..., S} and {a(s) s = 1, 2, ..., S}.

Estimation of the probit–Tobit model

Step 1: Select starting values a(s) and z(s).

Step 2: Draw S(s), (a 2 x 2 square, symmetric, positive definite matrix), from the inverse-gamma distribution iW(W, v) (Zellner 1971, pp. 371373), with W º (z(s) – x a(s))¢(z(s) – x a(s))/v, v º N + m + k + 1 and where z(s) and a(s) denote conditioning on z(s) and a(s) from Step 1.

Step 3: Construct the latent components z(s) by combining the observed, 
positive sales quantities with draws for the censored observations 
and draws for the latent data in the probit equation, as in steps 2 and 3 
in the algorisms (4) and (5) above, respectively                                                                            (6)

Step 4: Draw a(s + 1) from the multivariate normal distribution N((x¢ x)–1x¢ z(s), s(s) (x¢x)–1), where z(s) denotes conditioning on z(s) from step 3 and s(s) implies conditioning on s(s) from step 2.

Step 5: Repeat steps 1–4 many times, S1, until convergence is attained.

Step 6: Repeat steps 1–4 many times, S2, and collect samples {z(s) s = 1, 2, ..., S} and {a(s) s = 1, 2, ..., S}.

Previous PageTop Of Page