next up previous
Next: About this document

Econ 551b Econometrics II
Problem Set 5

Prof. John Rust

Due: April 21, 1999 (Wednesday)

QUESTION 1 Derive the maximum likelihood estimator for the model

displaymath123

where tex2html_wrap_inline125 are IID double exponential random variables with mean 0 and scale parameter tex2html_wrap_inline129 :

displaymath131

1.
Derive a formula for K so that f is a valid probability density (i.e. so tex2html_wrap_inline137 ).

2.
Derive the maximum likelihood estimator for tex2html_wrap_inline139 .

Hint: Show that the MLE for tex2html_wrap_inline141 is identical to the Least Absolute Deviations estimator tex2html_wrap_inline143 defined by:

displaymath145

QUESTION 2 Consider the random utility model:

displaymath147

where tex2html_wrap_inline149 is a decision-maker's payoff or utility for selecting alternative d from a set containing D possible alternatives (we assume that the individual only chooses one item). The term tex2html_wrap_inline155 is known as the deterministic or strict utility from alternative d and the error term tex2html_wrap_inline159 is the random component of utility. In empirical applications tex2html_wrap_inline155 is often specified as

displaymath163

where tex2html_wrap_inline165 is a vector of observed covariates and tex2html_wrap_inline141 is a vector of coefficients determining the agent's utility to be estimated. The interpretation is that tex2html_wrap_inline165 represents a vector of characteristics of the decision-maker and alternative d that are observable by the econometrician and tex2html_wrap_inline173 represents characteristics of the agent and alternative d that affect the utility of choosing alternative d which are unobserved by the econometrician. Define the agent's decision rule tex2html_wrap_inline179 by:

displaymath181

i.e. tex2html_wrap_inline183 is the optimal choice for an agent whose unobserved utility components are tex2html_wrap_inline185 . Then the agent's choice probability tex2html_wrap_inline187 is given by:

displaymath189

where tex2html_wrap_inline191 is the vector of observed characteristics of the agent and the D alternatives and tex2html_wrap_inline195 is the conditional density function of the random components of utility given the values of observed components X, and tex2html_wrap_inline199 is the indicator function given by tex2html_wrap_inline201 if tex2html_wrap_inline203 and 0 otherwise. Note that the integral above is actually a multivariate integral over the D components of tex2html_wrap_inline185 , and simply represents the probability that the values of the vector of unobserved utilities tex2html_wrap_inline211 lead the agent to choose alternative d.

Definition: The Social Surplus Function tex2html_wrap_inline215 is given by:

displaymath217

The Social Surplus function is the expected maximized utility of the agent.gif

Problem: Prove the Williams-Daly-Zachary Theorem:

displaymath227

and discuss its relationship to Roy's Identity.

Hint: Interchange the differentiation and expectation operations when computing tex2html_wrap_inline229 :

eqnarray39

and show that

displaymath231

QUESTION 3 Consider the special case of the random utility model when tex2html_wrap_inline185 has a multivariate (Type I) extreme value distribution:

displaymath235

Show that the conditional choice probability tex2html_wrap_inline187 is given by the multinomial logit formula:

displaymath239

Hint 1: Use the Williams-Daly-Zachary Theorem, showing that in the case of the extreme value distribution (8) the Social Surplus function is given by

displaymath241

where tex2html_wrap_inline243 is Euler's constant.
Hint 2: To derive equation (9) show that the extreme value family is max-stable: i.e. if tex2html_wrap_inline245 are IID extreme value random variables, then tex2html_wrap_inline247 also has an extreme value distribution. Also use the fact that the expectation of a single extreme value random variable with location parameter tex2html_wrap_inline249 and scale parameter tex2html_wrap_inline251 is given by:

displaymath253


and the CDF is given by

displaymath255


Hint 3: Let tex2html_wrap_inline245 be INID (independent, non-identically distributed) extreme value random variables with location parameters tex2html_wrap_inline259 and common scale parameter tex2html_wrap_inline251 . Show that this family is max-stable by proving that tex2html_wrap_inline263 is an extreme value random variable with scale parameter tex2html_wrap_inline251 and location parameter

displaymath267

QUESTION 4 Extract data in data3.asc in the

pub/John_Rust/courses/econ551/regression/

directory on gemini.econ.yale.edu (either ftp to gemini.econ.yale.edu and login as ``anonymous'' and cd pub/John_Rust/courses/econ551/regression and get data3.asc or click on the hyperlink in the html version of this document). This data file contains n=3000 IID observations tex2html_wrap_inline271 that I generated from the binary probability model:

equation76

where tex2html_wrap_inline273 is some parametric model of the conditional probability of the binary variable y given x, i.e. tex2html_wrap_inline279 . Two standard models for tex2html_wrap_inline281 are the logit and probit models. In the logit model we have

equation85

and in the probit mode we have

equation88

where tex2html_wrap_inline283 is the standard normal CDF, i.e.

displaymath113

where

displaymath114

More generally, tex2html_wrap_inline281 could take the form

displaymath115

where F is an arbitrary continuous CDF.

1.
Show that versions of the logit and probit models can be derived from an underlying random utility model where a decision maker has utility function of the form:

displaymath116

and takes action y=1 if tex2html_wrap_inline291 and takes action y=0 if tex2html_wrap_inline295 . Derive the implied choice probability tex2html_wrap_inline297 in the case where tex2html_wrap_inline299 is a bivariate normal random vector with tex2html_wrap_inline301 , tex2html_wrap_inline303 and tex2html_wrap_inline305 and tex2html_wrap_inline307 and tex2html_wrap_inline309 . What is the form of tex2html_wrap_inline273 in the general case when tex2html_wrap_inline299 has an unrestricted bivariate normal distribution with mean vector tex2html_wrap_inline315 and covariance matrix tex2html_wrap_inline317 ? If the utility function includes a constant term, i.e. tex2html_wrap_inline319 are the tex2html_wrap_inline321 , tex2html_wrap_inline315 and tex2html_wrap_inline317 parameters all separately identified if we only have access to data on (y,x) pairs?

2.
Derive the form of the choice probability under the same assumptions as part 1 above but when tex2html_wrap_inline299 has a bivariate Type I extreme value distribution using the results you have obtained from QUESTION 2 and 3. By doing this you will have derived the binary logit model from first principles.

3.
Using the artificially generated data in pub/John_Rust/courses/econ551/regression/data3.asc

compute maximum likelihood estimates of the parameters tex2html_wrap_inline331 of the logit and probit specifications given in equations (2) and (3) above, where tex2html_wrap_inline333 is given by:

displaymath117

4.
Is it possible to consistently estimate tex2html_wrap_inline321 by doing nonlinear least squares estimation of the nonlinear regression formulation of the binary probability model

equation101

instead of doing maximum likelihood? If so, provide a proof of the consistency of the NLLS estimator. If not, provide a counterexample showing that the NLLS estimator is inconsistent.

5.
Estimate both the probit and logit specifications by nonlinear least squares as suggested in part (4). How do the parameter estimates and standard errors compare to the maximum likelihood estimates computed in part 3?

6.
Is there any problem of heteroscedasticity in the nonlinear regression formulation of the problem in (4)? If so, derive the form of the heteroscedasticity and, using the estimated ``first stage'' parameters from part 5 above, compute second stage ``feasible generalized least squares'' (FGLS) estimates of tex2html_wrap_inline321 .

7.
Are the FGLS estimates of tex2html_wrap_inline321 consistent and asymptotically normally distributed (assuming the model is correctly specified)? If so, derive the asymptotic distribution of the FGLS estimator, and if not provide a counter example showing that the FGLS estimator is inconsistent or not asymptotically normally distributed. If you conclude that the FGLS estimator is asymptotically normally distributed, is it as efficient as the maximum likelihood estimator of tex2html_wrap_inline321 ? Explain your reasoning for full credit.

8.
Is it possible to determine whether the data in the file data3.asc are generated from a logit or probit model? In answering this question, consider whether you could estimate tex2html_wrap_inline273 nonparametrically via non-parametric regression. Is there any way you could use the nonparametric regression estimate of tex2html_wrap_inline281 to help discriminate between the logit and probit specifications?




next up previous
Next: About this document

econ551
Wed Mar 24 09:54:02 EST 1999