next up previous
Next: About this document

Spring 1997 John Rust
Economics 551b 37 Hillhouse, Rm. 27

FINAL EXAM

April 30, 1997

INSTRUCTIONS: Answer 3 of the questions below. Each question is worth 100 points, for a total of 300 possible points. You have 3 hours for the exam, or 1 hour per question.

QUESTION 1 (Hypothesis testing) Consider the GMM estimator with IID data, i.e the observations tex2html_wrap_inline119 are independent and identically distributed. Show that in the overidentified case (J >K) that the minimized value of the GMM criterion function is asymptotically tex2html_wrap_inline123 with J-K degrees of freedom:

displaymath127

where tex2html_wrap_inline129 is a tex2html_wrap_inline131 vector of moment conditions, tex2html_wrap_inline133 is a tex2html_wrap_inline135 vector of parameters, tex2html_wrap_inline137 is a Chi-squared random variable with J-K degrees of freedom,

displaymath141

displaymath143

and tex2html_wrap_inline145 is a consistent estimator of tex2html_wrap_inline147 given by

displaymath149

QUESTION 2 (Markov Processes)

A.
(10 points) Are Markov processes of any use in econometrics? Describe some examples of how Markov processes are used in econometrics such as providing models of serially dependent data, as a framework for establishing convergence of estimators and proving laws of large numbers, central limit theorems, etc. and as computational tool for doing simulations.

B.
(10 points) What is a random walk? Is a random walk always a Markov process? If not, provide a counter-example.

C.
(40 points) What is the ergodic or invariant distribution of a Markov process? Do all Markov processes have invariant distributions? If not, provide a counterexample of a Markov process that doesn't have an invariant distribution. Can a Markov process have more than 1 invariant distribution? If so, give an example.

D.
(40 points) Consider the discrete Markov process tex2html_wrap_inline151 with transition probability

eqnarray20

Does this process have an invariant distribution? If so, find all of them.

QUESTION 3 (Consistency of M-estimator) Consider an M-estimator defined by:

displaymath111

Suppose following two conditions are given

(i) (Identification) For all tex2html_wrap_inline153

displaymath112

where tex2html_wrap_inline155 .

(ii) (Uniform Convergence)

displaymath113

Prove consistency of the estimator by showing

displaymath114

QUESTION 4 (Consistency of Bayesian posterior) Consider a Bayesian who has observes IID data tex2html_wrap_inline157 , where tex2html_wrap_inline159 is the likelihood for a single observation, and tex2html_wrap_inline161 is the prior density over an unknown finite-dimensional parameter tex2html_wrap_inline163 .

A.
(30 points) Let tex2html_wrap_inline165 be the posterior probability tex2html_wrap_inline133 is in some set tex2html_wrap_inline169 given the first N observations. Show that this posterior probability satisfies the Law of iterated expectations:

displaymath173

B.
(20 points) A martingale is a stochastic process tex2html_wrap_inline175 that satisfies tex2html_wrap_inline177 , where tex2html_wrap_inline179 denotes the information set at time t and includes knowledge of all past tex2html_wrap_inline183 's up to time t, tex2html_wrap_inline187 . Use the result in part A to show that the process tex2html_wrap_inline175 where tex2html_wrap_inline191 is a martingale. (We are interested in martingales because the Martingale Convergence Theorem can be used to show that if tex2html_wrap_inline133 is finite-dimensional, then the posterior distribution converges with probability 1 to a point mass on the true value of tex2html_wrap_inline133 generating the observations tex2html_wrap_inline197 . But you don't have to know anything about this to answer this question.)

C.
(50 points) Suppose that if tex2html_wrap_inline133 is restricted to the K-dimensional simplex, tex2html_wrap_inline203 with tex2html_wrap_inline205 , tex2html_wrap_inline207 , tex2html_wrap_inline209 , and the distribution of tex2html_wrap_inline211 given tex2html_wrap_inline133 is multinomial with parameter tex2html_wrap_inline133 , i.e.

displaymath217

Suppose the prior distribution over tex2html_wrap_inline133 , tex2html_wrap_inline161 is Dirichlet with parameter tex2html_wrap_inline223 :

displaymath225

where both tex2html_wrap_inline227 and tex2html_wrap_inline229 , tex2html_wrap_inline207 . Compute the posterior distribution and show 1) the posterior is also Dirichlet (i.e. the Dirichlet is a conjugate family), and show directly that as tex2html_wrap_inline233 that the posterior distribution converges to a point mass on the true parameter tex2html_wrap_inline133 generating the data.

QUESTION 5 (Time series question) Suppose tex2html_wrap_inline237 is an ARMA(p,q) process, i.e.

displaymath239

where A(L) is a tex2html_wrap_inline243 order lag-polynomial

displaymath245

and B(L) is a tex2html_wrap_inline249 order lag-polynomial

displaymath251

and the lag-operator tex2html_wrap_inline253 is defined by

displaymath255

and tex2html_wrap_inline257 is a white-noise process, tex2html_wrap_inline259 and (cov( tex2html_wrap_inline261 )=0 if tex2html_wrap_inline265 , tex2html_wrap_inline267 if t=s).

A.
(30 points) Write down the autocovariance and spectral density functions for this process.

B.
(30 points) Show that if p = 0 an autoregression of tex2html_wrap_inline273 on q lags of itself provides a consistent estimate of tex2html_wrap_inline277 . Is the autoregression still consistent if p > 0?

C.
(40 points) Assume that a central limit theorem holds, i.e. the distribution of normalized sums of tex2html_wrap_inline237 to converge in distribution to a normal random variable. Write down an expression for the variance of the limiting normal distribution.

QUESTION 6 (Empirical question) Assume that shoppers always choose only a single brand of canned tuna fish from the available selection of K alternative brands of tuna fish each time they go shopping at a supermarket. Assume initially that the (true) probability that the decision-maker chooses brand k is the same for everybody and is given by tex2html_wrap_inline287 , tex2html_wrap_inline289 . Marketing researchers would like to learn more about these choice probabilities, tex2html_wrap_inline291 and spend a great deal of money sampling shoppers' actual tuna fish choices in order to try to estimate these probabilities. Suppose the Chicken of the Sea Tuna company undertook a survey of N shoppers and for each shopper shopping at a particular supermarket with a fixed set of K brands of tuna fish, recorded the brand tex2html_wrap_inline297 chosen by shopper i, tex2html_wrap_inline301 . Thus, tex2html_wrap_inline303 denotes the observation that consumer 1 chose tuna brand 2, and tex2html_wrap_inline309 denotes the observation that consumer 4 chose tuna brand K, etc.

A.
(10 points) Without doing any estimation, are there any general restrictions that you can place on the tex2html_wrap_inline135 parameter vector tex2html_wrap_inline317 ?

B.
(10 points) Is it reasonable to suppose that tex2html_wrap_inline287 is the same for everyone? Describe several factors that could lead different people to have different probabilities of purchasing different brands of tuna. If you were a consultant to Chicken of the Sea, what additional data would you recommend that they collect in order to better predict the probabilities that consumers buy various brands of tuna? Describe how you would use this data once it was collected.

C.
(20 points) Using the observations tex2html_wrap_inline321 on the observed brand choices of the sample of N shoppers, write down an estimator for tex2html_wrap_inline317 (under the assumption that the ``true'' brand choice probabilities tex2html_wrap_inline317 are the same for everyone). Is your estimator unbiased?

D.
(20 points) What is the maximum likelihood estimator of tex2html_wrap_inline317 ? Is the maximum likelihood estimator unbiased?

E.
(40 points) Suppose Chicken of the Sea Tuna company also collected data on the prices tex2html_wrap_inline331 that the supermarket charged for each of the K different brands of tuna fish. Suppose someone proposed that the probability of buying brand j was a function of the prices of all the various brands of tuna, tex2html_wrap_inline337 , given by:

displaymath339

Describe in general terms how to estimate the parameters tex2html_wrap_inline341 . If tex2html_wrap_inline343 , does an increase in tex2html_wrap_inline345 decrease or increase the probability that a shopper would buy brand j?

QUESTION 7 (Regression question) Let tex2html_wrap_inline349 be IID observations from a regression model

displaymath351

where tex2html_wrap_inline353 , tex2html_wrap_inline355 , and tex2html_wrap_inline357 are all scalars. Suppose that tex2html_wrap_inline357 is normally distributed with tex2html_wrap_inline361 , but tex2html_wrap_inline363 . Consider the following two estimators for tex2html_wrap_inline365 :

displaymath367

displaymath369

A.
(20 points) Are these two estimators consistent estimators of tex2html_wrap_inline365 ? Which estimator is more efficient when: 1) if we know a priori that tex2html_wrap_inline373 , and 2) we don't know tex2html_wrap_inline317 ? Explain your reasoning for full credit.

B.
(20 points) Write down an asymptotically optimal estimator for tex2html_wrap_inline365 if we know the value of tex2html_wrap_inline317 a priori.

C.
(20 points) Write down an asymptotically optimal estimator for tex2html_wrap_inline381 if we don't know the value of tex2html_wrap_inline317 a priori.

D.
(20 points) Describe the feasible GLS estimator for tex2html_wrap_inline381 . Is the feasible GLS estimator asymptotically efficient?

E.
(20 points) How would your answers to parts A to D change if you didn't know the distribution of tex2html_wrap_inline357 was normal?




next up previous
Next: About this document

John Rust
Mon May 5 10:40:50 CDT 1997