Spring 1998 John Rust
Economics 551b 37 Hillhouse, Rm. 27
PROBLEM SET 0
Applied Parametric and Nonparametric Regression
QUESTION 1 Extract data in file
data1.asc in the
pub/John_Rust/courses/econ551/regression/
directory on gemini.econ.yale.edu (either ftp to
gemini.econ.yale.edu and login as ``anonymous'' and
cd pub/John_Rust/courses/econ551/regression and
get data1.asc or click on the hyperlink in the html
version of this document). This data file
contains n=1500 IID
observations that I generated
on the computer from the nonlinear regression
where is normally distributed with mean zero, and
the independent variable is
a scalar random variable x which is also normally distributed.
The sorted (y,x) observations are graphed in the file
data1.eps, also available by clicking on the hyperlink:
to
http://gemini.econ.yale.edu/jrust/econ551/exams/98/ps0/data1.eps
Compare the OLS (or MLE) estimates of this log-linear
model to those you obtained
in step 1. Can you come up with a theoretical argument that the
probability limits for are the same for the two
different estimation methods? If so, write down a proof, otherwise
provide an argument of why the probability limits are different.
Now what are your maximum likelihood estimates of ? Is the asymptotic covariance between the
and
parameters zero? Why or why not? Can you reject the
hypothesis of homoscedasticity via likelihood ratio or Wald test
at the 5% significance level?
where is the unknown regression function to
be estimated. Estimate f
using your favorite non-parametric regression method (i.e.
kernel regression, series estimator, nearest neighbor, neural
networks, splines, etc.), and plot
the estimate over the [0,1] interval (the range of the
x's). Describe clearly the method you used and your choices
for auxiliary smoothing parameters. Discuss the sensitivity of
your estimates to these smoothing parameters or other aspects
of your estimation procedure. Compare your non-parametric
estimate of f to the results you might get from a simple
OLS with coefficients on a series expansion to f, i.e.
and to a nonlinear regression of the form
using the software you developed to answer parts 1 to 5 above.
Describe how you might use these residuals as data for a nonparametric regression to uncover the form of unknown heteroscedasticity, i.e. to estimate the unknown function