3. You are given a data file CollegeDistance that contains data from a random sample of high schools seniors interviewed
Posted: Sun May 29, 2022 6:45 pm
question is about econometrics, by using STATA program.
I cannot upload the .dta file to solve this question, so i'd
like to know approximate way to solve this.
3. You are given a data file CollegeDistance that contains data from a random sample of high schools seniors interviewed in 1980 and re-interviewed in 1986. In this exercise, you will use the data to investigate the relationship between the year of schooling and the distance from each student's high school to the nearest four-year college. (Proximity to college lowers the cost of education.) A detailed description of the data can be find in the pdf file. (a) Run a regression of years of schooling (ED) on distance to the nearest college (Dist), where Dist measures in tens of miles. (For example, Dist=2 means that the distance is 20 miles.) What is the estimated intercept? What is the estimated slope? (b) Bob's high school was 20 miles from the nearest college. Predict Bob's year of schooling using the estimated regression in (a). How would the predict change if Bob lived 10 miles from the nearest college? (c) Does the distance to college explain a large fraction of the variance in the year of schooling? Which measurement are you looking at? (d) An education advocacy group argues that on average, a person's year of schooling will increase by 0.075 year if the distance to the nearest college is decreased by 10 miles. Is the advocacy group's claim consistent with the regression result in (a)? (e) Other factors also affect how much college a person completes. Does controlling for some other factors change the estimated effect of the distance on year of schooling? To answer this question, construct a table like that in the end of lecture slides #2. Include a simple linear regression, a base specification [a multiple linear regression that includes a set of control variables, for example, you can include all the available variables except for "urban" and "tuition" ], and a modification of the base specification [for example, you can include all variables in the data]. Discuss how the estimated effect of Dist on ED changes across the specifications. [Note that you can use commands in our tutorial code file to make a table where each column represents a regression specification. See the part "tables of regressions..." in our tutorial code file for details.] (f) It has been argued that, controlling for other factors, African Americans and His- panics complete more education than whites. Is this consistent with the regression you did in part (e)? [Let's use the base specification in part (e)] (g) Using the base specification in part (e), test whether the coefficients on father's education ("dadcoll") and mother's education(" momcoll") are equal or not.
Documentation for College Distance Data These data are taken from the High School and Beyond survey conducted by the Department of Education in 1980, with a follow-up in 1986. The survey included students from approximately 1100 high schools. The data used here were supplied by Professor Cecilia Rouse of Princeton University and were used in her paper "Democratization or Diversion? The Effect of Community Colleges on Educational Attainment," Journal of Business and Economic Statistics, April 1995, Vol. 12, No. 2, pp 217-224. The data in College Distance exclude students in the western states. The data in College DistanceWest includes only those students in the western states. Series in Data Set Name Desrciption yrsed Years of Education Completed (See below) female 1 = Female/0 = Male 1 = Black/0 = Not-Black black Hispanic bytest 1 Hispanic/0 = Not-Hispanic Base Year Composite Test Score. (These are achievement tests given to high school seniors in the sample) dadcoll 1 = Father is a College Graduate/ 0 = Father is not a College Graduate momcoll 1 = Mother is a College Graduate/ 0 = Mother is not a College Graduate incomehi 1 = Family Income > $25,000 per year/ 0 = Income ≤ $25,000 per year. |ownhome | 1= Family Owns Home / 0 = Family Does not Own Home urban 1 = School in Urban Area/= School not in Urban Area cue80 County Unempolyment rate in 1980 stwmfg80 dist State Hourly Wage in Manufacturing in 1980 Distance from 4yr College in 10's of miles Avg. State 4yr College Tuition in $1000's tuition Years of Education: Rouse computed years of education by assigning 12 years to all members of the senior class. Each additional year of secondary education counted as a one year. Student's with vocational degrees were assigned 13 years, AA degrees were assigned 14 years, BA degrees were assigned 16 years, those with some graduate education were assigned 17 years, and those with a graduate degree were assigned 18 years.
This I cannot upload the .dta file to solve this question, so i'd
like to know approximate way to solve this.
3. You are given a data file CollegeDistance that contains data from a random sample of high schools seniors interviewed in 1980 and re-interviewed in 1986. In this exercise, you will use the data to investigate the relationship between the year of schooling and the distance from each student's high school to the nearest four-year college. (Proximity to college lowers the cost of education.) A detailed description of the data can be find in the pdf file. (a) Run a regression of years of schooling (ED) on distance to the nearest college (Dist), where Dist measures in tens of miles. (For example, Dist=2 means that the distance is 20 miles.) What is the estimated intercept? What is the estimated slope? (b) Bob's high school was 20 miles from the nearest college. Predict Bob's year of schooling using the estimated regression in (a). How would the predict change if Bob lived 10 miles from the nearest college? (c) Does the distance to college explain a large fraction of the variance in the year of schooling? Which measurement are you looking at? (d) An education advocacy group argues that on average, a person's year of schooling will increase by 0.075 year if the distance to the nearest college is decreased by 10 miles. Is the advocacy group's claim consistent with the regression result in (a)? (e) Other factors also affect how much college a person completes. Does controlling for some other factors change the estimated effect of the distance on year of schooling? To answer this question, construct a table like that in the end of lecture slides #2. Include a simple linear regression, a base specification [a multiple linear regression that includes a set of control variables, for example, you can include all the available variables except for "urban" and "tuition" ], and a modification of the base specification [for example, you can include all variables in the data]. Discuss how the estimated effect of Dist on ED changes across the specifications. [Note that you can use commands in our tutorial code file to make a table where each column represents a regression specification. See the part "tables of regressions..." in our tutorial code file for details.] (f) It has been argued that, controlling for other factors, African Americans and His- panics complete more education than whites. Is this consistent with the regression you did in part (e)? [Let's use the base specification in part (e)] (g) Using the base specification in part (e), test whether the coefficients on father's education ("dadcoll") and mother's education(" momcoll") are equal or not.
Documentation for College Distance Data These data are taken from the High School and Beyond survey conducted by the Department of Education in 1980, with a follow-up in 1986. The survey included students from approximately 1100 high schools. The data used here were supplied by Professor Cecilia Rouse of Princeton University and were used in her paper "Democratization or Diversion? The Effect of Community Colleges on Educational Attainment," Journal of Business and Economic Statistics, April 1995, Vol. 12, No. 2, pp 217-224. The data in College Distance exclude students in the western states. The data in College DistanceWest includes only those students in the western states. Series in Data Set Name Desrciption yrsed Years of Education Completed (See below) female 1 = Female/0 = Male 1 = Black/0 = Not-Black black Hispanic bytest 1 Hispanic/0 = Not-Hispanic Base Year Composite Test Score. (These are achievement tests given to high school seniors in the sample) dadcoll 1 = Father is a College Graduate/ 0 = Father is not a College Graduate momcoll 1 = Mother is a College Graduate/ 0 = Mother is not a College Graduate incomehi 1 = Family Income > $25,000 per year/ 0 = Income ≤ $25,000 per year. |ownhome | 1= Family Owns Home / 0 = Family Does not Own Home urban 1 = School in Urban Area/= School not in Urban Area cue80 County Unempolyment rate in 1980 stwmfg80 dist State Hourly Wage in Manufacturing in 1980 Distance from 4yr College in 10's of miles Avg. State 4yr College Tuition in $1000's tuition Years of Education: Rouse computed years of education by assigning 12 years to all members of the senior class. Each additional year of secondary education counted as a one year. Student's with vocational degrees were assigned 13 years, AA degrees were assigned 14 years, BA degrees were assigned 16 years, those with some graduate education were assigned 17 years, and those with a graduate degree were assigned 18 years.