Problem Set 4 - EC
Instructions
In this problem set, you will be using data from the Education Longitudinal Study of 2002 (ELS). The ELS contains data from a nationally representative sample of adolescents who were sophomores in high school in the base year of the study. The study checked in with them again in their senior year, the equivalent of their “on-time” sophomore year of college, and again in 2012, when the college graduates in the sample would be early in their careers. In this set of problems, to help you prepare for the kinds of questions you will deal with in the midterm, you will be asked a few questions about proportions, distributions of categorical variables, means, and making a table and a bar graph. For these questions, aside from race, indicator variables have been created for all of the variables you need to answer the questions. When answering questions involving the race variable (byrace
) or the educational attainment variable (f3attainment
), you do not need to collapse the variables into fewer categories (though you may need to create clean, labeled indicator variables for each category in them). You can download the data here. There is a long-standing body of literature on inter-generational inequalities. Many of these questions will explore aspects of this using the ELS.
The assignment is entirely extra credit. Each question is worth 1 point. As always, submit a word document with your answers, your .do file, and your .log file.
Questions
What is the average math and average reading score of students at urban high schools?
What is the average math and average reading score of students at suburban high schools?
Let’s think about exposure to diversity. The ELS asked the sophomores a few questions about their 3 closest friends at school. Create a horizontal bar graph that shows the average number of friends of a different racial background across categories of urbanicity.
What is the average number of friends who care about their grades for people in the top quartile of math scores and people in the bottom quartile of math scores?
Now let’s get a picture of characteristics by high- and low-income and by urbanicity. Create a table that shows the distribution of race and gender, average math and reading scores, average number of friends from a different race, average number of friends who believe grades are important, the proportion whose parents have a college degree and the proportion whose grandparents have a college degree. This table should present these statistics overall, separately by high- and low-income, and separately by urbanicity of the school.
Your table should have 6 columns: 1 for everyone, 1 for low-income, 1 for high-income, and 1 for each of the 3 categories of urbanicity.
Is educational attainment associated with parents’ and grandparents’ education? Create two horizontal bar graphs: i) show the student’s educational attainment by 2012 by parents with and without a college degree and ii) show the student’s educational attainment by 2012 by grandparents with and without a college degree.
Now let’s look at schools. Principals were asked if various aspects of the school hindered learning at their school. Create a bar graph that shows the distribution of reasons learning is hindered at schools separately by high- and low-income students.
Let’s get a sense for the school environment and resources at schools. Create a table that presents the average percent of kids in crisis programs, percent of kids in college preparatory programs, average number of guidance counselors, average teacher salaries, and the distribution of possible rewards for good teachers overall and separately by kids whose parents went to college and kids whose parents did not.
Create a table that shows the distribution of good teacher rewards separately by whether the principals believes teacher morale is high or not and whether the principal believes student morale is high or not.