Problem Set 1
Instructions
This problem set will require 3 file submissions: 1) a clean .do file with the commands used to answer the questions, 2) a .log file with the all the output from the commands used to answer the questions, and 3) a clean word document with the answers to the questions. You will submit all three files via Brightspace.
For this problem set, you will need to use the Current Population Survey data from IPUMS. Use the “Get Data” feature and in the ASEC tab select only the 2023 sample.
Note that you will need to also open the “Basic Monthly” tab and deselect all of those samples as well.
For this assignment, you will be using four variables to describe health, employment, income, health expenditures. In parentheses below are the menus paths to find the listed variables. Complete your dataset by adding:
- HEALTH (Person -> Annual Social and Economic Supplement -> Disability)
- FTOTVAL (Person -> Annual Social and Economic Supplement -> Income)
- EMPSTAT (Person -> Core -> Work)
- MOOP (Person -> Annual Social and Economic Supplement -> Health Insurance)
This should leave you with 4 variables in 1 sample. Use the data to answer the following questions.
Questions
- What is the average and median family income in the sample?
Be sure you account for the code used for missing values in any variables you are using to answer these questions when relevant.
What is the average family income among families in the bottom and top quartiles?
What is the distribution of self-reported health status overall, in the bottom quartile of family income, and in the top quartile of family income? How would you describe the difference between people from low and high income families in self-reported health status?
What is the difference in average out of pocket spending on health costs between people with excellent health and people with poor health?
What is the difference in average out of pocket spending on health costs between people in the top decile of family income and people in the bottom decile of family income?
Create a table that shows the distribution of health status, average out of pocket spending on health, and average family income in the sample overall and separately by the top and bottom quartiles of family income. Paste the table from the csv output into word.
Your table should have 7 variable rows and 3 columns of statistics.
Extra Credit (worth 1 point)
- Create a new table starting from the table you made for question 6, but add the percent of family income spent on out of pocket costs on health care as a new variable in the table.