Problem Set 6
Introduction
In this problem set, you will use data from the American Time Use Survey (ATUS) to create graphs, tables, and estimate some two sample t-tests and simple bivariate linear regressions to answer the questions below. Remember that all graphs and tables must have a title that effectively describes the information in the graph. Tables should also have column numbers, as usual, and short, 1-2 word titles beneath the numbers describing the column’s contents. You can download the data for the problem set here. Each question is worth 1 point.
The ATUS measures all time-use activities in minutes. The data for this exercise is at the individual-level from 2003-2023.
Questions
How much time, in a typical day, is spent socializing with or communicating with others in the US?
Some might wonder whether family formation in America leads to a decline in participation in social activity. Create a table with average time spent socializing and communicating with others, average time spent attending or hosting parties, age, gender, race, and educational attainment (use the four categories already included in the data) overall, separately by married and not married, and separately by having at least 1 child and no children in the household. Write 1 to 2 sentences describing whether family formation seems to impact socializing time.
It’s possible that age, and perhaps educational attainment, plays a role in socializing time as well. Create a horizontal bar graph that presents average time spent socializing and communicating with others across age categories and separately by having a college degree and not having a college degree. Write a sentence or two describing what you notice about social time by age and educational status.
Perhaps households substitue television for socializing. Create a line graph that plots average time spent socializing and communicating with others and average time spent watching television and movies (non-religious) separately by people wihtout children and people with at least one child by each year in the data. Write a few sentences describing the trends you see.
You should have four lines for this graph, two for each time-use activity.
- Run a two-sample t-test for whether the difference in time spent socializing between people with and without kids is statistically significant. Report the difference in means, the t-statistic, and the p-value for the t-test.
You may want to create an indicator variable for whether someone has any children in their household.
There is a longstanding literature on the “second shift” burdens that disproportionately affect women, which refers to the phenomenon in which even working women spend more time and effort on family-related tasks. In the data, there is a variable that captures total time spent on secondary childcare (
trtcc
), or time spent performing child care duties while also doing something else. Run a two-sample t-test of time spent providing secondary care by gender, restricting the sample to only employed people with a young child in the HH. What is the average difference in secondary care time per day among working men and working women with young children? Is the difference statistically significant? Report the t-score and p-value of the difference.As we age, we might anticipate other responsibilities crowding out socializing time. Estimate a linear regression of time spent socializing or communicating with others on age. What is the relationship between age and socializing time? Is the relationship statisticall significant?
Using the results from question 7, how much time would a 25 year old be expected to spend socializing in a typical day? How about a 65 year old?
You will need to combine the slope and the intercept in solving for the answer.
Let’s look at time spent on tobacco and drug use. Create a table that shows time spent using tobacco and drugs, time spent working, educational attainment, marital status, number of children in the household, and hours spent alone overall and separately by people who spend zero time using tobacco and drugs and people who spend more than 0 time using tobacco and drugs. What are some descriptive differences that stand out to you and why?
Estimate a regression of time spent using tobacco and drugs on the number of hours spent alone in a day. What is the relationship between time spent alone and tobacco and drug use? Is the relationship statistically significant?
Create a horizontal bar graph that plots average time socializing and communicating with others, average time spent working, and average time spent alone across levels of educational attainment and separately by whether there is a young child (less than 2 years old) in the household. What do you notice in time use patterns by education and parenting a young child?
Let’s look at time spent on various forms of leisure by whether someone has a college degree. Create a horizontal bar graph of the average for all the leisure activities (t120301 through t120313) by whether someone has a college degree. What differences in leisure activities do you notice across groups?
You will need to use the label define
and label val
command to label the college degree binary variable. You will also need to use the legend
option in the graph to label the means of each variable for a legible graph.