Problem Set - Racial Bias in the Labor Market

In this question you’ll partially replicate a well-known paper on racial bias in the labor market: “Are Emily and Greg More Employable Than Lakisha and Jamal? A Field Experiment on Labor Market Discrimination” by Marianne Bertrand and Sendhil Mullainathan. The paper, which I’ll refer to as BM for short, appears in Volume 94, Issue #4 of the American Economic Review. You will need to consult this paper to complete this problem.

For convenience, I’ve posted a copy of the dataset from this paper on my website at https://ditraglia.com/data/lakisha_aer.csv. Each row of the dataset corresponds to a single fictitious job applicant. After loading the tidyverse library, you can read the data into a tibble called bm using the read_csv() function as follows:

library(tidyverse)
bm <- read_csv('https://ditraglia.com/data/lakisha_aer.csv')
  1. Read the introduction and conclusion of BM. Then write a short paragraph answering the following:
    1. What research question do BM try to answer?
    2. What data and methodology do they use to address the question?
    3. What do the authors consider to be their key findings?
  2. Now that you have a rough idea of what the paper is about, it’s time to examine the dataset bm. Carry out the following steps:
    1. Display the tibble bm. How many rows and columns does it have?
    2. Display only the columns sex, race and firstname of bm. What information do these columns contain? How are sex and race encoded?
    3. Add two new columns to bm: female should take the value TRUE if sex is female and FALSE otherwise, while black should take the value TRUE if race is black and FALSE otherwise.
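The steps above can be sketched in tidyverse style as follows. Note that the comparison strings 'f' and 'b' are assumptions about how sex and race are encoded; check what you find in part 2 and adjust accordingly.

```r
library(tidyverse)

# Load the data
bm <- read_csv('https://ditraglia.com/data/lakisha_aer.csv')

# Display the full tibble, then only the columns of interest
bm
bm |> select(sex, race, firstname)

# Add logical indicator columns. The strings 'f' and 'b' are assumed
# encodings; replace them with whatever part 2 reveals.
bm <- bm |>
  mutate(female = (sex == 'f'),
         black  = (race == 'b'))
```

The `|>` pipe requires R 4.1 or later; with older versions, substitute the magrittr pipe `%>%` loaded by the tidyverse.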
  3. Read parts A-D of section II in BM. Then write a short paragraph answering the following:
    1. How did the experimenters create their bank of resumes for the experiment?
    2. The experimenters classified the resumes into two groups. What were they and how did they make the classification?
    3. How did the experimenters generate identities for their fictitious job applicants?
  4. Randomized controlled trials are all about balance: when the treatment is randomly assigned, the characteristics of the treatment and control groups will be the same on average. To answer the following parts you’ll need a few additional pieces of information. First, the variable computerskills takes on the value 1 if a given resume says that the applicant has computer skills. Second, the variables education and yearsexp indicate level of education and years of experience, while ofjobs indicates the number of previous jobs listed on the resume. To analyse whether characteristics are balanced across treatment and control, compare summary statistics between applicants with white- and black-sounding names.
    1. Is sex balanced across race? Use dplyr to answer this question. Hint: what happens if you apply the function sum to a vector of TRUE and FALSE values?
    2. Are computer skills balanced across race? Hint: the summary statistic you’ll want to use is the proportion of individuals in each group with computer skills. If you have a vector of ones and zeros, there is a very easy way to compute this.
    3. Are education and ofjobs balanced across race?
    4. Compute the mean and standard deviation of yearsexp by race. Comment on your findings.
    5. Why do we care if sex, education, ofjobs, computerskills, and yearsexp are balanced across race?
    6. Is computerskills balanced across sex? What about education? What’s going on here? Is it a problem? Hint: re-read section II C of the paper.
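A minimal sketch of one way to approach these balance checks: since the indicators are 0/1 or TRUE/FALSE, their group means are proportions (and their sums are counts, per the hint in part 1). The 'f' comparison string is an assumed encoding of sex; verify it against the data first.

```r
library(tidyverse)

bm <- read_csv('https://ditraglia.com/data/lakisha_aer.csv')

# Compare summary statistics across race. For 0/1 or TRUE/FALSE columns,
# sum() gives a count and mean() gives a proportion.
bm |>
  group_by(race) |>
  summarize(n             = n(),
            n_female      = sum(sex == 'f'),  # 'f' is an assumed encoding
            prop_female   = mean(sex == 'f'),
            prop_computer = mean(computerskills),
            mean_educ     = mean(education),
            mean_ofjobs   = mean(ofjobs),
            mean_exp      = mean(yearsexp),
            sd_exp        = sd(yearsexp))
```

For part 6, the same pattern with `group_by(sex)` compares computerskills and education across sex instead.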
  5. The outcome of interest in bm is call which takes on the value 1 if the corresponding resume elicits an email or telephone callback for an interview. Check your answers to the following against Table 1 of the paper:
    1. Calculate the average callback rate for all resumes in bm.
    2. Calculate the average callback rates separately for resumes with “white-sounding” and “black-sounding” names. What do your results suggest?
    3. Repeat part 2, but calculate the average rates for each combination of race and sex. What do your results suggest?
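These three calculations follow the same pattern, moving from no grouping to progressively finer grouping; this is a sketch only, so check the resulting rates against Table 1 of the paper.

```r
library(tidyverse)

bm <- read_csv('https://ditraglia.com/data/lakisha_aer.csv')

# Overall callback rate: call is 0/1, so its mean is a proportion
bm |> summarize(call_rate = mean(call))

# Callback rate by race
bm |> group_by(race) |> summarize(call_rate = mean(call))

# Callback rate by each combination of race and sex
bm |> group_by(race, sex) |> summarize(call_rate = mean(call))
```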
  6. You saw in question 5, part 2 that applicants with African-American sounding names receive fewer callbacks. Is this statistically significant? Test the null hypothesis that there is no difference in callback rates between black- and white-sounding names against its two-sided alternative to find out. For simplicity, you can rely on the asymptotic normality of the t-statistic.
    1. Read the help file for the dplyr function pull() and use it to extract two binary vectors of callbacks from bm, one for black names and one for white names.
    2. Store sample sizes, sample means, and sample variances in variables. Use these to compute your test statistic and degrees of freedom.
    3. Look up the function qnorm() and use it to compute the critical value for a two-sided t-test with \(\alpha = 0.05\).
    4. Look up the function pnorm() and use it to compute the p-value. Check your result against Table 1.
    5. Comment on your results.
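The mechanics of these steps can be sketched as below. The 'b'/'w' comparison strings are assumed encodings of race, and the test uses the normal approximation the question permits rather than an exact t-distribution.

```r
library(tidyverse)

bm <- read_csv('https://ditraglia.com/data/lakisha_aer.csv')

# Extract the callback vectors by race ('b'/'w' are assumed encodings)
call_b <- bm |> filter(race == 'b') |> pull(call)
call_w <- bm |> filter(race == 'w') |> pull(call)

# Sample sizes, means, and variances
n_b <- length(call_b);  n_w <- length(call_w)
m_b <- mean(call_b);    m_w <- mean(call_w)
v_b <- var(call_b);     v_w <- var(call_w)

# Two-sample t-statistic allowing unequal variances
t_stat <- (m_w - m_b) / sqrt(v_w / n_w + v_b / n_b)

# Asymptotic two-sided critical value and p-value (alpha = 0.05)
crit    <- qnorm(1 - 0.05 / 2)
p_value <- 2 * (1 - pnorm(abs(t_stat)))

c(t_stat = t_stat, critical_value = crit, p_value = p_value)
```

Since the question relies on asymptotic normality, qnorm() and pnorm() stand in for their t-distribution counterparts; with samples this large the two give essentially identical answers.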
  7. What’s in a name? Read Section II E and Section IV of BM. Then look up “The Causes and Consequences of Distinctively Black Names” by Roland Fryer and Steven Levitt. Read its introduction and conclusion and answer these reading questions:
    1. What do you think: what social content is contained in names, and what questions can we study using names?
    2. What are some weaknesses that BM acknowledge in their study?
    3. What are some potential confounds that may complicate the interpretation of results based on randomly assigning stereotypically black and white names to resumes?
    4. What is “taste-based” discrimination? What is “statistical” discrimination? How consistent are these models with the results of BM?
    5. What are the key findings of Fryer and Levitt’s study?
    6. How do Fryer and Levitt’s results relate to those of BM? What are some possible ways to reconcile the two sets of findings?