{
  "nbformat": 4,
  "nbformat_minor": 0,
  "metadata": {
    "colab": {
      "name": "8) Do Hosts Discriminate against Black Guests in Airbnb?.ipynb",
      "provenance": []
    },
    "kernelspec": {
      "display_name": "Python 3",
      "language": "python",
      "name": "python3"
    },
    "language_info": {
      "codemirror_mode": {
        "name": "ipython",
        "version": 3
      },
      "file_extension": ".py",
      "mimetype": "text/x-python",
      "name": "python",
      "nbconvert_exporter": "python",
      "pygments_lexer": "ipython3",
      "version": "3.7.9"
    }
  },
  "cells": [
    {
      "cell_type": "markdown",
      "metadata": {
        "id": "mfxvKsxwpZjN"
      },
      "source": [
        "# 8) Do Hosts Discriminate against Black Guests in Airbnb?"
      ]
    },
    {
      "cell_type": "markdown",
      "metadata": {
        "id": "kRSC3oJZQSQ2"
      },
      "source": [
        "[Vitor Kamada](https://www.linkedin.com/in/vitor-kamada-1b73a078)\n",
        "\n",
        "E-mail: econometrics.methods@gmail.com\n",
        "\n",
        "Last updated: 11-1-2020"
      ]
    },
    {
      "cell_type": "markdown",
      "metadata": {
        "id": "nLMyEL1yszwC"
      },
      "source": [
        "Edelman et al. (2017) found that Airbnb guests with distinctively Black-sounding names are 16% less likely to be accepted than guests with distinctively White-sounding names. This result is not a mere correlation: race, signaled only by the guest's name, was randomly assigned. The name is the only difference between the Black and White guest profiles; in every other respect they are identical."
      ]
    },
    {
      "cell_type": "markdown",
      "metadata": {
        "id": "Z0l_mfBDpnVh"
      },
      "source": [
        "Let's open the dataset of Edelman et al. (2017). Each row is an Airbnb property listed in July 2015. The sample is composed of all properties in Baltimore, Dallas, Los Angeles, St. Louis, and Washington, DC."
      ]
    },
    {
      "cell_type": "code",
      "metadata": {
        "id": "3wG1YQm6B55d",
        "outputId": "7480241c-8bec-4e39-fb8f-0cd60265d825",
        "colab": {
          "base_uri": "https://localhost:8080/",
          "height": 425
        }
      },
      "source": [
        "import numpy as np\n",
        "import pandas as pd\n",
        "# Use the full option name; the short 'precision' key is ambiguous in recent pandas\n",
        "pd.set_option('display.precision', 3)\n",
        "\n",
        "# Data from Edelman et al. (2017)\n",
        "path = \"https://github.com/causal-methods/Data/raw/master/\"\n",
        "df = pd.read_csv(path + \"Airbnb.csv\")\n",
        "df.head(5)"
      ],
      "execution_count": 1,
      "outputs": [
        {
          "output_type": "execute_result",
          "data": {
            "text/html": [
              "<div>\n",
              "<style scoped>\n",
              "    .dataframe tbody tr th:only-of-type {\n",
              "        vertical-align: middle;\n",
              "    }\n",
              "\n",
              "    .dataframe tbody tr th {\n",
              "        vertical-align: top;\n",
              "    }\n",
              "\n",
              "    .dataframe thead th {\n",
              "        text-align: right;\n",
              "    }\n",
              "</style>\n",
              "<table border=\"1\" class=\"dataframe\">\n",
              "  <thead>\n",
              "    <tr style=\"text-align: right;\">\n",
              "      <th></th>\n",
              "      <th>host_response</th>\n",
              "      <th>response_date</th>\n",
              "      <th>number_of_messages</th>\n",
              "      <th>automated_coding</th>\n",
              "      <th>latitude</th>\n",
              "      <th>longitude</th>\n",
              "      <th>bed_type</th>\n",
              "      <th>property_type</th>\n",
              "      <th>cancellation_policy</th>\n",
              "      <th>number_guests</th>\n",
              "      <th>bedrooms</th>\n",
              "      <th>bathrooms</th>\n",
              "      <th>cleaning_fee</th>\n",
              "      <th>price</th>\n",
              "      <th>apt_rating</th>\n",
              "      <th>property_setup</th>\n",
              "      <th>city</th>\n",
              "      <th>date_sent</th>\n",
              "      <th>listing_down</th>\n",
              "      <th>number_of_listings</th>\n",
              "      <th>number_of_reviews</th>\n",
              "      <th>member_since</th>\n",
              "      <th>verified_id</th>\n",
              "      <th>host_race</th>\n",
              "      <th>super_host</th>\n",
              "      <th>host_gender</th>\n",
              "      <th>host_age</th>\n",
              "      <th>host_gender_1</th>\n",
              "      <th>host_gender_2</th>\n",
              "      <th>host_gender_3</th>\n",
              "      <th>host_race_1</th>\n",
              "      <th>host_race_2</th>\n",
              "      <th>host_race_3</th>\n",
              "      <th>guest_first_name</th>\n",
              "      <th>guest_last_name</th>\n",
              "      <th>guest_race</th>\n",
              "      <th>guest_gender</th>\n",
              "      <th>guest_id</th>\n",
              "      <th>population</th>\n",
              "      <th>whites</th>\n",
              "      <th>...</th>\n",
              "      <th>host_gender_FF</th>\n",
              "      <th>host_gender_M</th>\n",
              "      <th>host_gender_MM</th>\n",
              "      <th>host_gender_MF</th>\n",
              "      <th>host_gender_same_sex</th>\n",
              "      <th>host_age_cat</th>\n",
              "      <th>ten_reviews</th>\n",
              "      <th>five_star_property</th>\n",
              "      <th>multiple_listings</th>\n",
              "      <th>shared_property</th>\n",
              "      <th>shared_bathroom</th>\n",
              "      <th>has_cleaning_fee</th>\n",
              "      <th>strict_cancellation</th>\n",
              "      <th>young</th>\n",
              "      <th>middle</th>\n",
              "      <th>old</th>\n",
              "      <th>pricey</th>\n",
              "      <th>price_median</th>\n",
              "      <th>log_price</th>\n",
              "      <th>white_proportion</th>\n",
              "      <th>black_proportion</th>\n",
              "      <th>asian_proportion</th>\n",
              "      <th>hispanic_proportion</th>\n",
              "      <th>tract_listings</th>\n",
              "      <th>log_tract_listings</th>\n",
              "      <th>simplified_host_response</th>\n",
              "      <th>graph_bins</th>\n",
              "      <th>yes</th>\n",
              "      <th>baltimore</th>\n",
              "      <th>dallas</th>\n",
              "      <th>los_angeles</th>\n",
              "      <th>sl</th>\n",
              "      <th>dc</th>\n",
              "      <th>total_guests</th>\n",
              "      <th>raw_black</th>\n",
              "      <th>prop_black</th>\n",
              "      <th>any_black</th>\n",
              "      <th>past_guest_merge</th>\n",
              "      <th>filled_september</th>\n",
              "      <th>pr_filled</th>\n",
              "    </tr>\n",
              "  </thead>\n",
              "  <tbody>\n",
              "    <tr>\n",
              "      <th>0</th>\n",
              "      <td>Yes</td>\n",
              "      <td>2015-07-19 08:26:17</td>\n",
              "      <td>2.0</td>\n",
              "      <td>1.0</td>\n",
              "      <td>34.081</td>\n",
              "      <td>-118.270</td>\n",
              "      <td>Real Bed</td>\n",
              "      <td>House</td>\n",
              "      <td>Flexible</td>\n",
              "      <td>3.0</td>\n",
              "      <td>3.0</td>\n",
              "      <td>3.0</td>\n",
              "      <td>30.0</td>\n",
              "      <td>99.0</td>\n",
              "      <td>5.0</td>\n",
              "      <td>Private Room</td>\n",
              "      <td>Los-Angeles</td>\n",
              "      <td>2015-07-19 01:34:00</td>\n",
              "      <td>0.0</td>\n",
              "      <td>1.0</td>\n",
              "      <td>8.0</td>\n",
              "      <td>March 2008</td>\n",
              "      <td>1.0</td>\n",
              "      <td>white</td>\n",
              "      <td>NaN</td>\n",
              "      <td>M</td>\n",
              "      <td>young/middle</td>\n",
              "      <td>M</td>\n",
              "      <td>M</td>\n",
              "      <td>.</td>\n",
              "      <td>white</td>\n",
              "      <td>white</td>\n",
              "      <td>.</td>\n",
              "      <td>Brad</td>\n",
              "      <td>Walsh</td>\n",
              "      <td>white</td>\n",
              "      <td>male</td>\n",
              "      <td>6.0</td>\n",
              "      <td>3340.0</td>\n",
              "      <td>1789.0</td>\n",
              "      <td>...</td>\n",
              "      <td>0</td>\n",
              "      <td>1</td>\n",
              "      <td>0</td>\n",
              "      <td>0</td>\n",
              "      <td>0</td>\n",
              "      <td>1.0</td>\n",
              "      <td>0</td>\n",
              "      <td>1</td>\n",
              "      <td>0</td>\n",
              "      <td>1</td>\n",
              "      <td>0</td>\n",
              "      <td>1</td>\n",
              "      <td>0</td>\n",
              "      <td>0</td>\n",
              "      <td>1</td>\n",
              "      <td>0</td>\n",
              "      <td>0</td>\n",
              "      <td>0</td>\n",
              "      <td>4.595</td>\n",
              "      <td>0.536</td>\n",
              "      <td>0.030</td>\n",
              "      <td>0.145</td>\n",
              "      <td>0.557</td>\n",
              "      <td>16</td>\n",
              "      <td>2.773</td>\n",
              "      <td>Yes</td>\n",
              "      <td>Yes</td>\n",
              "      <td>1.0</td>\n",
              "      <td>0</td>\n",
              "      <td>0</td>\n",
              "      <td>1</td>\n",
              "      <td>0</td>\n",
              "      <td>0</td>\n",
              "      <td>11.0</td>\n",
              "      <td>0.0</td>\n",
              "      <td>0.0</td>\n",
              "      <td>0.0</td>\n",
              "      <td>matched (3)</td>\n",
              "      <td>1</td>\n",
              "      <td>0.412</td>\n",
              "    </tr>\n",
              "    <tr>\n",
              "      <th>1</th>\n",
              "      <td>No or unavailable</td>\n",
              "      <td>2015-07-14 14:13:39</td>\n",
              "      <td>NaN</td>\n",
              "      <td>1.0</td>\n",
              "      <td>38.911</td>\n",
              "      <td>-77.020</td>\n",
              "      <td>NaN</td>\n",
              "      <td>House</td>\n",
              "      <td>Moderate</td>\n",
              "      <td>2.0</td>\n",
              "      <td>2.0</td>\n",
              "      <td>2.0</td>\n",
              "      <td>NaN</td>\n",
              "      <td>125.0</td>\n",
              "      <td>5.0</td>\n",
              "      <td>Private Room</td>\n",
              "      <td>Washington</td>\n",
              "      <td>2015-07-14 09:53:00</td>\n",
              "      <td>0.0</td>\n",
              "      <td>3.0</td>\n",
              "      <td>185.0</td>\n",
              "      <td>September 2008</td>\n",
              "      <td>1.0</td>\n",
              "      <td>hisp</td>\n",
              "      <td>NaN</td>\n",
              "      <td>F</td>\n",
              "      <td>young</td>\n",
              "      <td>F</td>\n",
              "      <td>F</td>\n",
              "      <td>F</td>\n",
              "      <td>white</td>\n",
              "      <td>hisp</td>\n",
              "      <td>hisp</td>\n",
              "      <td>Brad</td>\n",
              "      <td>Walsh</td>\n",
              "      <td>white</td>\n",
              "      <td>male</td>\n",
              "      <td>6.0</td>\n",
              "      <td>2143.0</td>\n",
              "      <td>847.0</td>\n",
              "      <td>...</td>\n",
              "      <td>0</td>\n",
              "      <td>0</td>\n",
              "      <td>0</td>\n",
              "      <td>0</td>\n",
              "      <td>0</td>\n",
              "      <td>0.0</td>\n",
              "      <td>1</td>\n",
              "      <td>1</td>\n",
              "      <td>1</td>\n",
              "      <td>1</td>\n",
              "      <td>0</td>\n",
              "      <td>0</td>\n",
              "      <td>0</td>\n",
              "      <td>1</td>\n",
              "      <td>0</td>\n",
              "      <td>0</td>\n",
              "      <td>0</td>\n",
              "      <td>1</td>\n",
              "      <td>4.828</td>\n",
              "      <td>0.395</td>\n",
              "      <td>0.448</td>\n",
              "      <td>0.057</td>\n",
              "      <td>0.089</td>\n",
              "      <td>19</td>\n",
              "      <td>2.944</td>\n",
              "      <td>No</td>\n",
              "      <td>No</td>\n",
              "      <td>0.0</td>\n",
              "      <td>0</td>\n",
              "      <td>0</td>\n",
              "      <td>0</td>\n",
              "      <td>0</td>\n",
              "      <td>1</td>\n",
              "      <td>167.0</td>\n",
              "      <td>0.0</td>\n",
              "      <td>0.0</td>\n",
              "      <td>0.0</td>\n",
              "      <td>matched (3)</td>\n",
              "      <td>1</td>\n",
              "      <td>0.686</td>\n",
              "    </tr>\n",
              "    <tr>\n",
              "      <th>2</th>\n",
              "      <td>Request for more info (Can you verify? How man...</td>\n",
              "      <td>2015-07-20 16:24:08</td>\n",
              "      <td>2.0</td>\n",
              "      <td>0.0</td>\n",
              "      <td>34.005</td>\n",
              "      <td>-118.481</td>\n",
              "      <td>Pull-out Sofa</td>\n",
              "      <td>Apartment</td>\n",
              "      <td>Strict</td>\n",
              "      <td>1.0</td>\n",
              "      <td>1.0</td>\n",
              "      <td>1.0</td>\n",
              "      <td>100.0</td>\n",
              "      <td>135.0</td>\n",
              "      <td>5.0</td>\n",
              "      <td>Private Room</td>\n",
              "      <td>Los-Angeles</td>\n",
              "      <td>2015-07-20 11:25:00</td>\n",
              "      <td>0.0</td>\n",
              "      <td>2.0</td>\n",
              "      <td>20.0</td>\n",
              "      <td>September 2008</td>\n",
              "      <td>0.0</td>\n",
              "      <td>white</td>\n",
              "      <td>NaN</td>\n",
              "      <td>F</td>\n",
              "      <td>middle/young</td>\n",
              "      <td>F</td>\n",
              "      <td>F</td>\n",
              "      <td>.</td>\n",
              "      <td>white</td>\n",
              "      <td>white</td>\n",
              "      <td>.</td>\n",
              "      <td>Brad</td>\n",
              "      <td>Walsh</td>\n",
              "      <td>white</td>\n",
              "      <td>male</td>\n",
              "      <td>6.0</td>\n",
              "      <td>5700.0</td>\n",
              "      <td>4648.0</td>\n",
              "      <td>...</td>\n",
              "      <td>0</td>\n",
              "      <td>0</td>\n",
              "      <td>0</td>\n",
              "      <td>0</td>\n",
              "      <td>0</td>\n",
              "      <td>1.0</td>\n",
              "      <td>1</td>\n",
              "      <td>1</td>\n",
              "      <td>1</td>\n",
              "      <td>1</td>\n",
              "      <td>1</td>\n",
              "      <td>1</td>\n",
              "      <td>1</td>\n",
              "      <td>0</td>\n",
              "      <td>1</td>\n",
              "      <td>0</td>\n",
              "      <td>0</td>\n",
              "      <td>1</td>\n",
              "      <td>4.905</td>\n",
              "      <td>0.815</td>\n",
              "      <td>0.046</td>\n",
              "      <td>0.054</td>\n",
              "      <td>0.119</td>\n",
              "      <td>21</td>\n",
              "      <td>3.045</td>\n",
              "      <td>Requests more information</td>\n",
              "      <td>Conditional No</td>\n",
              "      <td>0.0</td>\n",
              "      <td>0</td>\n",
              "      <td>0</td>\n",
              "      <td>1</td>\n",
              "      <td>0</td>\n",
              "      <td>0</td>\n",
              "      <td>19.0</td>\n",
              "      <td>0.0</td>\n",
              "      <td>0.0</td>\n",
              "      <td>0.0</td>\n",
              "      <td>matched (3)</td>\n",
              "      <td>0</td>\n",
              "      <td>0.331</td>\n",
              "    </tr>\n",
              "    <tr>\n",
              "      <th>3</th>\n",
              "      <td>I will get back to you</td>\n",
              "      <td>2015-07-20 06:47:38</td>\n",
              "      <td>NaN</td>\n",
              "      <td>0.0</td>\n",
              "      <td>34.092</td>\n",
              "      <td>-118.282</td>\n",
              "      <td>NaN</td>\n",
              "      <td>House</td>\n",
              "      <td>Strict</td>\n",
              "      <td>8.0</td>\n",
              "      <td>8.0</td>\n",
              "      <td>8.0</td>\n",
              "      <td>115.0</td>\n",
              "      <td>319.0</td>\n",
              "      <td>5.0</td>\n",
              "      <td>Entire Place</td>\n",
              "      <td>Los-Angeles</td>\n",
              "      <td>2015-07-20 02:44:00</td>\n",
              "      <td>0.0</td>\n",
              "      <td>1.0</td>\n",
              "      <td>42.0</td>\n",
              "      <td>September 2008</td>\n",
              "      <td>1.0</td>\n",
              "      <td>white</td>\n",
              "      <td>NaN</td>\n",
              "      <td>mix</td>\n",
              "      <td>middle</td>\n",
              "      <td>M</td>\n",
              "      <td>mix</td>\n",
              "      <td>mix</td>\n",
              "      <td>white</td>\n",
              "      <td>white</td>\n",
              "      <td>mult</td>\n",
              "      <td>Tanisha</td>\n",
              "      <td>Jackson</td>\n",
              "      <td>black</td>\n",
              "      <td>female</td>\n",
              "      <td>15.0</td>\n",
              "      <td>2235.0</td>\n",
              "      <td>1393.0</td>\n",
              "      <td>...</td>\n",
              "      <td>0</td>\n",
              "      <td>0</td>\n",
              "      <td>0</td>\n",
              "      <td>0</td>\n",
              "      <td>0</td>\n",
              "      <td>2.0</td>\n",
              "      <td>1</td>\n",
              "      <td>1</td>\n",
              "      <td>0</td>\n",
              "      <td>0</td>\n",
              "      <td>0</td>\n",
              "      <td>1</td>\n",
              "      <td>1</td>\n",
              "      <td>0</td>\n",
              "      <td>1</td>\n",
              "      <td>0</td>\n",
              "      <td>1</td>\n",
              "      <td>1</td>\n",
              "      <td>5.765</td>\n",
              "      <td>0.623</td>\n",
              "      <td>0.043</td>\n",
              "      <td>0.109</td>\n",
              "      <td>0.381</td>\n",
              "      <td>11</td>\n",
              "      <td>2.398</td>\n",
              "      <td>Not sure or check later</td>\n",
              "      <td>Conditional No</td>\n",
              "      <td>0.0</td>\n",
              "      <td>0</td>\n",
              "      <td>0</td>\n",
              "      <td>1</td>\n",
              "      <td>0</td>\n",
              "      <td>0</td>\n",
              "      <td>41.0</td>\n",
              "      <td>0.0</td>\n",
              "      <td>0.0</td>\n",
              "      <td>0.0</td>\n",
              "      <td>matched (3)</td>\n",
              "      <td>0</td>\n",
              "      <td>0.536</td>\n",
              "    </tr>\n",
              "    <tr>\n",
              "      <th>4</th>\n",
              "      <td>Message not sent</td>\n",
              "      <td>.</td>\n",
              "      <td>NaN</td>\n",
              "      <td>1.0</td>\n",
              "      <td>38.830</td>\n",
              "      <td>-76.897</td>\n",
              "      <td>Real Bed</td>\n",
              "      <td>House</td>\n",
              "      <td>Strict</td>\n",
              "      <td>2.0</td>\n",
              "      <td>2.0</td>\n",
              "      <td>2.0</td>\n",
              "      <td>35.0</td>\n",
              "      <td>40.0</td>\n",
              "      <td>5.0</td>\n",
              "      <td>Private Room</td>\n",
              "      <td>Washington</td>\n",
              "      <td>.</td>\n",
              "      <td>0.0</td>\n",
              "      <td>1.0</td>\n",
              "      <td>37.0</td>\n",
              "      <td>October 2008</td>\n",
              "      <td>0.0</td>\n",
              "      <td>mult</td>\n",
              "      <td>NaN</td>\n",
              "      <td>FF</td>\n",
              "      <td>middle/young</td>\n",
              "      <td>FF</td>\n",
              "      <td>FF</td>\n",
              "      <td>.</td>\n",
              "      <td>mult</td>\n",
              "      <td>mult</td>\n",
              "      <td>.</td>\n",
              "      <td>Lakisha</td>\n",
              "      <td>Jones</td>\n",
              "      <td>black</td>\n",
              "      <td>female</td>\n",
              "      <td>11.0</td>\n",
              "      <td>4696.0</td>\n",
              "      <td>482.0</td>\n",
              "      <td>...</td>\n",
              "      <td>1</td>\n",
              "      <td>0</td>\n",
              "      <td>0</td>\n",
              "      <td>0</td>\n",
              "      <td>1</td>\n",
              "      <td>1.0</td>\n",
              "      <td>1</td>\n",
              "      <td>1</td>\n",
              "      <td>0</td>\n",
              "      <td>1</td>\n",
              "      <td>0</td>\n",
              "      <td>1</td>\n",
              "      <td>1</td>\n",
              "      <td>0</td>\n",
              "      <td>1</td>\n",
              "      <td>0</td>\n",
              "      <td>0</td>\n",
              "      <td>0</td>\n",
              "      <td>3.689</td>\n",
              "      <td>0.103</td>\n",
              "      <td>0.809</td>\n",
              "      <td>0.034</td>\n",
              "      <td>0.057</td>\n",
              "      <td>2</td>\n",
              "      <td>0.693</td>\n",
              "      <td>NaN</td>\n",
              "      <td>NaN</td>\n",
              "      <td>NaN</td>\n",
              "      <td>0</td>\n",
              "      <td>0</td>\n",
              "      <td>0</td>\n",
              "      <td>0</td>\n",
              "      <td>1</td>\n",
              "      <td>28.0</td>\n",
              "      <td>0.0</td>\n",
              "      <td>0.0</td>\n",
              "      <td>0.0</td>\n",
              "      <td>matched (3)</td>\n",
              "      <td>1</td>\n",
              "      <td>0.555</td>\n",
              "    </tr>\n",
              "  </tbody>\n",
              "</table>\n",
              "<p>5 rows × 104 columns</p>\n",
              "</div>"
            ],
            "text/plain": [
              "                                       host_response  ... pr_filled\n",
              "0                                                Yes  ...     0.412\n",
              "1                                  No or unavailable  ...     0.686\n",
              "2  Request for more info (Can you verify? How man...  ...     0.331\n",
              "3                             I will get back to you  ...     0.536\n",
              "4                                   Message not sent  ...     0.555\n",
              "\n",
              "[5 rows x 104 columns]"
            ]
          },
          "metadata": {
            "tags": []
          },
          "execution_count": 1
        }
      ]
    },
    {
      "cell_type": "markdown",
      "metadata": {
        "id": "Aji09URQpjLa"
      },
      "source": [
        "The chart below shows that Black guests receive fewer \"Yes\" responses from hosts than White guests. Somebody might argue that the results of Edelman et al. (2017) are driven by differences in intermediate host responses, such as conditional replies or non-response. For example, you could argue that requests from Black guests are more likely to be dismissed as fake accounts or spam. However, note that the discrimination result is driven by the \"Yes\" and \"No\" responses, not by the intermediate ones."
      ]
    },
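    {
      "cell_type": "markdown",
      "metadata": {},
      "source": [
        "The counts plotted in the chart below can also be read as shares within each guest group. The next cell is a minimal, illustrative sketch and not the regression estimate of Edelman et al. (2017): it hard-codes the counts that the chart plots, and the names `responses`, `counts`, `shares`, `gap`, and `relative_gap` are assumptions introduced here. With these raw counts, the share of \"Yes\" responses is roughly 6 percentage points lower for Black guests, which is about 16% in relative terms."
      ]
    },
    {
      "cell_type": "code",
      "metadata": {},
      "source": [
        "import pandas as pd\n",
        "\n",
        "# Counts copied from the bar chart (assumption: hard-coded for illustration)\n",
        "responses = ['Conditional No', 'Conditional Yes', 'No', 'No Response', 'Yes']\n",
        "counts = pd.DataFrame({'White': [505, 418, 663, 429, 1152],\n",
        "                       'Black': [513, 341, 873, 423, 940]}, index=responses)\n",
        "\n",
        "# Share of each response type within each guest group (columns sum to 1)\n",
        "shares = counts / counts.sum()\n",
        "\n",
        "# Gap in the 'Yes' share: absolute, and relative to the White guests' share\n",
        "gap = shares.loc['Yes', 'White'] - shares.loc['Yes', 'Black']\n",
        "relative_gap = gap / shares.loc['Yes', 'White']\n",
        "\n",
        "print(shares.round(3))\n",
        "print(f'Gap in Yes share: {gap:.3f}; relative gap: {relative_gap:.3f}')"
      ],
      "execution_count": null,
      "outputs": []
    },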
    {
      "cell_type": "code",
      "metadata": {
        "id": "UKNW_qjTm60G",
        "outputId": "9d41347d-d327-4eb8-8932-fe72c2ce635f",
        "colab": {
          "base_uri": "https://localhost:8080/",
          "height": 542
        }
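      },
      "source": [
        "# Count host responses (rows) by guest race (columns: 0 = White, 1 = Black)\n",
        "count = pd.crosstab(df[\"graph_bins\"], df[\"guest_black\"])\n",
        "\n",
        "import plotly.graph_objects as go\n",
        "\n",
        "# Labels listed in the alphabetical order in which crosstab sorts its rows\n",
        "node = ['Conditional No', 'Conditional Yes', 'No',\n",
        "        'No Response', 'Yes']\n",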
        "fig = go.Figure(data=[\n",
        "    go.Bar(name='Guest is white', x=node, y=count[0]),\n",
        "    go.Bar(name='Guest is African American', x=node, y=count[1]) ])\n",
        "\n",
        "fig.update_layout(barmode='group',\n",
        "  title_text = 'Host Responses by Race',\n",
        "  font=dict(size=18) )\n",
        "\n",
        "fig.show()"
      ],
      "execution_count": 2,
      "outputs": [
        {
          "output_type": "display_data",
          "data": {
            "text/html": [
              "<html>\n",
              "<head><meta charset=\"utf-8\" /></head>\n",
              "<body>\n",
              "    <div>\n",
              "            <script src=\"https://cdnjs.cloudflare.com/ajax/libs/mathjax/2.7.5/MathJax.js?config=TeX-AMS-MML_SVG\"></script><script type=\"text/javascript\">if (window.MathJax) {MathJax.Hub.Config({SVG: {font: \"STIX-Web\"}});}</script>\n",
              "                <script type=\"text/javascript\">window.PlotlyConfig = {MathJaxConfig: 'local'};</script>\n",
              "        <script src=\"https://cdn.plot.ly/plotly-latest.min.js\"></script>    \n",
              "            <div id=\"498123ff-4a1e-493e-9ae8-7198aaf289dd\" class=\"plotly-graph-div\" style=\"height:525px; width:100%;\"></div>\n",
              "            <script type=\"text/javascript\">\n",
              "                \n",
              "                    window.PLOTLYENV=window.PLOTLYENV || {};\n",
              "                    \n",
              "                if (document.getElementById(\"498123ff-4a1e-493e-9ae8-7198aaf289dd\")) {\n",
              "                    Plotly.newPlot(\n",
              "                        '498123ff-4a1e-493e-9ae8-7198aaf289dd',\n",
              "                        [{\"name\": \"Guest is white\", \"type\": \"bar\", \"x\": [\"Conditional No\", \"Conditional Yes\", \"No\", \"No Response\", \"Yes\"], \"y\": [505, 418, 663, 429, 1152]}, {\"name\": \"Guest is African American\", \"type\": \"bar\", \"x\": [\"Conditional No\", \"Conditional Yes\", \"No\", \"No Response\", \"Yes\"], \"y\": [513, 341, 873, 423, 940]}],\n",
              "                        {\"barmode\": \"group\", \"font\": {\"size\": 18}, \"template\": {\"data\": {\"bar\": [{\"error_x\": {\"color\": \"#2a3f5f\"}, \"error_y\": {\"color\": \"#2a3f5f\"}, \"marker\": {\"line\": {\"color\": \"#E5ECF6\", \"width\": 0.5}}, \"type\": \"bar\"}], \"barpolar\": [{\"marker\": {\"line\": {\"color\": \"#E5ECF6\", \"width\": 0.5}}, \"type\": \"barpolar\"}], \"carpet\": [{\"aaxis\": {\"endlinecolor\": \"#2a3f5f\", \"gridcolor\": \"white\", \"linecolor\": \"white\", \"minorgridcolor\": \"white\", \"startlinecolor\": \"#2a3f5f\"}, \"baxis\": {\"endlinecolor\": \"#2a3f5f\", \"gridcolor\": \"white\", \"linecolor\": \"white\", \"minorgridcolor\": \"white\", \"startlinecolor\": \"#2a3f5f\"}, \"type\": \"carpet\"}], \"choropleth\": [{\"colorbar\": {\"outlinewidth\": 0, \"ticks\": \"\"}, \"type\": \"choropleth\"}], \"contour\": [{\"colorbar\": {\"outlinewidth\": 0, \"ticks\": \"\"}, \"colorscale\": [[0.0, \"#0d0887\"], [0.1111111111111111, \"#46039f\"], [0.2222222222222222, \"#7201a8\"], [0.3333333333333333, \"#9c179e\"], [0.4444444444444444, \"#bd3786\"], [0.5555555555555556, \"#d8576b\"], [0.6666666666666666, \"#ed7953\"], [0.7777777777777778, \"#fb9f3a\"], [0.8888888888888888, \"#fdca26\"], [1.0, \"#f0f921\"]], \"type\": \"contour\"}], \"contourcarpet\": [{\"colorbar\": {\"outlinewidth\": 0, \"ticks\": \"\"}, \"type\": \"contourcarpet\"}], \"heatmap\": [{\"colorbar\": {\"outlinewidth\": 0, \"ticks\": \"\"}, \"colorscale\": [[0.0, \"#0d0887\"], [0.1111111111111111, \"#46039f\"], [0.2222222222222222, \"#7201a8\"], [0.3333333333333333, \"#9c179e\"], [0.4444444444444444, \"#bd3786\"], [0.5555555555555556, \"#d8576b\"], [0.6666666666666666, \"#ed7953\"], [0.7777777777777778, \"#fb9f3a\"], [0.8888888888888888, \"#fdca26\"], [1.0, \"#f0f921\"]], \"type\": \"heatmap\"}], \"heatmapgl\": [{\"colorbar\": {\"outlinewidth\": 0, \"ticks\": \"\"}, \"colorscale\": [[0.0, \"#0d0887\"], [0.1111111111111111, \"#46039f\"], [0.2222222222222222, 
\"#7201a8\"], [0.3333333333333333, \"#9c179e\"], [0.4444444444444444, \"#bd3786\"], [0.5555555555555556, \"#d8576b\"], [0.6666666666666666, \"#ed7953\"], [0.7777777777777778, \"#fb9f3a\"], [0.8888888888888888, \"#fdca26\"], [1.0, \"#f0f921\"]], \"type\": \"heatmapgl\"}], \"histogram\": [{\"marker\": {\"colorbar\": {\"outlinewidth\": 0, \"ticks\": \"\"}}, \"type\": \"histogram\"}], \"histogram2d\": [{\"colorbar\": {\"outlinewidth\": 0, \"ticks\": \"\"}, \"colorscale\": [[0.0, \"#0d0887\"], [0.1111111111111111, \"#46039f\"], [0.2222222222222222, \"#7201a8\"], [0.3333333333333333, \"#9c179e\"], [0.4444444444444444, \"#bd3786\"], [0.5555555555555556, \"#d8576b\"], [0.6666666666666666, \"#ed7953\"], [0.7777777777777778, \"#fb9f3a\"], [0.8888888888888888, \"#fdca26\"], [1.0, \"#f0f921\"]], \"type\": \"histogram2d\"}], \"histogram2dcontour\": [{\"colorbar\": {\"outlinewidth\": 0, \"ticks\": \"\"}, \"colorscale\": [[0.0, \"#0d0887\"], [0.1111111111111111, \"#46039f\"], [0.2222222222222222, \"#7201a8\"], [0.3333333333333333, \"#9c179e\"], [0.4444444444444444, \"#bd3786\"], [0.5555555555555556, \"#d8576b\"], [0.6666666666666666, \"#ed7953\"], [0.7777777777777778, \"#fb9f3a\"], [0.8888888888888888, \"#fdca26\"], [1.0, \"#f0f921\"]], \"type\": \"histogram2dcontour\"}], \"mesh3d\": [{\"colorbar\": {\"outlinewidth\": 0, \"ticks\": \"\"}, \"type\": \"mesh3d\"}], \"parcoords\": [{\"line\": {\"colorbar\": {\"outlinewidth\": 0, \"ticks\": \"\"}}, \"type\": \"parcoords\"}], \"pie\": [{\"automargin\": true, \"type\": \"pie\"}], \"scatter\": [{\"marker\": {\"colorbar\": {\"outlinewidth\": 0, \"ticks\": \"\"}}, \"type\": \"scatter\"}], \"scatter3d\": [{\"line\": {\"colorbar\": {\"outlinewidth\": 0, \"ticks\": \"\"}}, \"marker\": {\"colorbar\": {\"outlinewidth\": 0, \"ticks\": \"\"}}, \"type\": \"scatter3d\"}], \"scattercarpet\": [{\"marker\": {\"colorbar\": {\"outlinewidth\": 0, \"ticks\": \"\"}}, \"type\": \"scattercarpet\"}], \"scattergeo\": [{\"marker\": {\"colorbar\": 
{\"outlinewidth\": 0, \"ticks\": \"\"}}, \"type\": \"scattergeo\"}], \"scattergl\": [{\"marker\": {\"colorbar\": {\"outlinewidth\": 0, \"ticks\": \"\"}}, \"type\": \"scattergl\"}], \"scattermapbox\": [{\"marker\": {\"colorbar\": {\"outlinewidth\": 0, \"ticks\": \"\"}}, \"type\": \"scattermapbox\"}], \"scatterpolar\": [{\"marker\": {\"colorbar\": {\"outlinewidth\": 0, \"ticks\": \"\"}}, \"type\": \"scatterpolar\"}], \"scatterpolargl\": [{\"marker\": {\"colorbar\": {\"outlinewidth\": 0, \"ticks\": \"\"}}, \"type\": \"scatterpolargl\"}], \"scatterternary\": [{\"marker\": {\"colorbar\": {\"outlinewidth\": 0, \"ticks\": \"\"}}, \"type\": \"scatterternary\"}], \"surface\": [{\"colorbar\": {\"outlinewidth\": 0, \"ticks\": \"\"}, \"colorscale\": [[0.0, \"#0d0887\"], [0.1111111111111111, \"#46039f\"], [0.2222222222222222, \"#7201a8\"], [0.3333333333333333, \"#9c179e\"], [0.4444444444444444, \"#bd3786\"], [0.5555555555555556, \"#d8576b\"], [0.6666666666666666, \"#ed7953\"], [0.7777777777777778, \"#fb9f3a\"], [0.8888888888888888, \"#fdca26\"], [1.0, \"#f0f921\"]], \"type\": \"surface\"}], \"table\": [{\"cells\": {\"fill\": {\"color\": \"#EBF0F8\"}, \"line\": {\"color\": \"white\"}}, \"header\": {\"fill\": {\"color\": \"#C8D4E3\"}, \"line\": {\"color\": \"white\"}}, \"type\": \"table\"}]}, \"layout\": {\"annotationdefaults\": {\"arrowcolor\": \"#2a3f5f\", \"arrowhead\": 0, \"arrowwidth\": 1}, \"coloraxis\": {\"colorbar\": {\"outlinewidth\": 0, \"ticks\": \"\"}}, \"colorscale\": {\"diverging\": [[0, \"#8e0152\"], [0.1, \"#c51b7d\"], [0.2, \"#de77ae\"], [0.3, \"#f1b6da\"], [0.4, \"#fde0ef\"], [0.5, \"#f7f7f7\"], [0.6, \"#e6f5d0\"], [0.7, \"#b8e186\"], [0.8, \"#7fbc41\"], [0.9, \"#4d9221\"], [1, \"#276419\"]], \"sequential\": [[0.0, \"#0d0887\"], [0.1111111111111111, \"#46039f\"], [0.2222222222222222, \"#7201a8\"], [0.3333333333333333, \"#9c179e\"], [0.4444444444444444, \"#bd3786\"], [0.5555555555555556, \"#d8576b\"], [0.6666666666666666, \"#ed7953\"], [0.7777777777777778, 
\"#fb9f3a\"], [0.8888888888888888, \"#fdca26\"], [1.0, \"#f0f921\"]], \"sequentialminus\": [[0.0, \"#0d0887\"], [0.1111111111111111, \"#46039f\"], [0.2222222222222222, \"#7201a8\"], [0.3333333333333333, \"#9c179e\"], [0.4444444444444444, \"#bd3786\"], [0.5555555555555556, \"#d8576b\"], [0.6666666666666666, \"#ed7953\"], [0.7777777777777778, \"#fb9f3a\"], [0.8888888888888888, \"#fdca26\"], [1.0, \"#f0f921\"]]}, \"colorway\": [\"#636efa\", \"#EF553B\", \"#00cc96\", \"#ab63fa\", \"#FFA15A\", \"#19d3f3\", \"#FF6692\", \"#B6E880\", \"#FF97FF\", \"#FECB52\"], \"font\": {\"color\": \"#2a3f5f\"}, \"geo\": {\"bgcolor\": \"white\", \"lakecolor\": \"white\", \"landcolor\": \"#E5ECF6\", \"showlakes\": true, \"showland\": true, \"subunitcolor\": \"white\"}, \"hoverlabel\": {\"align\": \"left\"}, \"hovermode\": \"closest\", \"mapbox\": {\"style\": \"light\"}, \"paper_bgcolor\": \"white\", \"plot_bgcolor\": \"#E5ECF6\", \"polar\": {\"angularaxis\": {\"gridcolor\": \"white\", \"linecolor\": \"white\", \"ticks\": \"\"}, \"bgcolor\": \"#E5ECF6\", \"radialaxis\": {\"gridcolor\": \"white\", \"linecolor\": \"white\", \"ticks\": \"\"}}, \"scene\": {\"xaxis\": {\"backgroundcolor\": \"#E5ECF6\", \"gridcolor\": \"white\", \"gridwidth\": 2, \"linecolor\": \"white\", \"showbackground\": true, \"ticks\": \"\", \"zerolinecolor\": \"white\"}, \"yaxis\": {\"backgroundcolor\": \"#E5ECF6\", \"gridcolor\": \"white\", \"gridwidth\": 2, \"linecolor\": \"white\", \"showbackground\": true, \"ticks\": \"\", \"zerolinecolor\": \"white\"}, \"zaxis\": {\"backgroundcolor\": \"#E5ECF6\", \"gridcolor\": \"white\", \"gridwidth\": 2, \"linecolor\": \"white\", \"showbackground\": true, \"ticks\": \"\", \"zerolinecolor\": \"white\"}}, \"shapedefaults\": {\"line\": {\"color\": \"#2a3f5f\"}}, \"ternary\": {\"aaxis\": {\"gridcolor\": \"white\", \"linecolor\": \"white\", \"ticks\": \"\"}, \"baxis\": {\"gridcolor\": \"white\", \"linecolor\": \"white\", \"ticks\": \"\"}, \"bgcolor\": \"#E5ECF6\", \"caxis\": 
{\"gridcolor\": \"white\", \"linecolor\": \"white\", \"ticks\": \"\"}}, \"title\": {\"x\": 0.05}, \"xaxis\": {\"automargin\": true, \"gridcolor\": \"white\", \"linecolor\": \"white\", \"ticks\": \"\", \"title\": {\"standoff\": 15}, \"zerolinecolor\": \"white\", \"zerolinewidth\": 2}, \"yaxis\": {\"automargin\": true, \"gridcolor\": \"white\", \"linecolor\": \"white\", \"ticks\": \"\", \"title\": {\"standoff\": 15}, \"zerolinecolor\": \"white\", \"zerolinewidth\": 2}}}, \"title\": {\"text\": \"Host Responses by Race\"}},\n",
              "                        {\"responsive\": true}\n",
              "                    ).then(function(){\n",
              "                            \n",
              "var gd = document.getElementById('498123ff-4a1e-493e-9ae8-7198aaf289dd');\n",
              "var x = new MutationObserver(function (mutations, observer) {{\n",
              "        var display = window.getComputedStyle(gd).display;\n",
              "        if (!display || display === 'none') {{\n",
              "            console.log([gd, 'removed!']);\n",
              "            Plotly.purge(gd);\n",
              "            observer.disconnect();\n",
              "        }}\n",
              "}});\n",
              "\n",
              "// Listen for the removal of the full notebook cells\n",
              "var notebookContainer = gd.closest('#notebook-container');\n",
              "if (notebookContainer) {{\n",
              "    x.observe(notebookContainer, {childList: true});\n",
              "}}\n",
              "\n",
              "// Listen for the clearing of the current output cell\n",
              "var outputEl = gd.closest('.output');\n",
              "if (outputEl) {{\n",
              "    x.observe(outputEl, {childList: true});\n",
              "}}\n",
              "\n",
              "                        })\n",
              "                };\n",
              "                \n",
              "            </script>\n",
              "        </div>\n",
              "</body>\n",
              "</html>"
            ]
          },
          "metadata": {
            "tags": []
          }
        }
      ]
    },
    {
      "cell_type": "markdown",
      "metadata": {
        "id": "PwsK4pm9N_LQ"
      },
      "source": [
        "Let's replicate the main results of Edelman et al. (2017)."
      ]
    },
    {
      "cell_type": "code",
      "metadata": {
        "id": "NV30eUJCGpqX",
        "outputId": "707117c4-09e8-47a9-9153-5317fb3b3f8d",
        "colab": {
          "base_uri": "https://localhost:8080/"
        }
      },
      "source": [
        "import statsmodels.api as sm\n",
        "\n",
        "df['const'] = 1\n",
        "\n",
        "# Column 1\n",
        "# The missing='drop' option of statsmodels does not extend to\n",
        "# the cluster variable, so rows with missing values must be\n",
        "# dropped explicitly to compute clustered standard errors.\n",
        "df1 = df.dropna(subset=['yes', 'guest_black', 'name_by_city'])\n",
        "reg1 = sm.OLS(df1['yes'], df1[['const', 'guest_black']])\n",
        "res1 = reg1.fit(cov_type='cluster',\n",
        "                cov_kwds={'groups': df1['name_by_city']})\n",
        "\n",
        "# Column 2: add host race and gender\n",
        "vars2 = ['yes', 'guest_black', 'name_by_city',\n",
        "         'host_race_black', 'host_gender_M']\n",
        "df2 = df.dropna(subset=vars2)\n",
        "reg2 = sm.OLS(df2['yes'], df2[['const', 'guest_black',\n",
        "                    'host_race_black', 'host_gender_M']])\n",
        "res2 = reg2.fit(cov_type='cluster',\n",
        "                cov_kwds={'groups': df2['name_by_city']})\n",
        "\n",
        "# Column 3: add listing characteristics\n",
        "vars3 = ['yes', 'guest_black', 'name_by_city',\n",
        "         'host_race_black', 'host_gender_M',\n",
        "         'multiple_listings', 'shared_property',\n",
        "         'ten_reviews', 'log_price']\n",
        "df3 = df.dropna(subset=vars3)\n",
        "reg3 = sm.OLS(df3['yes'], df3[['const', 'guest_black',\n",
        "                    'host_race_black', 'host_gender_M',\n",
        "                    'multiple_listings', 'shared_property',\n",
        "                    'ten_reviews', 'log_price']])\n",
        "res3 = reg3.fit(cov_type='cluster',\n",
        "                cov_kwds={'groups': df3['name_by_city']})\n",
        "\n",
        "columns = [res1, res2, res3]"
      ],
      "execution_count": 3,
      "outputs": []
    },
    {
      "cell_type": "code",
      "metadata": {
        "id": "y8viPqNKct8h",
        "outputId": "554e40c7-bbdb-4f3b-ae43-2533de0bbce0",
        "colab": {
          "base_uri": "https://localhost:8080/"
        }
      },
      "source": [
        "# Library for producing publication-quality regression\n",
        "# tables in LaTeX, HTML, etc.\n",
        "!pip install stargazer"
      ],
      "execution_count": 4,
      "outputs": [
        {
          "output_type": "stream",
          "text": [
            "Requirement already satisfied: stargazer in /usr/local/lib/python3.6/dist-packages (0.0.5)\n"
          ],
          "name": "stdout"
        }
      ]
    },
    {
      "cell_type": "markdown",
      "metadata": {
        "id": "8CTftYlrCdNP"
      },
      "source": [
        "In column 1 of the table below, guests with White-sounding names are accepted 49% of the time, whereas guests with Black-sounding names are accepted about 41% of the time. A Black-sounding name therefore carries a penalty of 8 percentage points. This result is remarkably robust to the control variables added in columns 2 and 3."
      ]
    },
    {
      "cell_type": "code",
      "metadata": {
        "id": "dMHO77Flch3t",
        "outputId": "1dec8334-f8f0-4052-e918-8bd4d37ae357",
        "colab": {
          "base_uri": "https://localhost:8080/",
          "height": 587
        }
      },
      "source": [
        "# Settings for a nice table\n",
        "from stargazer.stargazer import Stargazer\n",
        "stargazer = Stargazer(columns)\n",
        "stargazer.title('The Impact of Race on Likelihood of Acceptance')\n",
        "stargazer"
      ],
      "execution_count": 5,
      "outputs": [
        {
          "output_type": "execute_result",
          "data": {
            "text/html": [
              "The Impact of Race on Likelihood of Acceptance<br><table style=\"text-align:center\"><tr><td colspan=\"4\" style=\"border-bottom: 1px solid black\"></td></tr><tr><td style=\"text-align:left\"></td><td colspan=\"3\"><em>Dependent variable:yes</em></td></tr><tr><td style=\"text-align:left\"></td><tr><td style=\"text-align:left\"></td><td>(1)</td><td>(2)</td><td>(3)</td></tr><tr><td colspan=\"4\" style=\"border-bottom: 1px solid black\"></td></tr><tr><td style=\"text-align:left\">const</td><td>0.488<sup>***</sup></td><td>0.497<sup>***</sup></td><td>0.755<sup>***</sup></td></tr><tr><td style=\"text-align:left\"></td><td>(0.012)</td><td>(0.013)</td><td>(0.067)</td></tr><tr><td style=\"text-align:left\">guest_black</td><td>-0.080<sup>***</sup></td><td>-0.080<sup>***</sup></td><td>-0.087<sup>***</sup></td></tr><tr><td style=\"text-align:left\"></td><td>(0.017)</td><td>(0.017)</td><td>(0.017)</td></tr><tr><td style=\"text-align:left\">host_gender_M</td><td></td><td>-0.050<sup>***</sup></td><td>-0.048<sup>***</sup></td></tr><tr><td style=\"text-align:left\"></td><td></td><td>(0.014)</td><td>(0.014)</td></tr><tr><td style=\"text-align:left\">host_race_black</td><td></td><td>0.069<sup>***</sup></td><td>0.093<sup>***</sup></td></tr><tr><td style=\"text-align:left\"></td><td></td><td>(0.023)</td><td>(0.023)</td></tr><tr><td style=\"text-align:left\">log_price</td><td></td><td></td><td>-0.062<sup>***</sup></td></tr><tr><td style=\"text-align:left\"></td><td></td><td></td><td>(0.013)</td></tr><tr><td style=\"text-align:left\">multiple_listings</td><td></td><td></td><td>0.062<sup>***</sup></td></tr><tr><td style=\"text-align:left\"></td><td></td><td></td><td>(0.015)</td></tr><tr><td style=\"text-align:left\">shared_property</td><td></td><td></td><td>-0.068<sup>***</sup></td></tr><tr><td style=\"text-align:left\"></td><td></td><td></td><td>(0.017)</td></tr><tr><td style=\"text-align:left\">ten_reviews</td><td></td><td></td><td>0.120<sup>***</sup></td></tr><tr><td 
style=\"text-align:left\"></td><td></td><td></td><td>(0.013)</td></tr><td colspan=\"4\" style=\"border-bottom: 1px solid black\"></td></tr><tr><td style=\"text-align: left\">Observations</td><td>6,235</td><td>6,235</td><td>6,168</td></tr><tr><td style=\"text-align: left\">R<sup>2</sup></td><td>0.006</td><td>0.010</td><td>0.040</td></tr><tr><td style=\"text-align: left\">Adjusted R<sup>2</sup></td><td>0.006</td><td>0.009</td><td>0.039</td></tr><tr><td style=\"text-align: left\">Residual Std. Error</td><td>0.496 (df=6233)</td><td>0.495 (df=6231)</td><td>0.488 (df=6160)</td></tr><tr><td style=\"text-align: left\">F Statistic</td><td>21.879<sup>***</sup> (df=1; 6233)</td><td>15.899<sup>***</sup> (df=3; 6231)</td><td>35.523<sup>***</sup> (df=7; 6160)</td></tr><tr><td colspan=\"4\" style=\"border-bottom: 1px solid black\"></td></tr><tr><td style=\"text-align: left\">Note:</td>\n",
              " <td colspan=\"3\" style=\"text-align: right\">\n",
              "  <sup>*</sup>p&lt;0.1;\n",
              "  <sup>**</sup>p&lt;0.05;\n",
              "  <sup>***</sup>p&lt;0.01\n",
              " </td></tr></table>"
            ],
            "text/plain": [
              "<stargazer.stargazer.Stargazer at 0x7f96352eec88>"
            ]
          },
          "metadata": {
            "tags": []
          },
          "execution_count": 5
        }
      ]
    },
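    {
      "cell_type": "markdown",
      "metadata": {},
      "source": [
        "As a check on the reading of column 1 in the table above, the two acceptance rates follow directly from the coefficients. The constant is the estimated acceptance rate for White-sounding names, and adding the coefficient on `guest_black` gives the rate for Black-sounding names:\n",
        "\n",
        "$$\\widehat{P}(\\text{yes} \\mid \\text{White-sounding name}) = 0.488$$\n",
        "\n",
        "$$\\widehat{P}(\\text{yes} \\mid \\text{Black-sounding name}) = 0.488 - 0.080 = 0.408$$\n",
        "\n",
        "Measured relative to the White baseline, the gap of 8 percentage points corresponds to a decline of roughly 16%, which matches the figure reported by Edelman et al. (2017):\n",
        "\n",
        "$$\\frac{0.080}{0.488} \\approx 0.164$$"
      ]
    },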
    {
      "cell_type": "markdown",
      "metadata": {
        "id": "i511M34aFziV"
      },
      "source": [
        "The table below presents summary statistics for the hosts and properties. In a randomized experiment, the mean of each control variable should be approximately the same in the treatment group and the control group, and therefore close to the overall mean."
      ]
    },
    {
      "cell_type": "code",
      "metadata": {
        "id": "Dw1KePxjihFt",
        "outputId": "153bb7dc-3953-49ab-c283-2074685de78e",
        "colab": {
          "base_uri": "https://localhost:8080/",
          "height": 417
        }
      },
      "source": [
        "control = ['host_race_white', 'host_race_black', 'host_gender_F',\n",
        "           'host_gender_M', 'price', 'bedrooms', 'bathrooms', 'number_of_reviews',\n",
        "           'multiple_listings', 'any_black', 'tract_listings', 'black_proportion']\n",
        "\n",
        "df.describe()[control].T"
      ],
      "execution_count": 6,
      "outputs": [
        {
          "output_type": "execute_result",
          "data": {
            "text/html": [
              "<div>\n",
              "<style scoped>\n",
              "    .dataframe tbody tr th:only-of-type {\n",
              "        vertical-align: middle;\n",
              "    }\n",
              "\n",
              "    .dataframe tbody tr th {\n",
              "        vertical-align: top;\n",
              "    }\n",
              "\n",
              "    .dataframe thead th {\n",
              "        text-align: right;\n",
              "    }\n",
              "</style>\n",
              "<table border=\"1\" class=\"dataframe\">\n",
              "  <thead>\n",
              "    <tr style=\"text-align: right;\">\n",
              "      <th></th>\n",
              "      <th>count</th>\n",
              "      <th>mean</th>\n",
              "      <th>std</th>\n",
              "      <th>min</th>\n",
              "      <th>25%</th>\n",
              "      <th>50%</th>\n",
              "      <th>75%</th>\n",
              "      <th>max</th>\n",
              "    </tr>\n",
              "  </thead>\n",
              "  <tbody>\n",
              "    <tr>\n",
              "      <th>host_race_white</th>\n",
              "      <td>6392.0</td>\n",
              "      <td>0.634</td>\n",
              "      <td>0.482</td>\n",
              "      <td>0.0</td>\n",
              "      <td>0.00</td>\n",
              "      <td>1.00</td>\n",
              "      <td>1.000</td>\n",
              "      <td>1.000</td>\n",
              "    </tr>\n",
              "    <tr>\n",
              "      <th>host_race_black</th>\n",
              "      <td>6392.0</td>\n",
              "      <td>0.078</td>\n",
              "      <td>0.269</td>\n",
              "      <td>0.0</td>\n",
              "      <td>0.00</td>\n",
              "      <td>0.00</td>\n",
              "      <td>0.000</td>\n",
              "      <td>1.000</td>\n",
              "    </tr>\n",
              "    <tr>\n",
              "      <th>host_gender_F</th>\n",
              "      <td>6392.0</td>\n",
              "      <td>0.376</td>\n",
              "      <td>0.485</td>\n",
              "      <td>0.0</td>\n",
              "      <td>0.00</td>\n",
              "      <td>0.00</td>\n",
              "      <td>1.000</td>\n",
              "      <td>1.000</td>\n",
              "    </tr>\n",
              "    <tr>\n",
              "      <th>host_gender_M</th>\n",
              "      <td>6392.0</td>\n",
              "      <td>0.298</td>\n",
              "      <td>0.457</td>\n",
              "      <td>0.0</td>\n",
              "      <td>0.00</td>\n",
              "      <td>0.00</td>\n",
              "      <td>1.000</td>\n",
              "      <td>1.000</td>\n",
              "    </tr>\n",
              "    <tr>\n",
              "      <th>price</th>\n",
              "      <td>6302.0</td>\n",
              "      <td>181.108</td>\n",
              "      <td>1280.228</td>\n",
              "      <td>10.0</td>\n",
              "      <td>75.00</td>\n",
              "      <td>109.00</td>\n",
              "      <td>175.000</td>\n",
              "      <td>100000.000</td>\n",
              "    </tr>\n",
              "    <tr>\n",
              "      <th>bedrooms</th>\n",
              "      <td>6242.0</td>\n",
              "      <td>3.177</td>\n",
              "      <td>2.265</td>\n",
              "      <td>1.0</td>\n",
              "      <td>2.00</td>\n",
              "      <td>2.00</td>\n",
              "      <td>4.000</td>\n",
              "      <td>16.000</td>\n",
              "    </tr>\n",
              "    <tr>\n",
              "      <th>bathrooms</th>\n",
              "      <td>6285.0</td>\n",
              "      <td>3.169</td>\n",
              "      <td>2.264</td>\n",
              "      <td>1.0</td>\n",
              "      <td>2.00</td>\n",
              "      <td>2.00</td>\n",
              "      <td>4.000</td>\n",
              "      <td>16.000</td>\n",
              "    </tr>\n",
              "    <tr>\n",
              "      <th>number_of_reviews</th>\n",
              "      <td>6390.0</td>\n",
              "      <td>30.869</td>\n",
              "      <td>72.505</td>\n",
              "      <td>0.0</td>\n",
              "      <td>2.00</td>\n",
              "      <td>9.00</td>\n",
              "      <td>29.000</td>\n",
              "      <td>1208.000</td>\n",
              "    </tr>\n",
              "    <tr>\n",
              "      <th>multiple_listings</th>\n",
              "      <td>6392.0</td>\n",
              "      <td>0.326</td>\n",
              "      <td>0.469</td>\n",
              "      <td>0.0</td>\n",
              "      <td>0.00</td>\n",
              "      <td>0.00</td>\n",
              "      <td>1.000</td>\n",
              "      <td>1.000</td>\n",
              "    </tr>\n",
              "    <tr>\n",
              "      <th>any_black</th>\n",
              "      <td>6390.0</td>\n",
              "      <td>0.282</td>\n",
              "      <td>0.450</td>\n",
              "      <td>0.0</td>\n",
              "      <td>0.00</td>\n",
              "      <td>0.00</td>\n",
              "      <td>1.000</td>\n",
              "      <td>1.000</td>\n",
              "    </tr>\n",
              "    <tr>\n",
              "      <th>tract_listings</th>\n",
              "      <td>6392.0</td>\n",
              "      <td>9.514</td>\n",
              "      <td>9.277</td>\n",
              "      <td>1.0</td>\n",
              "      <td>2.00</td>\n",
              "      <td>6.00</td>\n",
              "      <td>14.000</td>\n",
              "      <td>53.000</td>\n",
              "    </tr>\n",
              "    <tr>\n",
              "      <th>black_proportion</th>\n",
              "      <td>6378.0</td>\n",
              "      <td>0.140</td>\n",
              "      <td>0.203</td>\n",
              "      <td>0.0</td>\n",
              "      <td>0.03</td>\n",
              "      <td>0.05</td>\n",
              "      <td>0.142</td>\n",
              "      <td>0.984</td>\n",
              "    </tr>\n",
              "  </tbody>\n",
              "</table>\n",
              "</div>"
            ],
            "text/plain": [
              "                    count     mean       std  ...     50%      75%         max\n",
              "host_race_white    6392.0    0.634     0.482  ...    1.00    1.000       1.000\n",
              "host_race_black    6392.0    0.078     0.269  ...    0.00    0.000       1.000\n",
              "host_gender_F      6392.0    0.376     0.485  ...    0.00    1.000       1.000\n",
              "host_gender_M      6392.0    0.298     0.457  ...    0.00    1.000       1.000\n",
              "price              6302.0  181.108  1280.228  ...  109.00  175.000  100000.000\n",
              "bedrooms           6242.0    3.177     2.265  ...    2.00    4.000      16.000\n",
              "bathrooms          6285.0    3.169     2.264  ...    2.00    4.000      16.000\n",
              "number_of_reviews  6390.0   30.869    72.505  ...    9.00   29.000    1208.000\n",
              "multiple_listings  6392.0    0.326     0.469  ...    0.00    1.000       1.000\n",
              "any_black          6390.0    0.282     0.450  ...    0.00    1.000       1.000\n",
              "tract_listings     6392.0    9.514     9.277  ...    6.00   14.000      53.000\n",
              "black_proportion   6378.0    0.140     0.203  ...    0.05    0.142       0.984\n",
              "\n",
              "[12 rows x 8 columns]"
            ]
          },
          "metadata": {
            "tags": []
          },
          "execution_count": 6
        }
      ]
    },
    {
      "cell_type": "markdown",
      "metadata": {
        "id": "jOGYLV2pGPiU"
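      },
      "source": [
        "The balance tests (t-tests) below show no statistically significant difference in host and property characteristics between the requests sent under Black-sounding names and those sent under White-sounding names."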
      ]
    },
    {
      "cell_type": "code",
      "metadata": {
        "id": "HMK0M1EiihId"
      },
      "source": [
        "result = []\n",
        "\n",
        "for var in control:\n",
        "    # Regress the covariate on the treatment dummy. The p-value on\n",
        "    # guest_black is equivalent to a two-sample t-test of equal means.\n",
        "    pvalue = sm.OLS(df[var], df[['const', 'guest_black']],\n",
        "                    missing='drop').fit().pvalues['guest_black']\n",
        "    result.append(pvalue)"
      ],
      "execution_count": 7,
      "outputs": []
    },
    {
      "cell_type": "code",
      "metadata": {
        "id": "Y1sFdeAkiqJ2",
        "outputId": "abc750b2-1196-4aad-90c2-61d4b7eda5dd",
        "colab": {
          "base_uri": "https://localhost:8080/",
          "height": 417
        }
      },
      "source": [
        "# Covariate means by treatment status (0 = White name, 1 = Black name)\n",
        "ttest = df.groupby('guest_black')[control].agg(['mean']).T\n",
        "ttest['p-value'] = result\n",
        "ttest"
      ],
      "execution_count": 8,
      "outputs": [
        {
          "output_type": "execute_result",
          "data": {
            "text/html": [
              "<div>\n",
              "<style scoped>\n",
              "    .dataframe tbody tr th:only-of-type {\n",
              "        vertical-align: middle;\n",
              "    }\n",
              "\n",
              "    .dataframe tbody tr th {\n",
              "        vertical-align: top;\n",
              "    }\n",
              "\n",
              "    .dataframe thead th {\n",
              "        text-align: right;\n",
              "    }\n",
              "</style>\n",
              "<table border=\"1\" class=\"dataframe\">\n",
              "  <thead>\n",
              "    <tr style=\"text-align: right;\">\n",
              "      <th></th>\n",
              "      <th>guest_black</th>\n",
              "      <th>0.0</th>\n",
              "      <th>1.0</th>\n",
              "      <th>p-value</th>\n",
              "    </tr>\n",
              "  </thead>\n",
              "  <tbody>\n",
              "    <tr>\n",
              "      <th>host_race_white</th>\n",
              "      <th>mean</th>\n",
              "      <td>0.643</td>\n",
              "      <td>0.626</td>\n",
              "      <td>0.154</td>\n",
              "    </tr>\n",
              "    <tr>\n",
              "      <th>host_race_black</th>\n",
              "      <th>mean</th>\n",
              "      <td>0.078</td>\n",
              "      <td>0.078</td>\n",
              "      <td>0.972</td>\n",
              "    </tr>\n",
              "    <tr>\n",
              "      <th>host_gender_F</th>\n",
              "      <th>mean</th>\n",
              "      <td>0.381</td>\n",
              "      <td>0.372</td>\n",
              "      <td>0.439</td>\n",
              "    </tr>\n",
              "    <tr>\n",
              "      <th>host_gender_M</th>\n",
              "      <th>mean</th>\n",
              "      <td>0.298</td>\n",
              "      <td>0.299</td>\n",
              "      <td>0.896</td>\n",
              "    </tr>\n",
              "    <tr>\n",
              "      <th>price</th>\n",
              "      <th>mean</th>\n",
              "      <td>166.429</td>\n",
              "      <td>195.815</td>\n",
              "      <td>0.362</td>\n",
              "    </tr>\n",
              "    <tr>\n",
              "      <th>bedrooms</th>\n",
              "      <th>mean</th>\n",
              "      <td>3.178</td>\n",
              "      <td>3.176</td>\n",
              "      <td>0.962</td>\n",
              "    </tr>\n",
              "    <tr>\n",
              "      <th>bathrooms</th>\n",
              "      <th>mean</th>\n",
              "      <td>3.172</td>\n",
              "      <td>3.167</td>\n",
              "      <td>0.927</td>\n",
              "    </tr>\n",
              "    <tr>\n",
              "      <th>number_of_reviews</th>\n",
              "      <th>mean</th>\n",
              "      <td>30.709</td>\n",
              "      <td>31.030</td>\n",
              "      <td>0.860</td>\n",
              "    </tr>\n",
              "    <tr>\n",
              "      <th>multiple_listings</th>\n",
              "      <th>mean</th>\n",
              "      <td>0.321</td>\n",
              "      <td>0.330</td>\n",
              "      <td>0.451</td>\n",
              "    </tr>\n",
              "    <tr>\n",
              "      <th>any_black</th>\n",
              "      <th>mean</th>\n",
              "      <td>0.287</td>\n",
              "      <td>0.277</td>\n",
              "      <td>0.382</td>\n",
              "    </tr>\n",
              "    <tr>\n",
              "      <th>tract_listings</th>\n",
              "      <th>mean</th>\n",
              "      <td>9.494</td>\n",
              "      <td>9.538</td>\n",
              "      <td>0.848</td>\n",
              "    </tr>\n",
              "    <tr>\n",
              "      <th>black_proportion</th>\n",
              "      <th>mean</th>\n",
              "      <td>0.141</td>\n",
              "      <td>0.140</td>\n",
              "      <td>0.919</td>\n",
              "    </tr>\n",
              "  </tbody>\n",
              "</table>\n",
              "</div>"
            ],
            "text/plain": [
              "guest_black                 0.0      1.0  p-value\n",
              "host_race_white   mean    0.643    0.626    0.154\n",
              "host_race_black   mean    0.078    0.078    0.972\n",
              "host_gender_F     mean    0.381    0.372    0.439\n",
              "host_gender_M     mean    0.298    0.299    0.896\n",
              "price             mean  166.429  195.815    0.362\n",
              "bedrooms          mean    3.178    3.176    0.962\n",
              "bathrooms         mean    3.172    3.167    0.927\n",
              "number_of_reviews mean   30.709   31.030    0.860\n",
              "multiple_listings mean    0.321    0.330    0.451\n",
              "any_black         mean    0.287    0.277    0.382\n",
              "tract_listings    mean    9.494    9.538    0.848\n",
              "black_proportion  mean    0.141    0.140    0.919"
            ]
          },
          "metadata": {
            "tags": []
          },
          "execution_count": 8
        }
      ]
    },
    {
      "cell_type": "markdown",
      "metadata": {
        "id": "_LhtUSOZE5s3"
      },
      "source": [
        "## Exercises"
      ]
    },
    {
      "cell_type": "markdown",
      "metadata": {
        "id": "tx6eyoPl3yWU"
      },
      "source": [
        "1| To the best of my knowledge, the three most important empirical papers in the literature on racial discrimination are Bertrand & Mullainathan (2004), Oreopoulos (2011), and Edelman et al. (2017). All three use a field experiment to identify causal effects and rule out confounding factors. Search the Internet and compile a reference list of experimental papers about racial discrimination."
      ]
    },
    {
      "cell_type": "markdown",
      "metadata": {
        "id": "zk0rssc65YXj"
      },
      "source": [
        "2| Name a topic that you are passionate about. Compile a reference list of experimental papers on that topic."
      ]
    },
    {
      "cell_type": "markdown",
      "metadata": {
        "id": "m9vmejzD1vSL"
      },
      "source": [
        "3| Suppose somebody argues that specific names drive the results of Edelman et al. (2017). In the tables below, you can see that only a few distinct names represent Black and White guests. How can this criticism be refuted? What can you do to show that the results are not driven by specific names?"
      ]
    },
    {
      "cell_type": "code",
      "metadata": {
        "id": "o-cK7fW-SnA_",
        "outputId": "80c49260-be15-4fa3-9d9f-337a92bd6c67",
        "colab": {
          "base_uri": "https://localhost:8080/"
        }
      },
      "source": [
        "# Acceptance rate by first name for female guests\n",
        "female = df['guest_gender'] == 'female'\n",
        "df[female].groupby(['guest_race', 'guest_first_name'])['yes'].mean()"
      ],
      "execution_count": 9,
      "outputs": [
        {
          "output_type": "execute_result",
          "data": {
            "text/plain": [
              "guest_race  guest_first_name\n",
              "black       Lakisha             0.433\n",
              "            Latonya             0.370\n",
              "            Latoya              0.442\n",
              "            Tamika              0.482\n",
              "            Tanisha             0.413\n",
              "white       Allison             0.500\n",
              "            Anne                0.567\n",
              "            Kristen             0.486\n",
              "            Laurie              0.508\n",
              "            Meredith            0.498\n",
              "Name: yes, dtype: float64"
            ]
          },
          "metadata": {
            "tags": []
          },
          "execution_count": 9
        }
      ]
    },
    {
      "cell_type": "code",
      "metadata": {
        "id": "IuQ3O1-MUkkP",
        "outputId": "3e3755ca-144b-4638-9877-95f51f882782",
        "colab": {
          "base_uri": "https://localhost:8080/"
        }
      },
      "source": [
        "# Acceptance rate by first name for male guests\n",
        "male = df['guest_gender'] == 'male'\n",
        "df[male].groupby(['guest_race', 'guest_first_name'])['yes'].mean()"
      ],
      "execution_count": 10,
      "outputs": [
        {
          "output_type": "execute_result",
          "data": {
            "text/plain": [
              "guest_race  guest_first_name\n",
              "black       Darnell             0.412\n",
              "            Jamal               0.354\n",
              "            Jermaine            0.379\n",
              "            Kareem              0.436\n",
              "            Leroy               0.371\n",
              "            Rasheed             0.409\n",
              "            Tyrone              0.377\n",
              "white       Brad                0.419\n",
              "            Brent               0.494\n",
              "            Brett               0.466\n",
              "            Greg                0.467\n",
              "            Jay                 0.581\n",
              "            Todd                0.448\n",
              "Name: yes, dtype: float64"
            ]
          },
          "metadata": {
            "tags": []
          },
          "execution_count": 10
        }
      ]
    },
    {
      "cell_type": "markdown",
      "metadata": {
        "id": "kyWJyokoyR-I"
      },
      "source": [
        "4| Is there a potential research question that could be explored based on the table below? Justify your answer."
      ]
    },
    {
      "cell_type": "code",
      "metadata": {
        "id": "slexRu8m0O-M",
        "outputId": "b71d8c7e-2d1d-40fe-ab7f-36936bc1e0d5",
        "colab": {
          "base_uri": "https://localhost:8080/",
          "height": 570
        }
      },
      "source": [
        "# Mean of 'yes' for each host (gender, race) row and guest (gender, race) column\n",
        "pd.crosstab(index=[df['host_gender_F'], df['host_race']],\n",
        "            columns=[df['guest_gender'], df['guest_race']],\n",
        "            values=df['yes'], aggfunc='mean')"
      ],
      "execution_count": 11,
      "outputs": [
        {
          "output_type": "execute_result",
          "data": {
            "text/html": [
              "<div>\n",
              "<style scoped>\n",
              "    .dataframe tbody tr th:only-of-type {\n",
              "        vertical-align: middle;\n",
              "    }\n",
              "\n",
              "    .dataframe tbody tr th {\n",
              "        vertical-align: top;\n",
              "    }\n",
              "\n",
              "    .dataframe thead tr th {\n",
              "        text-align: left;\n",
              "    }\n",
              "\n",
              "    .dataframe thead tr:last-of-type th {\n",
              "        text-align: right;\n",
              "    }\n",
              "</style>\n",
              "<table border=\"1\" class=\"dataframe\">\n",
              "  <thead>\n",
              "    <tr>\n",
              "      <th></th>\n",
              "      <th>guest_gender</th>\n",
              "      <th colspan=\"2\" halign=\"left\">female</th>\n",
              "      <th colspan=\"2\" halign=\"left\">male</th>\n",
              "    </tr>\n",
              "    <tr>\n",
              "      <th></th>\n",
              "      <th>guest_race</th>\n",
              "      <th>black</th>\n",
              "      <th>white</th>\n",
              "      <th>black</th>\n",
              "      <th>white</th>\n",
              "    </tr>\n",
              "    <tr>\n",
              "      <th>host_gender_F</th>\n",
              "      <th>host_race</th>\n",
              "      <th></th>\n",
              "      <th></th>\n",
              "      <th></th>\n",
              "      <th></th>\n",
              "    </tr>\n",
              "  </thead>\n",
              "  <tbody>\n",
              "    <tr>\n",
              "      <th rowspan=\"8\" valign=\"top\">0</th>\n",
              "      <th>UU</th>\n",
              "      <td>0.400</td>\n",
              "      <td>0.542</td>\n",
              "      <td>0.158</td>\n",
              "      <td>0.381</td>\n",
              "    </tr>\n",
              "    <tr>\n",
              "      <th>asian</th>\n",
              "      <td>0.319</td>\n",
              "      <td>0.378</td>\n",
              "      <td>0.474</td>\n",
              "      <td>0.511</td>\n",
              "    </tr>\n",
              "    <tr>\n",
              "      <th>black</th>\n",
              "      <td>0.444</td>\n",
              "      <td>0.643</td>\n",
              "      <td>0.419</td>\n",
              "      <td>0.569</td>\n",
              "    </tr>\n",
              "    <tr>\n",
              "      <th>hisp</th>\n",
              "      <td>0.464</td>\n",
              "      <td>0.571</td>\n",
              "      <td>0.375</td>\n",
              "      <td>0.478</td>\n",
              "    </tr>\n",
              "    <tr>\n",
              "      <th>mult</th>\n",
              "      <td>0.568</td>\n",
              "      <td>0.727</td>\n",
              "      <td>0.408</td>\n",
              "      <td>0.357</td>\n",
              "    </tr>\n",
              "    <tr>\n",
              "      <th>unclear</th>\n",
              "      <td>0.444</td>\n",
              "      <td>0.500</td>\n",
              "      <td>0.444</td>\n",
              "      <td>0.333</td>\n",
              "    </tr>\n",
              "    <tr>\n",
              "      <th>unclear_three votes</th>\n",
              "      <td>0.476</td>\n",
              "      <td>0.392</td>\n",
              "      <td>0.368</td>\n",
              "      <td>0.367</td>\n",
              "    </tr>\n",
              "    <tr>\n",
              "      <th>white</th>\n",
              "      <td>0.383</td>\n",
              "      <td>0.514</td>\n",
              "      <td>0.386</td>\n",
              "      <td>0.449</td>\n",
              "    </tr>\n",
              "    <tr>\n",
              "      <th rowspan=\"7\" valign=\"top\">1</th>\n",
              "      <th>UU</th>\n",
              "      <td>0.444</td>\n",
              "      <td>0.250</td>\n",
              "      <td>0.333</td>\n",
              "      <td>0.750</td>\n",
              "    </tr>\n",
              "    <tr>\n",
              "      <th>asian</th>\n",
              "      <td>0.429</td>\n",
              "      <td>0.607</td>\n",
              "      <td>0.436</td>\n",
              "      <td>0.460</td>\n",
              "    </tr>\n",
              "    <tr>\n",
              "      <th>black</th>\n",
              "      <td>0.603</td>\n",
              "      <td>0.537</td>\n",
              "      <td>0.397</td>\n",
              "      <td>0.446</td>\n",
              "    </tr>\n",
              "    <tr>\n",
              "      <th>hisp</th>\n",
              "      <td>0.391</td>\n",
              "      <td>0.667</td>\n",
              "      <td>0.292</td>\n",
              "      <td>0.389</td>\n",
              "    </tr>\n",
              "    <tr>\n",
              "      <th>unclear</th>\n",
              "      <td>0.600</td>\n",
              "      <td>0.556</td>\n",
              "      <td>0.125</td>\n",
              "      <td>0.400</td>\n",
              "    </tr>\n",
              "    <tr>\n",
              "      <th>unclear_three votes</th>\n",
              "      <td>0.387</td>\n",
              "      <td>0.583</td>\n",
              "      <td>0.312</td>\n",
              "      <td>0.657</td>\n",
              "    </tr>\n",
              "    <tr>\n",
              "      <th>white</th>\n",
              "      <td>0.450</td>\n",
              "      <td>0.494</td>\n",
              "      <td>0.370</td>\n",
              "      <td>0.476</td>\n",
              "    </tr>\n",
              "  </tbody>\n",
              "</table>\n",
              "</div>"
            ],
            "text/plain": [
              "guest_gender                      female          male       \n",
              "guest_race                         black  white  black  white\n",
              "host_gender_F host_race                                      \n",
              "0             UU                   0.400  0.542  0.158  0.381\n",
              "              asian                0.319  0.378  0.474  0.511\n",
              "              black                0.444  0.643  0.419  0.569\n",
              "              hisp                 0.464  0.571  0.375  0.478\n",
              "              mult                 0.568  0.727  0.408  0.357\n",
              "              unclear              0.444  0.500  0.444  0.333\n",
              "              unclear_three votes  0.476  0.392  0.368  0.367\n",
              "              white                0.383  0.514  0.386  0.449\n",
              "1             UU                   0.444  0.250  0.333  0.750\n",
              "              asian                0.429  0.607  0.436  0.460\n",
              "              black                0.603  0.537  0.397  0.446\n",
              "              hisp                 0.391  0.667  0.292  0.389\n",
              "              unclear              0.600  0.556  0.125  0.400\n",
              "              unclear_three votes  0.387  0.583  0.312  0.657\n",
              "              white                0.450  0.494  0.370  0.476"
            ]
          },
          "metadata": {
            "tags": []
          },
          "execution_count": 11
        }
      ]
    },
    {
      "cell_type": "markdown",
      "metadata": {
        "id": "NZP-ULypPfqH"
      },
      "source": [
        "5| In Edelman et al. (2017), the standard errors are clustered on the variable \"name_by_city\". How was \"name_by_city\" constructed from the other variables? Show the code."
      ]
    },
    {
      "cell_type": "markdown",
      "metadata": {
        "id": "kXfXnqgXgT9g"
      },
      "source": [
        "6| Use the data from Edelman et al. (2017) to test the homophily hypothesis, that is, that hosts prefer guests of their own race. Present the regression results in a table built with the Stargazer library. Interpret the results."
      ]
    },
    {
      "cell_type": "markdown",
      "metadata": {
        "id": "YaHJyQDpU_pu"
      },
      "source": [
        "7| Socioeconomic status is known to be correlated with race. Fryer & Levitt (2004) showed that distinctively African American names are correlated with lower socioeconomic status. Edelman et al. (2017: 17) clearly state: \"Our findings cannot identify whether the discrimination is based on race, socioeconomic status, or a combination of these two.\"\n",
        "Propose an experimental design that disentangles the effect of race from that of socioeconomic status. Explain your assumptions and describe the procedures in detail."
      ]
    },
    {
      "cell_type": "markdown",
      "metadata": {
        "id": "i-MPIny2O8Om"
      },
      "source": [
        "## References"
      ]
    },
    {
      "cell_type": "markdown",
      "metadata": {
        "id": "ZOwX1_TNpbMt"
      },
      "source": [
        "Bertrand, Marianne, and Sendhil Mullainathan. (2004). [Are Emily and Greg More Employable Than Lakisha and Jamal? A Field Experiment on Labor Market Discrimination](https://github.com/causal-methods/Papers/raw/master/Are%20Emily%20and%20Greg%20More%20Employable%20than%20Lakisha%20and%20Jamal.pdf). American Economic Review, 94 (4): 991-1013.\n",
        "\n",
        "Edelman, Benjamin, Michael Luca, and Dan Svirsky. (2017). [Racial Discrimination in the Sharing Economy: Evidence from a Field Experiment](https://github.com/causal-methods/Papers/raw/master/Racial%20Discrimination%20in%20the%20Sharing%20Economy.pdf). American Economic Journal: Applied Economics, 9 (2): 1-22.\n",
        "\n",
        "Fryer, Roland G., Jr., and Steven D. Levitt. (2004). The Causes and Consequences of Distinctively Black Names. Quarterly Journal of Economics, 119 (3): 767-805.\n",
        "\n",
        "Oreopoulos, Philip. (2011). [Why Do Skilled Immigrants Struggle in the Labor Market? A Field Experiment with Thirteen Thousand Resumes](https://github.com/causal-methods/Papers/raw/master/Oreopoulos/Why%20Do%20Skilled%20Immigrants%20Struggle%20in%20the%20Labor%20Market.pdf). American Economic Journal: Economic Policy, 3 (4): 148-71."
      ]
    }
  ]
}