SSCC - Social Science Computing Cooperative Supporting Statistical Analysis for Research

4.2 Naming variables

These exercises use the PSID.csv data set that was imported in the prior section.

  1. Import the PSID.csv data set.

    library(tidyverse)
    psid_path <- file.path("..", "datasets", "PSID.csv")
    psid <- read_csv(psid_path, col_types = cols())
    Warning: Missing column names filled in: 'X1' [1]
    glimpse(psid)
    Observations: 4,856
    Variables: 9
    $ X1       <dbl> 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16...
    $ intnum   <dbl> 4, 4, 4, 4, 5, 6, 6, 7, 7, 7, 10, 10, 10, 11, 13, 13,...
    $ persnum  <dbl> 4, 6, 7, 173, 2, 4, 172, 4, 170, 171, 3, 171, 178, 17...
    $ age      <dbl> 39, 35, 33, 39, 47, 44, 38, 38, 39, 37, 48, 47, 40, 3...
    $ educatn  <dbl> 12, 12, 12, 10, 9, 12, 16, 9, 12, 11, 13, 12, 12, 16,...
    $ earnings <dbl> 77250, 12000, 8000, 15000, 6500, 6500, 7000, 5000, 21...
    $ hours    <dbl> 2940, 2040, 693, 1904, 1683, 2024, 1144, 2080, 2575, ...
    $ kids     <dbl> 2, 2, 1, 2, 5, 2, 3, 4, 3, 5, 98, 3, 0, 0, 2, 0, 0, 1...
    $ married  <chr> "married", "divorced", "married", "married", "married...
  2. Set the variable names to something useful, if they are not already. Change at least one name.

    psid <-
      rename(
        psid,
        obs_num = X1,
        intvw_num = intnum,
        person_id = persnum,
        marital_status = married
        )