WLS Data Conventions

Structure of WLS Longitudinal Data

In the wide-form, each record contains data for the entire family. There is only one ID for the public - IDPUB. IDPUB is the same for graduate respondents, their sibling and their spouse. Variables that are the same for the different respondents are named uniquely. See variable naming conventions for details.

Missing Values

The following missing value codes are used throughout 1957-2011 data. 1957-1994 data only include codes -1 through -4. In the online codebook, clicking on "Inap" for value -2 expands the description to indicate which cases are inappropriate. Clicking on the expanded description reduces the label back to "Inap".

Value Label
-1 Don't know
-2 Inappropriate
-3 Refused
-4 Not ascertained
-5 Partial interview
-6 Could Not Code
-7 No such income/assets/pensions/inheritance
-8 dk/r screener
-9 Item not on path
-10 Global refusal
-11 At most part time worker
-12 At least full time worker
-13 Amount Differs For Generic/Brand-Name
-14 Amount Changes Over The Year (Vol)
-15 Multiple Sessions
-16 Never went to usual provider
-17 Special Missing Code for Allocation Cases
-18 Unable to Code (ICD-9 related questions)
-19 Occupational scores not assigned to military occupations
-20 Amount varies
-21 Pays all costs not covered by the plan
-22 Different payment arrangement
-23 Attended a HS in Wisconsin, did not graduate from that HS
-24 Volunteered, < 1 drink per day
-25 Only drank at religious services
-26 Unable to code, multiple people mentioned
-27 Not asked of Proxy
-28 Not asked of participant via telephone
-29 Left Blank on SAQ
-30 Not part of MOSAQ
-31 Invalid Activity Reported As Job

Note: From 1964-1977, if the respondent was not fielded or did not respond at
all to the survey, all variables for the respondent in that year of the survey
are coded -3 (originally labeled refused, not ascertained).
From 1993-2011, if the respondent was not fielded or did not respond to the 
survey at all, all variables for the respondent in that year are coded 
system missing (. for numeric variables and blank for string variables).
	

Variable Naming


If the variable name is a mnemonic, then that variable is from 1957-1977 surveys.
From 1993 onward, variable names consist of a combination of letters and numbers
that allow for convenient identification of the year and mode of the survey.

Variables that differ only by year of the survey have identical names for every 
character past the first letter.  Please note that the fielding of the surveys 
spanned multiple years.  In the table below we simplify the year by listing only
the year where the greatest number of surveys were completed. 


First Letter Grad First Letter Sib Grad Year Sib Year Mode
R
S
1993
1994
Phone
G
C
2004
2005
Phone
H
K
2011
2011
In-Person
M
N
1993
1994
SAQ
I
D
2004
2005
SAQ
J
P
2011
2011
SAQ
 

Menu