CPS Utilities Extraction Report April 10, 2005, 4.05 PM Source: Education and School Enrollment 1978 October CPS Request file name ==> c:\cpsworking\mdat4903_oct.req Source file name ==> e:\data\oct78.z Stata dataset file name ==> c:\cpsworking\mdat4903_oct\oct78.dta This report file name ==> c:\cpsworking\mdat4903_oct\oct78.rpt Warning: 'grdatn' is not available for 1978. Warning: 'subhh' is not available for 1978. Warning: 'famrel' is not available for 1978. Warning: 'famnum' is not available for 1978. Warning: 'intstat' is not available for 1978. Warning: 'famwgt' is not available for 1978. Warning: 'hhwgt' is not available for 1978. Warning: 'agrdatn' is not available for 1978. Number of records in source file is 162116. Number of records read is 162116. Number of records written is 162116. Value summaries: Variable Observations Minimum Maximum Mean recnum 162116 1 162116 81058.50 age 149732 0 99 33.12 marstat 116778 1 5 2.49 race 149732 1 3 1.15 sex 149732 1 2 1.52 rrp 149732 1 6 2.20 hhid "char" variable, cannot be summarized. hhnum 129141 1 5 1.09 _year 162116 1978 1978 1978.00 grdhi 116778 1 19 12.83 grdcom 116778 1 2 1.29 mis 162116 1 8 4.53 lineno 149732 0 32 2.39 wgtfnl 162116 0 856344 132241.05 grdatt 43214 1 19 9.85 rectyp 162116 1 3 1.14 agrdhi 149732 0 1 0.00 state 162116 11 95 52.88 spneth "char" variable, cannot be summarized. Options selected: Include labels in Stata dataset Create Stata dataset file directly Description of selected variables: <recnum> Unique record ID number - Unicon created Each observation is assigned a unique id number, facilitating later merges with the same file should the user wish to extract additional variables at a later time. This ID variable can not be used to match different files across years; instead, refer to the discussion in Appendix S. The maximum value of this variable is the total record count from each file. There is no corresponding variable to be found in the census files; therefore, no column numbers are listed here. Type: internal Topic: record keeping Code: Year Total Count Year Total Count 1968 155,369 1969 146,835 1970 149,208 1971 142,623 1972 144,813 1973 139,501 1974 136,608 1975 134,898 1976 133,697 1977 146,250 1978 162,116 1979 160,666 1980 188,318 1981 170,567 1982 169,099 1983 167,502 1984 165,490 1985 165,995 1986 163,303 1987 163,816 1988 154,224 1989 161,750 1990 164,259 1991 162,138 1992 159,439 1993 157,154 1994 153,030 1995 148,392 1996 135,845 1997 135,599 1998 135,673 1999 136,710 2000 135,283 Note: Universe - all records _______________________________________________________________________________ <age> Age of person as of end of survey week Original location, length, and name of variable: 68-81 82-88 89-93 94-00 97 97 120 122 2 2 2 2 I27 I27 A-AGE PEAGE Type: basic Topic: demography Related variable: aage - allocation flag agetop - topcode Code: 68-84 85-88 89-00 Adult age 14-99 14-90 15-90 Child age 0-13 0-13 0-14 In 1994 forward, missings are denoted by '-1'. Note: Universe - all records 1985+ are topcoded at 90. _______________________________________________________________________________ <marstat> Marital status Original location, length, and name of variable: 68-81 82-88 89-93 94-00 99 99 122 125 1 1 1 2 I18E I18E A-MARITL PEMARITL Type: basic Topic: demography Related variable: amarstat - allocation flag Code: 68-88 89-93 94-00 Married, civilian spouse present 1 1 1 Married, AF spouse present 2 2 1 Married - spouse abs (inc separated) 3 Married - spouse abs (exc separated) 3 Married, spouse absent 2 Widowed 4 4 3 Divorced 4 5 4 Separated 3 6 5 Never married 5 7 6 Note: 1968-1993 Universe - all adult records, including Armed Forces 1994-2000 Universe - those people who are 15 years old or more (age >= 15) (The universe is the same for all years but 94+ were more explicit.) The questionnaires in 1979-1988 say 'Married, spouse absent excluding separated' yet the variable code during these years includes separated in value 3. _______________________________________________________________________________ <grdatn> Educational attainment Original location, length, and name of variable: 92-93 94-00 127 137 2 2 A-HGA PEEDUCA Type: basic Topic: education Related variable: agrdatn - allocation flag _educ - education recode, Unicon (68-91) grdhi - highest grade attended (68-92) grdcom - completed highest grade attended (68-92) Code: 92-00 None or children 00 Less than 1st grade 31 1st, 2nd, 3rd or 4th grade 32 5th or 6th grade 33 7th or 8th grade 34 9th grade 35 10th grade 36 11th grade 37 12th grade-no diploma 38 High school graduate - diploma or GED, etc. 39 Some college but not degree 40 Associate's degree in college - occ/voc program 41 Associate's degree in college - academic program 42 Bachelor's degree in college - BA, BS, AB, etc. 43 Master's degree - MA, MS, MBA etc. 44 Professional school degree - MD, DDS, DVM, etc. 45 Doctorate degree - PhD, EdD, etc. 46 Note: 1992-1993 Universe - all adult records, including Armed Forces 1994-2000 Universe - all adult records (popstat = 2 or 3) _______________________________________________________________________________ <race> Race Original location, length, and name of variable: 68-81 82-88 89-93 94-00 100 100 130 139 1 1 1 2 I29 I29 A-RACE PERACE Type: basic Topic: demography Related variable: arace - allocation flag Code: 68-88 89-95 96-00 White 1 1 1 Black 2 2 2 Amer Indian, Aleut Eskimo 3 3 Asian/Pacific Islander 4 4 Other 3 5 In 1994 forward, missings are denoted by '-1'. Note: 1968-1993 Universe - all persons 1994-2000 Universe - all persons (popstat = 1, 2, or 3) In Jan 1996, the Census revised procedures for editing and allocating the race variable. All "Other" responses were allocated into one of the 4 main race categories. This was done to offset a major increase in "Other" responses which caused severe underestimates of the American Indian and Asian Pacific Islander populations. Due to this, one should use caution when making comparisons between 1995 and 1996 data, especially for those identifying all four race groups. _______________________________________________________________________________ <sex> Sex Original location, length, and name of variable: 68-81 82-88 89-93 94-00 101 101 125 129 1 1 1 2 I30 I30 A-SEX PESEX Type: basic Topic: demography Related variable: asex - allocation flag Code: 68-00 Male 1 Female 2 In 1994 forward, missings are denoted by '-1'. Note: 1968-1993 Universe - all persons 1994-2000 Universe - all persons (popstat = 1, 2, or 3) _______________________________________________________________________________ <rrp> Relationship to reference person Original location, length, and name of variable: 68-81 82-88 89-93 94-00 96 96 116 118 1 1 2 2 I12 I12 A-RRP PERRP Type: basic Topic: family Related variable: arrp - allocation variable Code: 68-88 78K 89-93 94 95-00 Reference person with relatives in HH 1 01 01 01 Reference person with no relatives in HH 2 02 02 02 Husband 03 Wife 3 04 Spouse 03 03 Own child/ child of reference person 1 05 04 04 Grandchild 05 05 Parent 06 06 06 Brother/sister 2 07 07 07 Other relative of reference person 4 3 08 08 08 Foster child 09 09 Nonrel of ref w/own rels in HH (2nd fam) 5 4 09 10 10 Partner/roommate 11 Nonrel of ref-no own rel in HH (2nd ind) 6 5 10 12 12 Unmarried partner with relatives 13 Unmarried partner without relatives 14 Housemate/roommate with relatives 15 Housemate/roommate without relatives 16 Roomer/boarder with relatives 17 Roomer/boarder without relatives 18 Blank or out of range 6 Note: 1968-1969 Universe - all adults 1970-1993 Universe - all persons In 1970-1977, children are "plugged" with a value of 4. In 1978, the child values (78K) are different from the adult. In 1979 forward, children may have values 4-6. The term HEAD becomes REFERENCE PERSON starting in 1989. 1994-2000 Universe - all persons (popstat = 1, 2, or 3) _______________________________________________________________________________ <hhid> Unique household identifier Original location, length, and name of variable: 68-81 82-88 89-93 94 95-00 4 4 102 1 1 12 12 12 12 15 H-ID HRHHID HRHHID Type: basic Topic: record keeping Code: numeric with length of 12 or 15 Note: Universe - all records Per Census: This variable is unique for each household. It will be recycled after the 8 month rotations. It can be broken down as follows: columns 1-2 regional office number columns 3-5 PSU - a geographical division columns 6-9 segment - a smaller geographical division columns 10-12 serial number - unique to a household In 68-73 and 76 the following variables combine to form a unique identifier for each housing unit (note the combined length of the variables is 12 characters, same as hhid variable) columns 1-5 random Random cluster code columns 6 segnum Segment number - rotation number columns 7-9 segnum Segment number - remainder columns 10-11 sernum Serial number columns 12 subhh Subdivided household number In 1995 forward three more columns were added to denote county within state. _______________________________________________________________________________ <subhh> Subdivided household # - generated to subdivide multi-headed hh Original location, length, and name of variable: 68-76 15 1 Type: basic Topic: record keeping Code: 68-76 Not sub divided/not duplicate 0 First of subdivided etc./first of duplicate etc. 1...9 Note: Universe - all records There are households with duplicate random cluster codes (random) and serial numbers (sernum). Random Cluster Code (random), Segment Number (segnum 72+ only), Serial Number (sernum), and Subdivided Household Number (subhh) form a unique identifier for each sample housing unit. In later years these variables are presented as a single variable hhid. _______________________________________________________________________________ <hhnum> Household number Original location, length, and name of variable: 68-81 82-88 89-93 94-00 48 48 6 77 1 1 1 2 I9 I9 H-HHNUM HUHHNUM Type: basic Topic: record keeping Related variable: ahhnum - allocation flag Code: 68-00 Blank .,-1 Household number 1-8 Note: Universe - all households The inital household receives a value of 1, and subsequent replacement households increase the value by 1. (per Census: This variable notes which household is living at this address (house, apartment, etc.). For example if in MIS 1 one family [household] lived here and then in MIS 2 another family moved in, this would be household 2 at this address. As the address is only questioned for 8 months in a row the maximum value is 8.) This is an unedited variable in 1994 forward. Additional valid entries are: -1 (blank) -2 (don't know) -3 (refused) _______________________________________________________________________________ <famrel> Family relationship Original location, length, and name of variable: 84-88 89-93 94-00 408 275 153 1 1 2 FAMREL A-FAMREL PRFAMREL Type: basic Topic: family Code: 84-88 89-00 Not a family member 0 0 Reference person 1 1 Spouse 2 2 Child 3 3 Other relative in primary family 4 Other relative in prim. fam. & unrelated subfam only 4 Note: 1984-1993 Universe - all persons 1994-2000 Universe - all persons (1<=popstat<=3) In 1984-1988, data in columns 385-480 are the result of the new demographic edit. These demographic characteristics are usually consistent with those produced by the basic CPS edit (columns 94-105). Choice of which data set to use should depend on the user's needs. Comparability of BLS's published data or BLS's duplication of Phase II population controls best matches use of the basic CPS edit characteristics which were used in the basic CPS weighting. Users interested in family data or replicating BLS's family data should use the characterostocs produced by the new demographic edit. _______________________________________________________________________________ <_year> Survey year - Unicon created This is a variable that takes its value for year from the request file, not from any column location on the data files. Type: basic Topic: record keeping Code: Four-digit year Note: Universe - all records _______________________________________________________________________________ <grdhi> Highest grade attended Original location, length, and name of variable: 68-81 82-88 89-91 92* 103 103 127 374 2 2 2 2 I18H I18H A-HGA A-S40 Type: basic Topic: education Related variable: agrdhi - allocation flag _educ - education recode, Unicon (68-91) grdcom - completed highest grade attended (68-92) grdatn - educational degree attained (92+) Code: 68-88 89-92* NIU 00 . No response 99 Elementary: 0 or K 01 1 02 2 03 3 04 4 05 5 06 6 07 7 08 8 09 None 00 E1 01 E2 02 E3 03 E4 04 E5 05 E6 06 E7 07 E8 08 High School: 9 10 10 11 11 12 12 13 H1 09 H2 10 H3 11 H4 12 College: 13 14 14 15 15 16 16 17 17 18 18+ 19 C1 13 C2 14 C3 15 C4 16 C5 17 C6+ 18 Note: 1968-1991 Universe - all adult records, including Armed Forces 1992 Universe - adults IN THE SUPPLEMENT* who are not atttending school or college (schatt=2) _______________________________________________________________________________ <grdcom> Completed highest grade attended Original location, length, and name of variable: 68-81 82-88 89-91 92* 105 105 129 376 1 1 1 1 I18I I18I A-HGC A-S41 Type: basic Topic: education Related variable: agrdcom - allocation flag _educ - education recode, Unicon (68-91) grdhi - highest grade attended (68-92) grdatn - educational degree attained (92+) Code: 68-91 92* Yes 1 1 No 2 2 No response 9 Note: 1968-1991 Universe - all adult record, including Armed Forces 1992 Universe - adults IN THE SUPPLEMENT* who are not presently attending or enrolled in regular school (schatt=2) _______________________________________________________________________________ <mis> Month in sample Original location, length, and name of variable: 68-81 82-88 89-93 94-00 2 2 35 63 1 1 1 2 MIS MIS H-MIS HRMIS Type: basic Topic: record keeping Code: 1 - 8 Note: Universe - all records _______________________________________________________________________________ <lineno> Line number of each person Original location, length, and name of variable: 68-81 82-88 89-93 94-00 94 94 114 147 2 2 2 2 I18A I18A A-LINENO PULINENO Type: basic Topic: record keeping Related variable: alineno - allocation flag Code: 68-93 94-00 Line number 01-39 01-99 Note: Universe - all records Note to those interested in matching: This variable is transcribed by the interviewer from rotation to rotation. So this should be a unique identifier for an individual within the household. This is an unedited variable in 1994 forward. Additional valid entries are: -1 (blank) -2 (don't know) -3 (refused) _______________________________________________________________________________ <famnum> Family number (ID) within household Original location, length, and name of variable: 84-88 89-93 94-00 405 272 151 2 2 2 FAMNUM A-FAMNUM PRFAMNUM Type: basic Topic: record keeping Code: 84-88 89-00 Not a family member 00 00 Primary family member only 01 01 Member of subfamily # 02-39 02-19 Note: 1984-1993 Universe - all persons 1994-2000 Universe - all persons (popstat = 1, 2, or 3) In 1984-1988, data in columns 385-480 are the result of the new demographic edit. These demographic characteristics are usually consistent with those produced by the basic CPS edit (columns 94-105). Choice of which data to use should depend on the user's needs. Comparability of BLS's published data or BLS's duplication of Phase II population controls best matches use of the basic CPS edit characteristics which were used in the basic CPS weighting. Users interested in family data or replicating BLS's family data should use the characteristics produced by the new demographic edit. _______________________________________________________________________________ <wgtfnl> Final weight (*100 or *10000) Original location, length, and name of variable 68-76 77 78-88 89-93 94-00 121 126 121 248 613 12 7 12 8 10 A-FNLWGT PWSSWGT Type: basic Topic: record keeping Code: 68-93 2 implied decimals, right justified, space filled 94-00 4 implied decimals For noninterview A records Regular 'Type A' 1 Subsamples 2-4 For noninterview B/C records Regular 'Type B/C' 1 Subsamples 2-4 The Census does not provide decimal points in their data. USER MUST DIVIDE BY 100 OR 10000. Note: 1968-1993 Universe - all persons In both 1979 and 1980 the final weight variable is written over in this location (121) by the October supplement weight (wgt). However, the documentation in some years indicates that the supplement weight is equal to the final weight so Unicon has assigned the value to both wgtfnl and wgt. It is up to the user to determine whether to use it as the wgtfnl. 1981 is a transitional year; there are 2 final weight variables in 1981. The one in this position (121) is 1970 based, and the other one (wgtfnl80), column 589, is 1980 based. In subsequent years, the final weight variable is 1980 based. In 87-88, the Armed Forces record indicatates this field as being 'all fill'. Used for most tabulations, controlled to independent estimates for 1) states 2) origin, sex, and age and 3) age, race, and sex (source for this comment is the 1994 manual). 1994-2000 Universe - all persons (1<=popstat<=3) _______________________________________________________________________________ <grdatt> Grade attending Original location, length, and name of variable 68-82 83 84 85 86 87 88 89 423 453 507 730 483 484 484 363 2 2 2 2 2 2 2 2 I36 I31 I58 I31 I32 I32 I32 A-S32 90 91 92 93 94-97 98-00 363 364 364 364 819 861 2 2 2 2 2 2 A-S32 A-S32 A-S32 A-S32 PEGRADE PEGRADE Type: supplement Topic: education Related variable: chgrd - grade attending (child 84+) agrdatt - allocation flag Code: 68-86 87-93 94-00 Adult: E1 Grade 1 01 01 01 E2 Grade 2 02 02 02 E3 Grade 3 03 03 03 E4 Grade 4 04 04 04 E5 Grade 5 05 05 05 E6 Grade 6 06 06 06 E7 Grade 7 07 07 07 E8 Grade 8 08 08 08 H1 High school 1 09 09 09 H2 High school 2 10 10 10 H3 High school 3 11 11 11 H4 High school 4 12 12 12 C1 College 1(freshman) 13 13 13 C2 College 2(sophomore) 14 14 14 C3 College 3 (junior) 15 15 15 C4 College 4 (senior) 16 16 16 C5 College 5(grad yr 1) 17 17 17 C6+ College 6+(grad yr 2+) 18 18 18 Special school 19 19 NIU . 99 -1 No response 20 Child: Nursery - full day 01 Nursery - part day 02 Kindergarten - full day 03 Kindergarten - part day 04 E1 Grade 1 05 E2 Grade 2 06 E3 Grade 3 07 E4 Grade 4 08 E5 Grade 5 09 E6 Grade 6 10 E7 Grade 7 11 E8 Grade 8 12 H1 High school 1 13 H2 High school 2 14 H3 High school 3 15 H4 High school 4 16 Special school 17 NIU . No response 20 Note: 1968-1998 Universe - all those attending school (schatt=1) In 68-83 this same variable appears on adult and child records. In 84 forward see chatt for child information. 1999-2000 Universe - adult civilians (age>=15 & popstat=2) _______________________________________________________________________________ <intstat> Household interview status Original location, length, and name of variable: 89-93 94-00 34 57 1 2 H-HHTYPE HRINTSTA Type: basic Topic: record keeping Code: 89-93 94-00 Interview 1 1 Noninterview type A 2 2 Noninterview type B/C 3 Noninterview type B 3 Noninterview type C 4 Note: Universe - all households _______________________________________________________________________________ <rectyp> Record type Original location, length, and name of variable: 68-81 82-88 89-93 1 1 101 1 1 1 INTTYPE INTTYPE H-RECTYP Type: basic Topic: record keeping Code: 68-83 84-93 Interviewed adult 1 1 Type A noninterview 2 2 Type B/C noninterview 3 3 Armed forces record 4 Child record 1 5 Note: Universe - all records Record type 1 includes children in 1968-1983. IMPORTANT: In 1994+, all records have the same layout, therefore there is no rectyp variable in the file. To select adult interview records use the variables intstat=1 and age>=15. _______________________________________________________________________________ <famwgt> Family weight (*100 or *10000) Original location, length, and name of variable: 84-88 89-93 94-00 445 294 583 12 8 10 FAMWGT A-FAMWGT PWFMWGT Type: basic Topic: record keeping Code: 84-93 94-00 Decimal points implied 2 4 Right justified, space filled The Census does not provide decimal points in their data. USER MUST DIVIDE BY 100 OR 10000. Note: 1984-1993 Universe - all persons 1994-2000 Universe - all persons (1<=popstat<=3) Only used for tallying family characteristics In 1984-1988, data in columns 385-480 are the result of the new demographic edit. These demographic characteristics are usually consistent with those produced by the basic CPS edit (columns 94-105) but are not necessarily identical. Choice of which data set to use should depend on the user's needs. Comparability of BLS's published data or duplication of Phase II population controls best matches use of the basic CPS edit characteristics which were used in the basic CPS weighting. Users interested in family data or replicating BLS's family data should use the characteristics produced by the new demographic edit. _______________________________________________________________________________ <hhwgt> Household weight (*100 or *10000) Original location, length, and name of variable: 89-93 94-00 49 47 9 10 H-HHWGT HWHHWGT Type: basic Topic: record keeping Code: 89-93 94-00 2 implied decimal points 4 implied decimals The Census does not provide decimal points in their data. USER MUST DIVIDE BY 100 OR 10000. Note: 1989-1994 Universe - all households 1995-2000 Universe - all interviewed households (intstat=1) Final household weight equivalent to the weight of the wife in husband-wife households and the reference person in all other households. Used for tallying household characteristics _______________________________________________________________________________ <agrdatn> Allocation flag: grdatn Original location, length, and name of variable: 92-93 94-00 318 671 1 2 A%HGA PXEDUCA Type: basic Topic: education item allocation flag Related variable: grdatn - educational attainment Code: 92-93 94-00 No change 0 Value to blank 1 50 Blank to value 2 11 Value to value 3 10 Allocated 4 Value to value - no error 5 Refusal to value, allocated - no error 6 Blank to NA - no error 7 Blank to NA - error 8 Value - no change 00 Blank - no change 01 Don't know - no change 02 Refused - no change 03 Don't know to value 12 Refused to value 13 Value to longitudinal value 20 Blank to longitudinal value 21 Don't know to longitudinal value 22 Refused to longitudinal value 23 Value to allocated value longitudinal 30 Blank to allocated value longitudinal 31 Don't know to allocated value long 32 Refused to allocated value longitudinal 33 Value to allocated value 40 Blank to allocated value 41 Don't know to allocated value 42 Refused to allocated value 43 Don't know to blank 52 Refused to blank 53 Note: Universe - all adults, including Armed Forces _______________________________________________________________________________ <agrdhi> Allocation flag: grdhi Original location, length, and name of variable: 68-81 82-88 89-91 253 325 318 1 1 1 I32 I18H A%HGA Type: basic Topic: education item allocation flag Related variable: grdhi - highest grade attended Code: 68-88 89-91 Not allocated 0 No change 0 Allocated 1 4 Note: Universe - all adults (including Armed Forces 89+) _______________________________________________________________________________ <state> Census state code Original location, length, and name of variable: 68-81 82-88 89-93 94-00 17 17 79 91 2 2 2 2 MST-STATE MST-STATE HG-ST60 GESTCEN Type: basic Topic: geography Code: 68-00 Appendix A Note: Universe - all households First digit of state code is geographic division code: e.g., Northeast Region (Region 1) New England Division (Div. 1) In 1968-1993, the codes are 1960 based. _______________________________________________________________________________ <spneth> Spanish ethnicity Original location, length, and name of variable: 73-88 89-93 94-00 155 194 141 1 2 2 I33 A-REORGN PRORIGIN Type: basic Topic: demography Related variable: aspneth - allocation flag Code: 73-88 89-00 Mexican American 1 01 Chicano 2 02 Mexican (Mexicano) 3 03 Puerto Rican 4 04 Cuban 5 05 Central/South American 6 06 Other Spanish 7 07 All other 8 08 Don't know 9 09 Not answered A 10 Note: Universe - all adult records This is a character variable in 1973-1988. _______________________________________________________________________________