CPS Utilities Extraction Report April 10, 2005, 4.40 PM Source: Marriage and Fertility 1977 June CPS Request file name ==> c:\cpsworking\jdat71_78.req Source file name ==> e:\data\jun77.z Stata dataset file name ==> c:\cpsworking\jdat71_78\jun77.dta This report file name ==> c:\cpsworking\jdat71_78\jun77.rpt Warning: 'grdatn' is not available for 1977. Warning: 'subhh' is not available for 1977. Warning: 'famnum' is not available for 1977. Warning: 'famrel' is not available for 1977. Warning: 'hhwgt' is not available for 1977. Warning: 'intstat' is not available for 1977. Warning: 'sernum' is not available for 1977. Warning: 'segnum' is not available for 1977. Warning: 'random' is not available for 1977. Number of records in source file is 129402. Number of records read is 129402. Number of records written is 129402. Value summaries: Variable Observations Minimum Maximum Mean recnum 129402 1 129402 64701.50 age 116960 14 99 40.32 marstat 116960 1 5 2.48 grdhi 116960 1 19 12.64 grdcom 116960 1 2 1.24 sex 116960 1 2 1.53 race 116960 1 3 1.14 rrp 116960 1 6 2.50 hhid "char" variable, cannot be summarized. hhnum 129367 1 6 1.08 _year 129402 1977 1977 1977.00 mis 129402 1 8 4.50 lineno 116960 0 39 1.94 wgtfnl 129402 0 871767 127219.35 rectyp 129402 1 3 1.17 state 129402 11 95 52.36 spneth "char" variable, cannot be summarized. Options selected: Include labels in Stata dataset Create Stata dataset file directly Description of selected variables: <recnum> Unique record ID number - Unicon created Each observation is assigned a unique id number, facilitating later merges with the same file should the user wish to extract additional variables at a later time. The maximum value of this variable is the total record count from each file. There is no corresponding variable to be found in the census files; therefore, no column numbers are listed here. Topic: record keeping Code: Year Total Count Year Total Count 1971 105,914 1973 110,873 1974 109,818 1975 98,806 1976 109,173 1977 129,402 1979 160,921 1980 188,201 1981 170,359 1982 169,440 1983 168,015 1984 166,055 1985 165,471 1986 165,201 1987 163,541 1988 152,460 1990 164,200 1991 161,938 1992 159,339 1994 153,572 1995 154,299 1998 134,996 2000 135,488 Note: Universe - all records 1975 does not have any noninterview records on the file. _______________________________________________________________________________ <age> Age of person as of end of survey week Original location, length, and name of variable: 71 73-77 79-83 84-88 90-92 94-95 98 00 97 97 97 97 120 122 122 122 2 2 2 2 2 2 2 2 I27 I27 I18C I18D A-AGE PEAGE PEAGE PEAGE Topic: demography Related variable: aage - allocation flag agetop - topcode Code: 71-77 79-85 86-88 90-00 Adult age 14-99 14-99 14-90 15-90 Child age 0-13 0-13 0-14 In 1994 forward, missings are denoted by '-1'. Note: Universe - all records 1986-2000 are topcoded at 90. _______________________________________________________________________________ <marstat> Marital status Original location, length, and name of variable: 71 73-77 79-83 84-88 90-92 94-95 98 00 99 99 99 99 122 159 159 159 1 1 1 1 1 2 2 2 I28 I28 I18D I18E A-MARITL PRMARSTA PRMARSTA PRMARSTA Topic: demography Related variable: marsta2 - marital status, new grouping 94+ amarstat - allocation flag Code: 71-88 90-00 Married, civilian spouse present 1 1 Married, AF spouse present 2 2 Married - spouse abs (inc separated) 3 Married - spouse abs (exc separated) 3 Widowed 4 4 Divorced 4 5 Separated 3 6 Never married 5 7 NIU (94+) -1 Note: Universe - all adult records, including Armed Forces (2<=popstat<=3) The questionnaires in 1984-1988 say 'Married, spouse absent excluding separated' yet the variable code during these years includes separated in value 3. CAUTION: PEMARITL is the original edited variable in 1994 forward. It has 6 categories and we have called it marsta2. PRMARSTA is the recoded variable in these years. Since it has 7 categories, it is a better match with the earlier years. We have labeled it marstat. _______________________________________________________________________________ <grdhi> Highest grade attended Original location, length, and name of variable: 71 73-77 79-83 84-88 90-91 103 103 103 103 127 2 2 2 2 2 I31 I31 I18F I18H A-HGA Topic: education Related variable: agrdhi - allocation flag Code: 71-88 90-91 None 01 00 E1 02 01 E2 03 02 E3 04 03 E4 05 04 E5 06 05 E6 07 06 E7 08 07 E8 09 08 H1 10 09 H2 11 10 H3 12 11 H4 13 12 C1 14 13 C2 15 14 C3 16 15 C4 17 16 C5 18 17 C6+ 19 18 Note: Universe - all adult records, including Armed Forces _______________________________________________________________________________ <grdcom> Completed highest grade attended Original location, length, and name of variable: 71 73-77 79-83 84-88 90-91 105 105 105 105 129 1 1 1 1 1 I33 I32 I18G I18I A-HGC Topic: education Related variable: agrdcom - allocation flag Code: 71-91 Yes 1 No 2 Note: 1971-1988 Universe - all adult record 1990-1991 Universe - all adult record, including Armed Forces _______________________________________________________________________________ <grdatn> Educational attainment Original location, length, and name of variable: 92 94-95 98 00 127 137 137 137 2 2 2 2 A-HGA PEEDUCA PEEDUCA PEEDUCA Topic: education Related variable: agrdatn - allocation flag Code: 92-00 None or children 00 Less than 1st grade 31 1st, 2nd, 3rd or 4th grade 32 5th or 6th grade 33 7th or 8th grade 34 9th grade 35 10th grade 36 11th grade 37 12th grade-no diploma 38 High school graduate - diploma or GED, etc. 39 Some college but not degree 40 Associate's college degree - occ/voc program 41 Associate's degree in college - academic program 42 Bachelor's degree in college - BA, BS, AB, etc. 43 Master's degree - MA, MS, MBA etc. 44 Professional school degree - MD, DDS, DVM, etc. 45 Doctorate degree - PhD, EdD, etc. 46 NIU -1 Note: 1992 Universe - all adult records, including Armed Forces 1994-2000 Universe - all adult records (2<=popstat<=3) _______________________________________________________________________________ <sex> Sex Original location, length, and name of variable: 71 73-77 79-83 84-88 90-92 94-95 98 00 101 101 101 101 125 129 129 129 1 1 1 1 1 2 2 2 I30 I30 I18E I18G A-SEX PESEX PESEX PESEX Topic: demography Related variable: asex - allocation flag Code: 71-00 Male 1 Female 2 In 1994 forward, missings are denoted by '-1'. Note: 1971-1992 Universe - all persons 1994-2000 Universe - all persons (1<=popstat<=3) _______________________________________________________________________________ <race> Race Original location, length, and name of variable: 71 73-77 79-83 84-88 90-92 94-95 98 00 100 100 100 100 130 139 139 139 1 1 1 1 1 2 2 2 I29 I29 I18H I18J A-RACE PERACE PERACE PERACE Topic: demography Related variable: arace - allocation flag Code: 71-88 90-95 98-00 White 1 1 1 Black 2 2 2 Amer Indian, Aleut Eskimo 3 3 Asian/Pacific Islander 4 4 Other 3 5 In 1994 forward, missings are denoted by '-1'. Note: 1971-1992 Universe - all persons 1994-2000 Universe - all persons (1<=popstat<=3) In Jan 1996, the Census revised procedures for editing and allocating the race variable. All "Other" responses were allocated into one of the 4 main race categories. This was done to offset a major increase in "Other" responses which caused severe underestimates of the American Indian and Asian Pacific Islander populations. Due to this, one should use caution when making comparisons between 1995 and 1996 data, especially for those identifying all four race groups. _______________________________________________________________________________ <rrp> Relationship to reference person Original location, length, and name of variable: 71 73-77 79-88 90-92 94-95 98 00 96 96 96 116 118 118 118 1 1 1 2 2 2 2 I26 I26 I18B A-RRP PERRP PERRP PERRP Topic: family Related variable: arrp - allocation flag rrpold - earlier definition of rrp applied to later years Code: 71-88 90-93 94-95 98-00 Reference person with relatives in HH 1 1 1 1 Reference person with no relatives in HH 2 2 2 2 Husband 3 Wife 3 4 Spouse 3 3 Own child/ child of reference person 5 4 4 Grandchild 5 5 Parent 6 6 6 Brother/sister 7 7 7 Other relative of reference person 4 8 8 8 Foster child 9 9 Nonrel of ref own rels in HH (2nd fammem) 5 9 10 10 Partner/roommate 11 Nonrel of ref-no own rel in HH (2nd ind) 6 10 12 12 Unmarried partner with relatives 13 Unmarried partner without relatives 14 Housemate/roommate with relatives 15 Housemate/roommate without relatives 16 Roomer/boarder with relatives 17 Roomer/boarder without relatives 18 Note: 1971-1992 Universe - all persons In 1979-1988, children 0-13 years old may have values 4-6. The term HEAD becomes REFERENCE PERSON starting in 1989. 1994-2000 Universe - all persons (1<=popstat<=3) _______________________________________________________________________________ <hhid> Unique household identifier Original location, length, and name of variable: 71-74 76-77 78-83 84-88 90-92 94-95 98 00 4 4 4 4 102 1 1 1 12 12 12 12 12 12 15 15 IDENT-NUM IDENT-NUM IDENT-NUM H-ID HRHHID HRHHID HRHHID Topic: record keeping Code: numeric with length of 12 or 15 Note: Universe - all households Per Census: This variable is unique for each household. It will be recycled after the 8 month rotations. It can be broken down as follows: columns 1-2 regional office number columns 3-5 PSU - a geographical division columns 6-9 segment - a smaller geographical division columns 10-12 serial number - unique to a household In 1971, the following variables are used: columns 1-3 Scrambled PSU number columns 4-7 sernum Serial number columns 8-12 padding In 1973-1974, the following variables combine to form a unique identifier for each housing unit (note the combined length of the variables is 12 characters, same as hhid variable) In 1975, the sernum and subhh variables are not in the 10th-12th columns. They are located elsewhere on the record. Therefor, hhid is not available in 1975 (unless made by user). In 1977 forward, the same 12 columns are documented as 'Household ID number' with no mention of the subparts. In 1976 the 12 columns are listed by the subparts, not as a whole. columns 1-5 random Random cluster code columns 6 segnum Segment number - rotation number columns 7-9 segnum Segment number - remainder columns 10-11 sernum Serial number columns 12 subhh Subdivided household number In 1996 forward, three more columns were added to denote county within state. _______________________________________________________________________________ <hhnum> Household number Original location, length, and name of variable: 71 73-77 79-88 90-92 94-95 98 00 48 48 48 6 77 77 77 1 1 1 1 2 2 2 I9 I9 I9 H-HHNUM HUHHNUM HUHHNUM HUHHNUM Topic: record keeping Related variable: ahhnum - allocation flag Code: 71-00 Household number 1-8 Note: Universe - all households The inital household receives a value of 1, and subsequent replacement households increase the value by 1. (per Census: This variable notes which household is living at this address (house, apartment, etc.). For example if in MIS 1 one family [household] lived here and then in MIS 2 another family moved in, this would be household 2 at this address. As the address is only questioned for 8 months in a row the maximum value is 8.) This is an unedited variable in 1994 forward. Additional valid entries are: -1 (blank) -2 (don't know) -3 (refused) _______________________________________________________________________________ <subhh> Subdivided household # - generated to subdivide multi-headed hh Original location, length, and name of variable: 71 73-74 75 76 107 15 107 15 1 1 1 1 SUBDIVHH SUBHHNO SUBHH SUBHHNO Topic: record keeping Code: 71-76 Not sub divided/not duplicate 0 First of subdivided etc./first of duplicate etc. 1-9 Note: Universe - all records 1971 There are households with duplicate PSU and serial numbers, which together form the household id (hhid). This id combined with Subdivided Household Number (subhh) form a unique identifier for each sample housing unit. 1973-1976 There are households with duplicate random cluster codes (random), segment (segnum), and serial numbers (sernum). Random Cluster Code (random), Segment Number (segnum), Serial Number (sernum), and Subdivided Household Number (subhh) form a unique identifier for each sample housing unit. _______________________________________________________________________________ <famnum> Family number (ID) within household Original location, length, and name of variable: 84-88 90-92 94-95 98 00 405 272 151 151 151 2 2 2 2 2 FAM-NUM A-FAMNUM PRFAMNUM PRFAMNUM PRFAMNUM Topic: record keeping Code: 84-88 90-00 Not a family member 00 00 Primary family member only 01 01 Member of subfamily # 02-39 02-19 Member of subfamily # 02-39 02-19 NIU -1 Note: 1984-1992 Universe - all persons 1994-2000 Universe - all persons (1<=popstat<=3) In 1984-1988, data in columns 385-480 are the result of the new demographic edit. These demographic characteristics are usually consistent with those produced by the basic CPS edit (columns 94-105). Choice of which data to use should depend on the user's needs. Comparability of BLS's published data or BLS's duplication of Phase II population controls best matches use of the basic CPS edit characteristics which were used in the basic CPS weighting. Users interested in family data or replicating BLS's family data should use the characteristics produced by the new demographic edit. _______________________________________________________________________________ <famrel> Family relationship Original location, length, and name of variable: 84-88 90-92 94-95 98 00 408 275 153 153 153 1 1 2 2 2 FAM-REL A-FAMREL PRFAMREL PRFAMREL PRFAMREL Topic: family Code: 84-88 90-00 Not a family member 0 0 Reference person 1 1 Spouse 2 2 Child 3 3 Other relative in primary family 4 Other relative (prim fam & unrelated subfam only) 4 NIU -1 Note: 1984-1992 Universe - all persons 1994-2000 Universe - all persons (1<=popstat<=3) In 1984-1988, data in columns 385-480 are the result of the new demographic edit. These demographic characteristics are usually consistent with those produced by the basic CPS edit (columns 94-105). Choice of which data set to use should depend on the user's needs. Comparability of BLS's published data or BLS's duplication of Phase II population controls best matches use of the basic CPS edit characteristics which were used in the basic CPS weighting. Users interested in family data or replicating BLS's family data should use the characterostocs produced by the new demographic edit. _______________________________________________________________________________ <hhwgt> Household weight (*100 or *10000) Original location, length, and name of variable: 90-93 94-95 98 00 49 47 47 47 9 10 10 10 H-HHWGT HWHHWGT HWHHWGT HWHHWGT Topic: record keeping Code: 90-93 94-00 Decimal points implied 2 4 NIU -1 The Census does not provide decimal points in their data. USER MUST DIVIDE BY 100 OR 10000. Note: 1990-1995 Universe - all households 1998-2000 Universe - all interviewed households (intstat=1) _______________________________________________________________________________ <_year> Survey year - Unicon created This is a variable that takes its value for year from the request file, not from any column location on the data files. The values are all numeric except. Topic: record keeping Code: Four-digit year Note: Universe - all records _______________________________________________________________________________ <mis> Month in sample Original location, length, and name of variable: 71 73-88 90-92 94-95 98 00 2 2 35 63 63 63 1 1 1 2 2 2 MIS MIS H-MIS HRMIS HRMIS HRMIS Topic: record keeping Code: 1 - 8 Note: Universe - all households _______________________________________________________________________________ <lineno> Line number of each person Original location, length, and name of variable: 71 73-77 79-88 90-92 94-95 98 00 94 94 94 114 147 147 147 2 2 2 2 2 2 2 I25 I25 I18A A-LINENO PULINENO PULINENO PULINENO Topic: record keeping Related variable: alineno - allocation flag Code: 71-92 94-00 Range 01-39 01-99 Note: Universe - all persons Note to those interested in matching: This variable is transcribed by the interviewer from rotation to rotation. So this should be a unique identifier for an individual within the household. This is an unedited variable in 1994 forward. Additional valid entries are: -1 (blank) -2 (don't know) -3 (refused) _______________________________________________________________________________ <wgtfnl> Final weight (*100 or *10000) Original location, length, and name of variable 71 73-77 79-88 90-92 94-95 98 00 121 121 121 248 613 613 613 12 12 12 12 10 10 10 FINALWGT FINALWGT FINALWGT A-FNLWGT PWSSWGT PWSSWGT PWSSWGT Topic: record keeping Code: 71-92 94-00 Implied decimals 2 4 NIU -1 For noninterview A records (71-92) Regular 'Type A' 1 Subsamples 2-4 For noninterview B/C records (71-92) Regular 'Type B/C' 1 Subsamples 2-4 The Census does not provide decimal points in their data. USER MUST DIVIDE BY 100 OR 10000. Note: 1971-1992 Universe - all persons 1994-2000 Universe - all persons (1<=popstat<=3) Used for most tabulations, controlled to independent estimates for 1) states; 2) origin, sex, and age; and 3) age, race, and sex. _______________________________________________________________________________ <rectyp> Record type Original location, length, and name of variable: 71 73-77 79-88 90-92 1 1 1 101 1 1 1 1 RECTYP RECTYP RECTYP H-RECTYP Topic: record keeping Code: 71-83 84-92 Interviewed adult 1 1 Type A noninterview 2 2 Type B/C noninterview 3 3 Armed forces record 4 Child record 1 5 Note: Universe - all records Record type 1 includes children in 1979-1983. In 1971 & 1975, only record types 1 are on the file. IMPORTANT: In 1994+, all records have the same layout, therefore there is no rectyp variable in the file. To select adult interview records use the variables intstat=1 and age>=15. _______________________________________________________________________________ <intstat> Household interview status Original location, length, and name of variable: 90-92 94-95 98 00 34 57 57 57 1 2 2 2 H-HHTYPE HRINTSTA HRINTSTA HRINTSTA Topic: record keeping Code: 90-92 94-00 Interview 1 1 Noninterview type A 2 2 Noninterview type B/C 3 Noninterview type B 3 Noninterview type C 4 Note: Universe - all households _______________________________________________________________________________ <sernum> Serial number Original location, length, and name of variable: 71 73-74 75 76 7 13 25 13 4 2 2 2 SERIALNO SERIALNO SERIALNO SERIALNO Topic: record keeping Code: 71 73-76 Range 1000-8999 0-99 Note: Universe - all records This is a 'household designator within PSU, SEGMENT, and SAMPLE group.' (per Census manual 1975) Random Cluster Code (random), Segment Number (segnum), Serial Number (sernum), and Subdivided Household Number (subhh) form a unique identifier for each sample housing unit. _______________________________________________________________________________ <segnum> Segment number Original location, length, and name of variable: 73-76 9 4 SEGMENT-NO Topic: record keeping Code: 73-76 Range 1000-8999 Note: Universe - all records In 1973 through 1976 the code has four digits rather than three. The fourth digit, thousand's digit is the rotation number. Random Cluster Code (random), Segment Number (segnum), Serial Number (sernum), and Subdivided Household Number (subhh) form a unique identifier for each sample housing unit. _______________________________________________________________________________ <random> Random cluster code Original location, length, and name of variable: 71-76 4 5 RANDOM CLUSTER CODE Topic: record keeping Code: See notes. Note: Universe - all records Random cluster code replaces PSU in earlier years (Primary sampling unit). Random Cluster Code (random), Segment Number (segnum), Serial Number (sernum), and Subdivided Household Number (subhh) form a unique identifier for each sample housing unit. _______________________________________________________________________________ <state> Census state code Original location, length, and name of variable: 71 73-74 76-77 79-88 90-92 94-95 98 00 17 17 17 17 79 91 91 91 2 2 2 2 2 2 2 2 STATE MST-STATE MST-STATE MST-STATE HG-ST60 GESTCEN GESTCEN GESTCEN Topic: geography Code: 71-00 Appendix A Note: Universe - all households First digit of state code is geographic division code: e.g., Northeast Region (Region 1) New England Division (Div. 1) In 1971-1992, the codes are 1960 based. _______________________________________________________________________________ <spneth> Spanish ethnicity Original location, length, and name of variable: 73-77 79-83 84-88 90-92 94-95 98 00 155 155 155 194 141 141 141 1 1 2 2 2 2 2 ITEM33 ITEM18I ITEM18K A-REORGN PRORIGIN PRORIGIN PRORIGIN Topic: demography Related variable: aspneth - allocation flag Code: 73 74-88 90-00 Mexican American 1 1 1 Chicano 2 2 2 Mexican (Mexicano) 3 3 3 Puerto Rican 4 4 4 Cuban 5 5 5 Central/South American 6 6 6 Other Spanish 7 7 7 All other 8 8 8 Don't know 9 9 Not answered 9 A 10 NIU -1 Note: 1973-1979 Universe - all adult records 1980-2000 Universe - all records This is a character variable in 1974-1988. 1994 has an undefined top value of 13. _______________________________________________________________________________