Return to CDE Home Page Return to CDE Home Page Return to CDE Home Page

University of Wisconsin-Madison
Center for Demography and Ecology


Home > Data ResourcesData Extractors

About CDE Info Services

Print Library Resources
MADCAT
Electronic Library
Popline
JSTOR

Data Library Resources
CDE Public FTP Site
Data Library Catalog

CDERR (Current DEmographic Research Reports)
Subscribe to CDERR
Current/Past CDERR
Reports

Demographic Resources (Organizations, Internet, etc.)

Reference Resources (Directories, Internet search, writing, etc.)

Other Resources (UW, Travel, etc.)

CDE Info Homepage

CDE Homepage

Data Extractors | Data Archives

Social Science Data Extractors

Contents: (Note: Please wait for the entire page to load before clicking on any of the contents.) This page describes a selection of web sites that allow the user to quickly extract social science data. Either raw data, summary data, or both can be extracted. Various downloading options (data format, type of Internet access protocol, supporting documentation, etc.) are supported by various systems. Extraction systems can be interactive web sites, software systems that must be downloaded to the PC and installed there, or both. In some cases, interactive graphing and mapping are available as well. Some sites allow downloading of full data sets that underlay the extraction system.     

US Population

Decennial Census Summary Numbers
Population Estimates
Decennial Census Public Use Microdata Samples (PUMS)

International Population, Vital Statistics, and Socio-Economic Estimates and Projections

Microdata and Longitudinal Surveys

Multiple Study Sites
Specific Study Sites

Vital Statistics and Health
Education
Justice and Crime
Poverty, Welfare, Income, and Employment
Macroeconomic Indicators
Miscellaneous Multi-Subject Sites

US Population

Decennial Census Summary Numbers

Geographic Visualization of Census Data [Back to Contents]

Population Estimates

  • Census State Population Projections by State, Age, Sex, Race, and Hispanic Origin: 2004-2050--CDC Wonder

  • (http://wonder.cdc.gov/population.html)
    Center for Disease Control's Wonder extractor allows the user to pick geographies (down to the county level), race, gender, age, and time for the estimates database. Geographies, times, and demographics can be picked for the projection database. Users can create a two dimensional table using any of five variables in each database. Download options are available.  Note that users must login at the main site before going to the population estimates (Census) and projection sites. Populations are provided by the Census Bureau. 
  • Post 1990 Population Estimates for US Counties & Higher Level Geographies by Age, Race, Sex and Hispanic Origin thru 1999--University of Missouri-St. Louis
    (http://www.oseda.missouri.edu/uic/uicapps/agersex.html)
    This system accesses the most recent county level population estimates from the U.S. Census Bureau with detail by age (year cohorts through 85+), race, Hispanic origin (for total and white race only) and sex. These data are reported for July 1 of each year from 1990 to 1999. Users can pick geographies, age, race, sex, year, and sort variables. Download options are available.
  • [Back to Contents]


    Decennial Census Public Use Microdata Samples (PUMS)

    • 1990 Census Public Use Microdata Samples (PUMS)--University of Virginia

    • (http://fisher.lib.virginia.edu/pums)
      The University of Virginia Geospatial and Statistical Data Center provides separate extraction systems for a customized subset of data, descriptive statistics, and cross tabulations from the 1990 1% PUMS files for all states. Users pick geographies, variables, type of subsampling, and table dimensions (for cross tabs). Data files are returned as both comma delimited ASCII and SPSS portable files, along with variable lists and an electronic codebook. Several descriptive statistics are available.
    • IPUMS--University of Minnesota History Department

    • (http://www.ipums.umn.edu/)
      IPUMS contains "high precision" samples drawn from the 1850-1990 censuses. It assigns uniform codes across the samples. Users can pick geographies, variables, sample sizes, and cases. Output can be accessed in raw or compressed form, with a customized codebook and SPSS data definition statements. Note that free registration is required to use the extraction system.
    [Back to Contents]

    International Population, Vital Statistics, and Socio-Economic Estimates and Projections

    [Back to Contents]

    Microdata and Longitudinal Surveys

    Multiple Study Sites

    • FERRET--Bureau of Labor Statistics and Census Bureau
      (http://dataferrett.census.gov/)
      DataFerrett Browser download:
      (http://dataferrett.census.gov/TheDataWeb/index.html)
      Ferret provides interactive access to all major CPS (Current Population Surveys) and supplements as far back as 1992 (years vary by supplements), the 1992, 1993, and available 1996 Survey of Income and Program Participation (SIPP), and the 1997 Survey of Program Dynamics, in addition to selected health related surveys (see below). Selected data (raw or SAS data sets) or descriptive statistics can be accessed. Download options are available. Users must login before using the system.
    • [Back to Contents]


      Specific Study Sites

      [Back to Contents]

      Vital Statistics and Health

      • CDC Wonder--Centers for Disease Control

      • (http://wonder.cdc.gov/)
        Among the useful vital and health related statistical data sets Wonder provides extraction for are: AIDS Cases Reported by State and Local Health Departments (users can pick demographics, case-definitions, dates of diagnosis, dates of report, HIV exposure group, and mortality); Microfiche AIDS (same as above except users can also pick more detailed geographies);  SEER (Cancer Surveillance, Epidemiology and End Results) (users can pick geographies, demographics, time periods and disease codes);  ICD9 Finder (disease by classification number) (users can search by keyword); State Injury Mortality Data (users can pick geographies and injury type); Mortality (users can pick geographies, demographics, and time periods); Natality (users can pick geographies, demographics, and natality variables, and create two dimensional tables by any of nine variables); Sexually Transmitted Disease Morbidity (users can pick geographies, times, genders, and diseases); and Tuberculosis Surveillance (users can pick geographies, times, demographics, and disease case characteristics). Note that time periods covered vary by database. Download options are available. Wonder also hosts many bibliographic databases. Users must login at the main site before accessing data. 
      • Cancer Query System--Surveillance, Epidemiology, and End Results (SEER)--National Cancer Institute

      • (http://www.seer.cancer.gov/query/)
        NCI's Cancer Query System on the Web (CANQUES, available only to browsers that support Java 1.1), "allows the user to access over 10 million pre-calculated cancer statistics. Statistics are available from SEER Cancer Statistics Reviews, 1973-2000. Users can retrieve data related to: SEER Incidence Rates and Trends; US Mortality Rates and Trends; Individual State Mortality Rates; SEER Mortality Rates; Median Age at Diagnosis and Death; NHL and Kaposi's Sarcoma in San Francisco; and Relative Survival Rates by SEER Registry and Historic Stage. Users can pick demographics, geographies (when available), types of cancers, and time periods. Download options are available. 
      • FERRET--Bureau of Labor Statistics and Census Bureau

      • (http://dataferrett.census.gov/)
        FERRET provides access to the 1993 National Health Interview Survey (NHIS) and the National Health and Nutrition Examination Survey III (NHANES 1988-1994). Users can pick variables and selected values. Selected data (raw or SAS data sets) or descriptive statistics can be downloaded. Download options are available. Note that users must register before accessing data.
      • Health Care Cost and Utilization Project (HCUPnet)--HHS Agency for Health Care Research and Quality

      • (http://hcup.ahrq.gov/HCUPnet.asp)
        HCUPnet provides interactive access to national statistics about hospital stays. Users can choose: diagnoses and procedures;  outcomes and measures; patient characteristics; and hospital characteristics. Data are gathered from the latest Nationwide Inpatient Sample. 
      • HIV/AIDS Surveillance Database--Census Bureau

      • (http://www.census.gov/ftp/pub/ipc/www/hivaidsn.html)
        HIV/AIDS Surveillance Database "is a compilation of information from widely scattered small scale surveys on the AIDS pandemic and HIV seroprevalence in developing countries. Currently the database contains around 40,000 individual data records from over 4,000 publications and presentations. The database also includes information from incidence studies." This extraction system requires downloading and installing the database on a local PC. Users can pick geographies from over 160 countries, population subgroup, age and sex. Summary tables and maps are also available. 
      • Substance Abuse and Mental Health Data Archive (SAMHDA) Data Analysis System (DAS)--ICPSR

      • (http://www.icpsr.umich.edu/SAMHDA/das.html)
        Health related surveys covered in the SAMHDA DAS include the National Household Survey on Drug Abuse, Treatment Episode Data Set, and Washington D.C. Metropolitan Area Drug Study (DC*MADS). Users can pick variables, cases, and get raw data or descriptive statistics. Download options are available.
      • Survey Documentation & Analysis Data Analysis System (DAS) --University of California, Berkeley

      • (http://sda.berkeley.edu:7502/archive.htm)

        DAS provides access to the following health related surveys: National Health Interview Survey (NHIS) 1991 (Person File only), and Health Studies From Brazil (in Portuguese). Users can pick variables, cases, and get raw data or descriptive statistics. Download options are available. 
      • Trends in Health and Aging (National Center for Health Statistics)

      • (http://209.217.72.34/aging/)
        Trends in Health and Aging "contains information on trends in health-related behaviors, health status, health care utilization, and cost of care for the older population  in the United States." At present over 20 national tables are available. NCHS will provide state based tables and "estimates and official data from other sources" in the future. Users can download the Beyond 20/20 extraction system or can browse tables directly (Microsoft IE 4.01 or Netscape 4.51 or higher required for the second option). Download options are available.
      • Health Data for All Ages (HDAA)
      • (http://www.cdc.gov/nchs/health_data_for_all_ages.htm)
        Provided by the National Center for Health Statistics, these tables provide information about health at the state level. Tables can be viewed, manipulated, printed, or downloaded in the Beyond 20/20 format at this time. This format requires the user to download special software from the site. At present, the site contains mostly mortality tables.
      • WHO Statistical Information System (WHOSIS)

      • (http://www.who.int/whosis/en/)
        This page contains WHO and other organization extractors for country time series data regarding core health, mortality, population, and disease indicators.
      [Back to Contents]

      Education

      • Data Analysis System (DAS Web)--National Center for Education Statistics

      • (http://nces.ed.gov/das/)
        This extraction system requires downloading and installing the DAS for Windows extraction system. DAS allows the user to create tables from data in various NCES surveys. In addition to table variables (usually in percentage format), standard errors are provided. DAS does not allow for extracting raw data from the surveys at this time. Surveys covered include: National Postsecondary Students Aid Study (NPSAS); National Study of Postsecondary Faculty (NSOPF); Beginning Postsecondary Students (BPS, 90, 92); Baccalaureate and Beyond (B&B);  National Longitudinal Study of 1972 (NLS); High School and Beyond (HS & B); and  National Educational Longitudinal Study of 1988 (NELS). Years and/or follow-ups of the studies vary. Download options are available.
      •  Search for NCES Tables and Figures--National Center for Education Statistics

      • (http://nces.ed.gov/quicktables/)
        This search tool lets you locate all tables/figures/charts published in the inventory of NCES' Education Statistics Quarterly; the NEDRC (National Education Data Resource Center) Postsecondary Table Library; the Condition of Education; the Digest of Education Statistics, and many other NCES publications.New tables are constantly being added to this database (close to 4,000 recently published tables, graphs & figures are now available).
        Global Education Database--US Agency for International Development
        (http://qesdb.cdie.org/ged/index.html)
        In July 2003, USAID's Office of Education released its fourth PC-based database of international education statistics, GED 2003. Users in the Agency and their development partners worldwide were able to access the data by downloading the GED program to their desktop computers from a CD-ROM or from the web. In an effort to make current and future education data even more accessible, EGAT/ED has developed this web-based version of the GED. All of the data can now be accessed online from this site without downloading a program. There are 224 indicators compiled from the UNESCO Institute of Statistics and 71 indicators compiled from the Demographic and Health Surveys (DHS), a USAID program that has conducted full-scale nationally representative household surveys in over 60 developing countries since 1984. USAID plans to update this online database as new data become available.
      [Back to Contents]

      Justice and Crime

      • Data Analysis System (DAS)--National Archive of Criminal Justice Data (NACJD), ICPSR

      • (http://www.icpsr.umich.edu/NACJD/SDA/das.html)
        Users of this interactive DAS can run frequencies or cross tabs, comparisons of means or correlations, or download customized sets of variables/cases for the following data sets: Uniform Crime Reporting Data: Supplementary Homicide Reports; National Crime Victimization Survey; National Corrections Reporting Program; Survey of Inmates in State Correctional Facilities; and Census of State and Federal Adult Correctional Facilities. Years of data vary by survey. Users can pick variables, cases, and format of output. Download options are available. 
      • Federal Justice Statistics Resource Center (FJSRC)--Bureau of Justice Statistics and Urban Institute

      • (http://fjsrc.urban.org/)
        The FJSRC "maintains the Bureau of Justice  Statistics (BJS) Federal Justice Statistics Program (FJSP) database, which contains  information about suspects and defendants processed in the Federal criminal justice system. Using data obtained from Federal agencies, the FJSP compiles comprehensive  information describing defendants from each stage of Federal criminal case processing." Users can download compressed ASCII versions of Standard Analysis File (SAF) data sets  (after registration), or use a query system to download selected summary statistics (frequencies and cross tabulations) from them (Defendants in Criminal Cases Filed and Terminated in U.S. District Court, Offenders Entering and Exiting Federal Prisons, Population of Offenders in Federal Prisons, and Defendants Sentenced) from 1994 to the latest year available. Download options are available.
      [Back to Contents]

      Poverty, Welfare, Income and Employment

      [Back to Contents]

      Macroeconomic Indicators

      • Bureau of Labor Statistics Data--Bureau of Labor Statistics

      • (http://www.bls.gov/data/)
        The Bureau of Labor Statistics offers three extraction systems for access to its thousands of time series covering employment & unemployment, prices & living conditions, compensation & working conditions, and productivity & technology. Monthly, quarterly, and annual data is available (depending on the database) from as early as 1913 to the present. Most Requested Series retrieves data for commonly requested BLS time series. Selective Access retrieves data for all available time series. Series Report retrieves data for all available time series by series identifier number. Users pick variables, geographies, seasonality, and output type. Raw data can also be downloaded via FTP. 
      • EconData--University of Maryland

      • (http://inforumweb.umd.edu/Econdata.html)
         EconData contains hundreds of thousands of regional, national, and international  macroeconomic time series from various US government agencies such as the Bureau of Labor Statistics, Bureau of Economic Analysis, Census Bureau, and Federal Reserve Board. Monthly, quarterly, and annual time series are available going back as far as 1929.  This extraction system requires downloading and installing on the PC. Users can pick variables and time. Download options are available, as well as options for graphics and more complex econometric analysis.
      • The Economic Time Series Page--Ted Bos, University of Alabama-Birmingham

      • (http://www.EconoMagic.com/)
        While not exactly an extraction system in the traditional sense, Economics Professor Ted Bos has set up a web site that offers access to so many macroeconomic time series (over 75,000) that it, in essence, acts as one.  Monthly, quarterly, and annual time series are available from the Bureau of Labor Statistics, Bureau of Economic Analysis, Census Bureau, and Federal Reserve Board. Time periods vary.  Data is available at  national, state, county, and municipal level. Users select the time series they are interested in. Download options are available, as are charts of the data. 
      • Penn World Tables, Mark 6.1--National Bureau of Economic Research (Robert Summers and Alan Heston) Center for International Comparisons
        University of Pennsylvania

      • (http://pwt.econ.upenn.edu/)
        The Penn World Tables contain a set of international economic comparisons for 152 countries in 29 macroeconomic topics from as early as 1950 to 2000. PWT attempts to provide a set of macroeconomic variables that is standardized across time and countries to facilitate ease of international comparisons. The best information about the construction of the tables can be found in Summers and Heston's article in the May 1991 Quarterly Journal of Economics: "The Penn World Table (Mark 5): An Expanded Set of International Comparisons, 1950-1988". Both extraction sites allow users to pick countries, years, and variables. NBER allows multiple picks. The advantage to the CHASS site is that it allows multiple download options, including plots. 
      [Back to Contents]

      Miscellaneous Multi-Subject Sites

      • American FactFinder--Census Bureau

      • (http://factfinder.census.gov/)
        American FactFinder provides interactive access to information about "community, economy, and society" from some of the largest Census databases. It provides "quick reports," detailed tables, and/or maps from the US Census, American Community Survey, and Economic Census at this time. Data can be extracted by geogaphies and variables.  Selected summary statisics can be calculated from raw data. 
      • CenStats--Census Bureau

      • (http://censtats.census.gov/)
        Census Bureau's CenStats offers is an extraction system that provides access to several popular Census Bureau databases, including the Building Permits,Census Tract Street Locator, Consolidated Federal Funds Report, County and Zip Business Patterns, USA Counties, Detailed Occupation by Race and Sex, 1990 Public Law 94-171, and International Trade Data. Data can be extracted by geographies, variables, and time, depending upon the database. CenStats is a quick and powerful system for extracting ready reference information.
      • Geospatial and Statistical Data Center Interactive Data--University of Virginia

      • (http://fisher.lib.virginia.edu/)
        In addition to PUMS data (see above), the University of Virginia Library offers interactive access to: the 1988 and 1994 City and County Databooks (Census Bureau); Uniform Crime Reporting Data (Bureau of Justice Statistics); and  national, regional, state, county and municipal macroeconomic data (National Income and Product Accounts, Regional Economic Information System, NBER Productivity, US Imports and Exports, the 1987 Standard Industrial Classification Manual, Regional Economic Projections, State Personal Income, and County Business Patterns, provided by the Bureau of Economic Analysis, National Bureau of Economic Research, and Census Bureau), among other data. Time series vary by data set. Users can pick geographies, variables, and times. Download options are available. Note: users should click on "Interactive Data" in the left hand frame of the page to access the extraction system.
      • Missouri State Census Data Center Social and Economic Information Library UEXPLORE--University of Missouri

      • (http://mcdc2.missouri.edu/applications/uexplore.shtml)
        MSCDC's UEXPLORE provides access to data in the areas of agriculture, compendia, economic indicators, education , employment, health, geography/GIS, and population. In all over 100 data sets are available, including: various decennial census summary tape and public use microdata files; Current Population Surveys; population estimates; agricultural and economic censuses; USA Counties; County Business Patterns; City and County Databooks (Census Bureau);  Bureau of Economic Analysis employment and income data; and Integrated Postsecondary Education Data System (IPEDS) and Common Core of Data (CCD) files from the National Center for Education Statistics, among others. Geographies and time periods vary by data set, with particularly strong coverage, as might be expected, for Missouri. However, many data sets have complete state or even finer level geographic coverage. Users pick geographies and variables. Record selection criteria options are available, as is sorting. Download options (including SAS data sets) are available. Note that this is an extremely powerful, but not intuitive extraction system, and users should read the online tutorial before using it. 
      • World Bank World Development Indicators (WDI) Data Query (http://devdata.worldbank.org/data-query/)

      • This extraction system is a small subset of the World Development Indicators Database. It contains 54 macroeconomic and demographic time series for 207 countries and 18 regions. Data is available for the latest five years only. Download options are available.
      • Search Population and Health Data--Population Reference Bureau

      • (http://www.prb.org/datafind/datafinder7.htm)
        This interactive query, which returns data for 72 demographic variables for the world 221 countries, 28 regions and subregions, and the US and the 50 states,  contains data drawn from several Population Reference Bureau sources, including: World Population Data Sheet; Women of our World; Breastfeeding Patterns in the Developing World; and the United States Population Data Sheet. Users can query multiple geographies and variables. Data for only the latest year are returned.
      [Back to Contents]

Last updated 08/22/2005 by Jack Solock jsolock@ssc.wisc.edu
Special Librarian, CDE Information Services

© 2000 University of Wisconsin Center for Demography and Ecology
http://www.ssc.wisc.edu/cde/library/info.htm