Is IRB review required for use of public datasets?

Is IRB review required for use of public datasets?

IRB approval is required for all research involving human subjects. The Common Rule defines "human subject" at 45 CFR 46.102(f) as follows:
(f) Human subject means a living individual about whom an investigator (whether professional or student) conducting research obtains
        (1) Data through intervention or interaction with the individual, or
        (2) Identifiable private information.

Generally, investigators should submit an "Application for Designation of Not Human Subjects Research (NHSR)" to the Office of the IRB (OIRB) for a formal NHSR determination if they want to use data about individuals when that data is (1) neither obtained through intervention or interaction with the individual, (2) nor private and individually identifiable.

However, the UAB OIRB has reviewed the datasets listed below and determined that research involving these datasets does not meet the regulatory criteria for human subjects research. It is not necessary to submit a NHSR application for use of the datasets specifically listed on this page.

Investigators can nominate additional datasets for inclusion on this list by submitting the "Public Dataset Nomination Form." Please provide in your submission as much information as possible to identify the dataset so that the OIRB can appropriately research the dataset and determine, in accordance with the regulations, whether the information in the data constitutes human subjects.


Existing Datasets that do not constitute Human Subjects Research

Many of the datasets listed below have a publically available component and a restricted use component. The use of data from the following list of UAB IRB approved public data sets is not considered human subject research as long as the following two criteria are met:
        Research will NOT involve merging any of the data sets in such a way that individuals might be identified
        Researcher will NOT enhance the public data set with identifiable, or potentially identifiable data

If the two criteria above are met and the research will involve data from a dataset listed below, no UAB IRB review or approval is needed.

NOTE: If restricted datasets will be used, contact the IRB.

Site of Data Data Set
1000 Genomes Project
http://www.1000genomes.org
Agency for Healthcare Research and Quality (AHRQ)
http://www.ahrq.gov/
American College of Surgeons (ACS) http://www.facs.org National Trauma Data Bank (NTDB)
http://www.facs.org/trauma/ntdb/

American College of Surgeons National Surgical Quality Improvement Program(ACS-NSQIP) Participant Use Data File
http://site.acsnsqip.org/participant-use-data-file/
Bureau of Labor Statistics
http://www.bls.gov/nls/home.htm
Public-Use Data Files Only - excludes restricted and geo-coded/zip code datasets
  • National Longitudinal Survey of Youth 1997 (NLSY97)
  • National Longitudinal Survey of Youth 1979 (NLSY79)
  • NLSY79 Children and Young Adults
  • National Longitudinal Survey of Young Women and Mature Women (NLSW)
  • National Longitudinal Survey of Young Men and Older Men
American Time Use Survey (ATUS)
http://www.bls.gov/tus/data.htm
Centers for Disease Control (CDC)
http://www.cdc.gov
Behavioral Risk Factor Surveillance System (BRFSS)
http://www.cdc.gov/brfss/technical_infodata/surveydata.htm

Youth Risk Behavioral Survey Study (YRBSS)
http://www.cdc.gov/healthyyouth/yrbs/data/index.htm

National Ambulatory Medical Care Survey (NAMCS)
National Hospital Ambulatory Medical Care Survey (NHAMCS)
http://www.cdc.gov/nchs/ahcd/about_ahcd.htm
dbGAP: National Center for Biotechnology Information
http://www.ncbi.nlm.nih.gov

GENEVA Genes and Environment Initiatives in Type 2 Diabetes
(Nurses' Health Study/Health Professionals Follow-up Study)
http://www.ncbi.nlm.nih.gov/projects/gap/cgi-bin/study.cgi?study_id=phs000091.v2.p1#restricted-access-section


Multi-Ethnic Study of Atherosclerosis (MESA) Cohort
(including substudies SHARe, CARe, and ESP Heart-Go)
http://www.ncbi.nlm.nih.gov/projects/gap/cgi-bin/study.cgi?study_id=phs000209.v12.p3

Data Resource Center for Child & Adolescent Health
http://childhealthdata.org/learn/facts
National Survey of Children's Health (NSCH)
National Survey of Children with Special Health Care Needs (NS-CSHCN)
Economic and Social Data Service (ESDS) Longitudinal Public-Use Data Files Only - excludes restricted and geo-coded/zip code datasets
https://www.esds.ac.uk/longitudinal/about/introduction.asp
  • 1970 British Cohort Study (BCS70)
  • British Household Panel Survey (BHPS)
  • English Longitudinal Study of Ageing (ELSA)
  • Families and Children Study (FACS)
  • Growing Up in Scotland (GUS)
  • Longitudinal Study of Young People in England (LSYPE)
  • Millennium Cohort Study (MCS)
  • National Child Development Study (NCDS)
International HapMap Project
http://hapmap.ncbi.nlm.nih.gov/whatishapmap.html.en
National Cancer Institute (NCI)
http://www.cancer.gov/
The Cancer Genome Atlas (TCGA)
Open Access data tier (Data Level 3 & 4)
http://tcga-data.nci.nih.gov/
National Center for Education Statistics (NCES)
http://nces.ed.gov/
Public-Use Data Files including, but not limited to:
School Survey on Crime and Safety (SSOCS)
http://nces.ed.gov/surveys/ssocs/
National Center for Health Statistics
http://www.cdc.gov/nchs/
Public-Use Data Files and Documentation
  • http://www.cdc.gov/nchs/data_access/ftp_data.htm
  • NHANES: National Health and Nutrition Examination Survey
  • NHCS: National Health Care Survey
  • NIS: National Immunization Survey
  • NHIS: National Health Interview Survey
  • LSOAs: Longitudinal Studies of Aging
  • NSFG: National Survey of Family Growth
  • SLAITS: State & Local Area Integrated Telephone Survey
  • Vital Statistics: National Vital Statistics System
National Data Archive on Child Abuse and Neglect
http://www.ndacan.cornell.edu/
National Survey of Child and Adolescent Well-Being (NSCAW - General Use Files Only)
http://www.ndacan.cornell.edu/ndacan/Datasets/Abstracts/DatasetAbstract_NSCAW-General.html

National Study of the Incidence of Child Abuse and Neglect
http://www.ndacan.cornell.edu/NDACAN/Datasets_List.html
Roper Center for Public Opinion Research
http://www.ropercenter.uconn.edu/
U.S. Bureau of Labor Statistics
http://www.bls.gov/
U.S. Bureau of the Census
http://www.census.gov/
U.S. Department of Agriculture (USDA)
(Economic Research Service)
http://www.ers.usda.gov/
Food and Nutrition Assistance Research Database
http://www.ers.usda.gov/data-products/food-and-mitrotopm-assistance-research-database
U.S. Department of Energy (DOE) Comprehensive Epidemiologic Data Resource
https://www.orau.gov/cedr/default.aspx#.UH2BUW_vgl8
University of Michigan Institute of Social Research
http://home.isr.umich.edu/
Panel Study of Income Dynamics (public version only)
http://psidonline.isr.umich.edu

Health and Retirement Study (public version only)
http://hrsonline.isr.umich.edu/

Inter-University Consortium for Political and Social Research (ICPSR) (public version only)
http://www.icpsr.umich.edu/icpsrweb/ICPSR/index.jsp