NYC Youth Risk Behavior Survey: Public Use Data
The CDC releases combined Youth Risk Behavior Survey (YRBS) datasets for the nation, states and large urban school districts, drawn from selected surveys conducted from 1991 to 2021.
In 2021, due to the COVID-19 pandemic, fewer schools were sampled than in past years, with the aim of providing citywide estimates rather than estimates for boroughs or Action Centers.
For more information on the YRBS, see:
Prior to downloading a dataset, review the following information:
- Due to the disproportionate sampling and cluster design of the YRBS, the data must be weighted and variances must be calculated using software capable of handling complex survey data, such as SAS, SUDAAN or Stata. Annotated sample code for analysis using SUDAAN is provided (PDF), including design statements and nesting variables.
- To analyze NYC YRBS data, use the variable "sitecode" to create a dataset retaining only records for the geographic area of interest (for example, the city overall or by borough).
- Specific items on the NYC YRBS change year-to-year, as programs develop initiatives or new public health issues emerge. For an overview of question topics that the high school NYC YRBS has included each year, review the high school question matrix (PDF). For question topics of the middle school NYC YRBS, review the middle school question matrix (PDF).
- The datasets available from the CDC do not include data for all questions in the NYC YRBS; they include only select questions asked in the 2021 YRBS questionnaire and in previous years. The CDC has also not yet made available the 2018 NYC middle school dataset. To get access to more complete NYC YRBS datasets, including the 2018 middle school dataset, complete the NYC Health Department's EPI data request form.
- Some behaviors and conditions are rare, or the sample sizes for some populations are small, which may make estimates unreliable. Our suggested guidelines for YRBS data reliability (PDF) incorporate relative standard error, confidence interval width and sample size.
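As a rough illustration of the weighting and reliability points above, the following Python sketch filters records by "sitecode", computes a weighted prevalence, and approximates its standard error by Taylor linearization over strata and primary sampling units. The column names "weight", "stratum" and "psu", the outcome "smoker", the site code "A" and all data values are assumptions made for illustration; check them against the dataset documentation. The sketch is not a replacement for SAS, SUDAAN or Stata, and it does not implement the suggested reliability guidelines; it only computes the three quantities those guidelines draw on.

```python
from collections import defaultdict
from math import sqrt


def weighted_prevalence(records, site, outcome):
    """Weighted proportion with a Taylor-linearized standard error."""
    # Keep only the geographic area of interest and rows with a known outcome.
    rows = [r for r in records
            if r["sitecode"] == site and r[outcome] is not None]
    total_w = sum(r["weight"] for r in rows)
    p = sum(r["weight"] * r[outcome] for r in rows) / total_w

    # Linearized score per record, summed within each (stratum, PSU) pair.
    psu_totals = defaultdict(lambda: defaultdict(float))
    for r in rows:
        z = r["weight"] * (r[outcome] - p) / total_w
        psu_totals[r["stratum"]][r["psu"]] += z

    # Between-PSU variance within strata (with-replacement approximation).
    variance = 0.0
    for psus in psu_totals.values():
        n_h = len(psus)
        if n_h < 2:
            continue  # assumption: a single-PSU stratum contributes nothing
        mean_z = sum(psus.values()) / n_h
        variance += n_h / (n_h - 1) * sum((z - mean_z) ** 2
                                          for z in psus.values())

    se = sqrt(variance)
    return {
        "n": len(rows),                     # unweighted sample size
        "estimate": p,                      # weighted prevalence
        "se": se,
        "rse": se / p if p > 0 else None,   # relative standard error
        "ci_half_width": 1.96 * se,         # normal approximation, 95%
    }


# Made-up records: two strata with two PSUs each in hypothetical site "A".
records = [
    {"sitecode": "A", "stratum": 1, "psu": 1, "weight": 1.0, "smoker": 1},
    {"sitecode": "A", "stratum": 1, "psu": 1, "weight": 1.0, "smoker": 0},
    {"sitecode": "A", "stratum": 1, "psu": 2, "weight": 1.0, "smoker": 0},
    {"sitecode": "A", "stratum": 1, "psu": 2, "weight": 1.0, "smoker": 0},
    {"sitecode": "A", "stratum": 2, "psu": 1, "weight": 2.0, "smoker": 1},
    {"sitecode": "A", "stratum": 2, "psu": 1, "weight": 2.0, "smoker": 1},
    {"sitecode": "A", "stratum": 2, "psu": 2, "weight": 2.0, "smoker": 0},
    {"sitecode": "A", "stratum": 2, "psu": 2, "weight": 2.0, "smoker": 1},
    {"sitecode": "B", "stratum": 1, "psu": 1, "weight": 5.0, "smoker": 1},
]

result = weighted_prevalence(records, "A", "smoker")
print(result)
```

Survey software typically bases confidence intervals on a t distribution with design-based degrees of freedom, so intervals from the packages named above may be somewhat wider than the normal approximation used here.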
Data Resources
Sample SAS-Callable SUDAAN Code (PDF)