Linked External Data

Domain overview

Please scroll horizontally to view the number of variables and events of administration for the displayed tables.

ABCD residential histories cannot be shared with data users, as addresses are personally identifiable information (PII). Instead, external data are linked within ABCD’s secure environment to provide geocoded measures while safeguarding participant privacy. The Linked External Data (LED) workgroup can consider requests for new linked data. Visit ABCD Study External Data Link Request to suggest additional geocoded measures for future release.

The initial inception of ABCD’s use of linked external data (LED) is described in Fan et al. (2021). Cardenas-Iniguez et al. (2024) contains suggestions on the appropriate use of the linked data.

Data use consideration: Linked External Data domain general

When creating state-level variables based on participant’s site at baseline, refer to the non-public release notes for a list of pguids to be excluded from analysis, as some participants reported an address outside of the site’s state or did not report an address. Linkages to site state could lead to misclassification.

Key references:

Fan, C. C., Marshall, A., Smolker, H., Gonzalez, M. R., Tapert, S. F., Barch, D. M., Sowell, E., Dowling, G. J., Cardenas-Iniguez, C., Ross, J., Thompson, W. K., & Herting, M. M. (2021). Developmental Cognitive Neuroscience, 52, 101030. https://doi.org/10.1016/j.dcn.2021.101030
Fan et al. (2021)
Cardenas-Iniguez, C., Schachner, J. N., Ip, K. I., Schertz, K. E., Gonzalez, M. R., Abad, S., & Herting, M. M. (2024). Developmental Cognitive Neuroscience, 65, 101338. https://doi.org/10.1016/j.dcn.2023.101338
Cardenas-Iniguez et al. (2024)

Stanford Education Data Archive (SEDA)

National Center for Education Statistics (NCES) school and district IDs corresponding to the school/district the participant reported attending are linked to data from the SEDA Educational Opportunity Project 5.0 and the SEDA Education Recovery Project. To protect participant privacy, NCES school and district IDs and school names/addresses are not released; however, anonymous pseudo IDs are provided.

SEDA Data Sources

SEDA Educational Opportunity Project 5.0 includes academic achievement estimates from 2009–2019, allowing for examination of long-term educational opportunity and trends across U.S. districts and schools. More information can be found on the SEDA website, and technical documentation can be found here.
SEDA Education Recovery Project includes annual district-level achievement data from the 2018–2019 through 2023–2024 academic years, enabling analyses of pandemic-related learning loss and recovery. These data support investigations of how factors such as remote learning and policy responses shaped student outcomes during and after the introduction of remote learning. More information can be found here.

School history processing:
School history data were expanded to a monthly resolution. Temporal gaps were filled using forward-fill followed by backward-fill within each school year (defined as July through June), ensuring complete monthly coverage.

Month	School	Imputed
July	A	A
Aug	NA	A
Sep	NA	A
October	B	B

Month	School	Imputed
July	NA	B
Aug	NA	B
Sep	NA	B
October	B	B

Month	School	Imputed
May	A	A
June	NA	A
July	NA	B
August	B	B

Source data processing:
SEDA data are provided in long format by year and subgroup. These data were reshaped to a wide format, resulting in one row per district or school with year- and subgroup-specific measures represented as separate columns (Reardon et al. (2025)).

Notes and special considerations:
Several data quality issues have been identified in the school history data. School attendance records may overlap temporally, resulting in multiple records per participant-month. This introduces ambiguity in assigning a single school or district, as well as in determining grade level and homeschool status for that period.

Tabulated data:
Tabulated datasets are derived from file-based longitudinal data and reported at the visit level (session_id).

Visit dates were aligned to monthly school records. For summer visits, school attendance was imputed using forward- and backward-fill.

Monthly school data were then linked to visit-level records. If multiple schools or districts were present for a given month, no data were assigned. The number of schools/districts is reported for each event, and the homeschool indicator reflects whether any record includes homeschooling-related terms.

Release notes: SEDA Data

7.0

SEDA longitudinal data update: Beginning with the 7.0 data release, SEDA data are provided longitudinally across waves, rather than at baseline only, and are available in multiple formats (e.g., by assessment wave, school year, and month) to support different analytic approaches. Data linked by assessment wave can be downloaded within the tabulated datasets, whereas data linked by school year and month must be downloaded as file-based data through NBDC Data Access Platform.
SEDA pseudo-identifiers update: pseudo-identifiers for this release were re-generated and should not be combined with pseudo-identifiers from previous releases.

le_l_sedaadm__schl_id - Anonymized school ID based on National Center for Education Statistics (NCES) school IDs [Cross-listed: ab_g_dyn__design_id__school]

le_l_sedaadm__distr_id - Anonymized school district ID based on National Center for Education Statistics (NCES) school IDs [Cross-listed: ab_g_dyn__design_id__district]

6.0

SEDA subtables

Data consideration: Stanford Education Data Archive (SEDA)

Data disaggregated by race or ethnicity were intentionally not released.

The following disaggregation categories are included for each geographic unit:

_all: all
_fem: female
_mal: male
_mfg: male minus female opportunity gap
_ecd: economically disadvantaged
_nec: not economically disadvantaged
_neg: gap between not economically disadvantaged and economically disadvantaged

ABCD has linked to the cohort standardized (CS) scale scores. The “CS” SEDA 5.0 files are recommended for research in SEDA documentation and include Ordinary Least Squares and Empirical Bayes estimation methods.

The CS scale is standardized within subject and grade, relative to the average of the four cohorts in our data who were in 4th grade in 2009, 2011, 2013, and 2015. We use the average of four cohorts as our reference group because they provide a stable baseline for comparison.
This metric is interpretable as an effect size, relative to the grade-specific standard deviation of student-level scores in this common, average cohort.
For example, a district mean of 0.5 on the CS scale indicates that the average student scored approximately one half of a standard deviation higher than the average national reference cohort scored in that same grade.
Means reported on the CS scale have an overall average near 0 as expected.

For all archived versions of SEDA visit here, Fahle et al. (2021).

School Academic Achievement- Administrative (SEDA 5.0 & SEDA Recovery)

le_l_sedaadm

tabulated

Measure description: Table includes pseudo district and school IDs, school grade, the number of schools and districts attended during the assessment period, and an indicator for homeschooling.

le_l_sedaadm__distr_id crosslisted as ab_g_dyn__design_id__district is the pseudo ID provided for clustering participants by school district le_l_sedaadm__schl_id crosslisted as ab_g_dyn__design_id__school is the pseudo ID provided for clustering participants by school

School history data are collected using start month/year and end month/year, and include NCES school IDs corresponding to the reported school, when available.

Manually entered data are also captured to indicate homeschooling or related experiences. Homeschool records are identified when the reported school name includes terms such as homeschool, unschool, virtual, remote, cyber, or online. le_l_sedaadm__homeschl_indicator is provided as an indicator.

School Academic Achievement - District (SEDA 5.0)

le_l_seda__distr

tabulated

Measure description: - Table includes SEDA 5.0 dataset average test scores (averaged across math and reading). The average test scores (math/reading) are included as a measure of educational opportunities at the district level. The measure also includes SEDA 5.0 dataset demographic covariates across years 2009-2019.

The linked academic achievement variables have the following disaggregation: average test score mean (averaged across grades, years, and subjects), average “learning rate” across grades, average “trend” in the test scores across cohorts, and average difference between math and reading/ language arts (RLA) test scores, along with their standard errors. Estimates are reported for all students and by demographic subgroups.

The linked academic achievement variables have the following disaggregation: average test score mean in math and in RLA (averaged across grades and years), average “learning rate” across grades in math and in RLA, and average “trend” in the test scores across cohorts in math and in RLA, along with their standard errors.

Notes and special considerations: District-level SEDA data were linked to participant individual data in multiple ways: by assessment wave within the tabulated data (district ID at the time of the assessment), as well as school year (most common district ID each academic year), and monthly (most common district ID each month), which can be downloaded as file-based data in the NBDC Data Access Platform. Although SEDA data measures academic achievement for 3rd through 8th grade, we include district-level data regardless if participants are now in high school.

School Academic Achievement - School (SEDA 5.0)

le_l_seda__schl

tabulated

Measure description: Table includes SEDA 5.0 dataset average test scores (averaged across math and reading). The average test scores (math/reading) are included as a measure of math educational opportunities at the school level when participants are in 8th grade or below. The measure also includes SEDA 5.0 dataset demographic covariates across years 2009-2019.

Notes and special considerations: School-level SEDA data were linked to participant individual data in multiple ways: by assessment wave within the tabulated data (school ID at the time of the assessment), as well as academic year (most common school ID each academic year), and monthly (most common school ID each month), which can be downloaded through the NBDC Data Access Platform. School-level SEDA data is only available if the participant is enrolled in 3rd through 8th grade.

School Academic Achievement Recovery - District (SEDA Recovery)

le_l_sedarec__distr

tabulated

Measure description: Table includes SEDA Education Recovery Project dataset, which provides annual district-level achievement data from the 2018–2019 through 2023–2024 academic years. The table also provides pandemic-related learning loss and recovery scores, which are change scores between 2024 and 2019, 2024 and 2022, and 2022 and 2019. Positive scores indicate improved academic achievement across years, whereas negative scores indicate loss in academic achievement.

Residential history-derived environmental measures

Release notes: Residential history

6.0

Address collection: The original address data collection processes in ABCD relied on a point-in-time capture of residential addresses rather than recording longitudinal residential history.

Administrative

Residential Address Description Variables

le_l_admin

tabulated

Measure description: Included in this table are variables indicating the percentage of time spent at either the primary, secondary, or tertiary residential address, in addition to variables indicating the number of years of residence at a given address.

Notes and special considerations: The original address data collection processes in ABCD relied on a point-in-time capture of residential addresses rather than recording longitudinal residential history. As such the assumption has been made that addresses reflected participants’ baseline addresses in all versions of the residential history releases to date (2.1, 3.0, 4.0, 5.0, and 6.0). Users should note that for a given survey timepoint, the percent of time may not sum up to 100%.

To address this limitation, the LED Environment and Policy WG has been actively working to improve the collection of residential history data to improve the temporal and geographic accuracy of participants’ reported addresses. These updates aim to improve the collection of retrospective and prospective data in order to provide more comprehensive and accurate residential histories that correspond to both the age of the child as well as each study visit. These data are expected to be available in future releases. Until then, users should be mindful of both the assumption and limitations of the currently available data.

Air Pollution

Satellite-based NO2 Measures (Air Quality Data for Health-Related Applications)

le_l_no2

tabulated

Measure description: Measures of nitrous dioxide (NO2) were computed for all participants’ primary, secondary, and tertiary addresses, when available. Air pollution estimates used a hybrid spatio-temporal model that provides daily estimates of air pollution levels at 1km x 1km resolution (Di et al. (2020)). Measures in the current release include mean, minimum, and maximum NO2 from 2016.

Key reference: Di, Q., Amini, H., Shi, L., Kloog, I., Silvern, R., Kelly, J., Sabath, M. B., Choirat, C., Koutrakis, P., Lyapustin, A., Wang, Y., Mickley, L. J., & Schwartz, J. (2020). Environmental Science & Technology, 54(3), 1372–1384. https://doi.org/10.1021/acs.est.9b03358

Satellite-based O3 Measures (Air Quality Data for Health-Related Applications)

le_l_o3

tabulated

Measure description: Measures of ozone (O3) were computed for all participants’ primary, secondary, and tertiary addresses, when available. Air pollution estimates used a hybrid spatio-temporal model that provides daily estimates of air pollution levels at 1km x 1km resolution (Requia et al. (2020)). Measures in the current release include mean, minimum, and maximum O3 from 2016.

Release notes: le_l_lo3

7.0

Correctly mapped variable names:There was an error during curation of the le_l_lo3 table for the 6.0 release. This naming issue has now been corrected for the 7.0 release.

6.0

Incorrectly mapped variable names

Key reference: Requia, W. J., Di, Q., Silvern, R., Kelly, J. T., Koutrakis, P., Mickley, L. J., Sulprizio, M. P., Amini, H., Shi, L., & Schwartz, J. (2020). Environmental Science & Technology, 54(18), 11037–11047. https://doi.org/10.1021/acs.est.0c01791

Satellite-based Particulate Measures

le_l_particulat

tabulated

Measure description: Annual estimates for 15 components were linked to baseline addresses for ABCD participants. Spatiotemporal models were used to create annual estimates for baseline years 2016-2018 at a resolution of 50m. Since the models are based on the calendar year, the number of days spent in each calendar year was calculated to determine an annual estimate for the year leading to the baseline date. For each calendar year (prior and visit year), the number of days spent in each year was divided by 365 with adjustments for leap year. These percentages were applied to each year’s estimates and summed to represent each participant’s estimated annual exposure.

Estimate = (ParticulatePrior Year * Percent timePrior Year) + (ParticulateVisit Year * Percent timeVisit Year)

Estimate = (ParticulatePrior Year * (DaysPrior Year 365 + ParticulateVisit Year) / (DaysVisit Year365)

Key references:

Jin, T., Amini, H., Kosheleva, A., Danesh Yazdi, M., Wei, Y., Castro, E., Di, Q., Shi, L., & Schwartz, J. (2022). Environmental Health, 21(1), 96. https://doi.org/10.1186/s12940-022-00907-2
Jin et al. (2022)
Amini, H., Danesh-Yazdi, M., Di, Q., Requia, W., Wei, Y., Abu-Awad, Y., Shi, L., Franklin, M., Kang, C., Wolfson, J., James, P., Habre, R., Zhu, Q., Apte, J., Andersen, Z., kloog, I., Dominici, F., Koutrakis, P., & Schwartz, J. (2022). https://doi.org/10.21203/rs.3.rs-1745433/v1
Amini et al. (2022)

Satellite-based PM2.5 Measures (Air Quality Data for Health-Related Applications)

le_l_pm25

tabulated

Measure description: Measures of fine particulate air pollution (PM2.5) were computed for all participants’ primary, secondary, and tertiary addresses, when available. Air pollution estimates used a hybrid spatio-temporal model that provides daily estimates of air pollution levels at 1km x 1km resolution (Di et al. (2019)). Measures in the current release include mean, minimum, and maximum PM2.5 from 2016, and number of days in 2016 that PM2.5 levels exceeded the National Ambient Air Quality Standards for safe levels of exposure.

Key reference: - Di, Q., Amini, H., Shi, L., Kloog, I., Silvern, R., Kelly, J., Sabath, M. B., Choirat, C., Koutrakis, P., Lyapustin, A., Wang, Y., Mickley, L. J., & Schwartz, J. (2019). Environment International, 130, 104909. https://doi.org/10.1016/j.envint.2019.104909

Satellite-based Pollution Measures for Prenatal Addresses

le_l_prenatal

tabulated

Measure description: A prenatal period of air pollution exposure was estimated from previous lifetime addresses for the child obtained at the 1-year follow-up, beginning with an approximate date of conception (calculated by subtracting 279 days from the reported birthdate) and continuing through the reported date of birth. Hybrid spatiotemporal models were used to create daily estimates for PM2.5, ozone (O3), and nitrogen dioxide (NO2) for the birth years [2004-2009] at a resolution of 1 km2 [1-3] (Di et al. (2019); Requia et al. (2020); Di et al. (2020)). The daily estimates of these air pollution exposures during the prenatal period of each participant were assigned to the corresponding reported birth year address(es), and daily estimates were averaged across the entire prenatal period to represent the prenatal exposure. For study participants with multiple self-reported addresses for the same calendar year, a weighted average of air pollution exposure was calculated in order to provide a single estimated exposure value for every participant. This was calculated by weighting the prenatal average exposure values for each residence by the reported percent of time spent at that residence; the sum of these weighted exposure averages was then divided by the sum of all reported percentages (regardless of total sum of time between addresses) (Eq 1) (Herting et al. (2025)).

Weighted Average = (Air PollutionAddress_1 * Percent timeAddress_1) + (Air PollutionAddress_2 * Percent timeAddress_2) / (Percent timeAddress_1 + Percent time Address_2)

The majority of participants (7,837 [91%]) had reported an address(es) corresponding to the calendar year congruent with the child’s birth year as well as the reported percentages of time spent at each address summed to 100%. For a subset of participants (806 [9%]), errors occurred in the reporting of addresses or percentages of time spent at each address, including 386 [4%] participants who reported percentages of time at multiple addresses that did not sum to 100% and 420 [5%] participants who reported addresses with no corresponding percent of time spent living at that address. Given these reporting errors, it is unclear exactly how much time was spent at various addresses during the exposure period, thus introducing uncertainty in exposure classification.

In order to convey this uncertainty, a quality control (qc) variable was created in order to rank prenatal data according to how accurate we expect the air pollution exposure weighted average to be. This qc variable, le_l_prenatal_qc is defined as follows:

`le_l_prenatal_qc`	Meaning	Number of Participants (Percent)
1	Reported percentages at all addresses sum to 100%	7837 (90.67%)
2	Reported percentages at all addresses sum to totals ranging from 90 to 110%	70 (0.81%)
3	Reported percentages at all addresses sum to totals less than 90% or greater than 110%	316 (3.66%)
4	Reported percentages at addresses were missing	420 (4.86%)

For more information on how residential histories were obtained see as reported in Fan et al. (2021).

Notes and Special Considerations: A number of assumptions and limitations of these data should be noted and highlighted in research and publications that use these derived ABCD data.

First, ABCD did not specifically ask caregivers where they resided when pregnant or if addresses changed during pregnancy, but rather addresses reported during a given calendar year and coinciding with the participant’s birth year were considered representative for the pregnancy period.
Second, caregivers were only asked to provide move-in and move-out information in terms of calendar year, without month information provided. Thus, chronological sequence of addresses that overlap within a given year can not be deciphered and this error may factor into the weight sum scores computed above.
The weighted sum equation was applied to all individuals regardless if their provided sum living in a location did or did not add to 100%. The error inherent to this approach can be reduced by only including individuals with a le_l_prenatal_qc variable equal to 1 in analyses using these data.
The approach used above assigns a prenatal exposure based on daily exposures over the 9-months prior to a child’s birthdate. This approach assumes all children were delivered full-term, which is not necessarily true. It may be best to limit the sample of participants in analyses using these data that can be confirmed to have been in utero for full-term using caregiver report information provided about the child’s medical history in other portions of the ABCD dataset (ph_p_dhx).

Key references:

Di, Q., Amini, H., Shi, L., Kloog, I., Silvern, R., Kelly, J., Sabath, M. B., Choirat, C., Koutrakis, P., Lyapustin, A., Wang, Y., Mickley, L. J., & Schwartz, J. (2019). Environment International, 130, 104909. https://doi.org/10.1016/j.envint.2019.104909
Di et al. (2019)
Di, Q., Amini, H., Shi, L., Kloog, I., Silvern, R., Kelly, J., Sabath, M. B., Choirat, C., Koutrakis, P., Lyapustin, A., Wang, Y., Mickley, L. J., & Schwartz, J. (2020). Environmental Science & Technology, 54(3), 1372–1384. https://doi.org/10.1021/acs.est.9b03358
Di et al. (2020)
Fan, C. C., Marshall, A., Smolker, H., Gonzalez, M. R., Tapert, S. F., Barch, D. M., Sowell, E., Dowling, G. J., Cardenas-Iniguez, C., Ross, J., Thompson, W. K., & Herting, M. M. (2021). Developmental Cognitive Neuroscience, 52, 101030. https://doi.org/10.1016/j.dcn.2021.101030
Fan et al. (2021)
Herting, M. M., Burnor, E., Ahmadi, H., Eckel, S. P., Gauderman, W., Schwartz, J., Berhane, K. T., McConnell, R., & Chen, J. (2025). Research Reports: Health Effects Institute, 2025, 225. https://doi.org/https://pmc.ncbi.nlm.nih.gov/articles/PMC12093475/
Herting et al. (2025)
Requia, W. J., Di, Q., Silvern, R., Kelly, J. T., Koutrakis, P., Mickley, L. J., Sulprizio, M. P., Amini, H., Shi, L., & Schwartz, J. (2020). Environmental Science & Technology, 54(18), 11037–11047. https://doi.org/10.1021/acs.est.0c01791
Requia et al. (2020)

Amenities & Services

Access to parks, social services and amenities, and additional neighborhood demographics from the The National Neighborhood Data Archive (NaNDA) have been linked to participants’ primary, secondary, and tertiary addresses. NaNDA is a publicly available data archive containing contextual measures for locations across the United States.

Code and source files used to derive these measures can be found in Abad et al. (2023)

Performing Arts and Sports Recreation Orgs (NaNDA)

le_l_artsports

tabulated

Measure description: Performing arts and sports organizations (per census tract and per county).

Number of all performing arts organizations per 1000 people
Total count of spectator sports organizations
Number of all spectator sports organizations per 1000 people

Key reference: Finlay, J., Li, M., Esposito, M., Gomez-Lopez, I., Khan, A., Clarke, P., & Chenoweth, M. (2020). https://doi.org/10.3886/E117163V2

Neighborhood Socioeconomic Status and Demographics (NaNDa)

le_l_nbhsoc

tabulated

Measure description: - The following were derived from the ACS 2013-2017 5-year estimates at the census tract level:

Proportion of people who are foreign born
Proportion of families with income greater than 75K
Proportion of families with income greater than $100K
Proportion 16+ civilian labor force unemployed

In addition, three factors associated with neighborhood sociodemographics and structural characteristics are included (based on Morenoff et al. (2007)):

Neighborhood disadvantage score is characterized by high levels of poverty, unemployment, female-headed families, households receiving public assistance income, and a high proportion of African Americans in a census tract.
Neighborhood affluence score represents a mix of characteristics associated with neighborhood affluence (concentrations of adults with a college education; with incomes>75K; and employed in managerial and professional occupations).
Neighborhood ethnic/immigrant score represents ethnic and immigrant concentration. Higher values indicate more Hispanic and foreign born in the census tract.

Data consideration: NaNDA

Population density was set to -1 for tracts with zero population in a given year. Population and area densities are missing in certain years for a few census tracts whose FIPS codes changed at some point between 2003 and 2017.

Key references:

Melendez, R., Clarke, P., Khan, A., Gomez-Lopez, I., Li, M., & Chenoweth, M. (2020). https://doi.org/10.3886/E119451V2
Melendez et al. (2020)
Morenoff, J. D., House, J. S., Hansen, B. B., Williams, D. R., Kaplan, G. A., & Hunte, H. E. (2007). Social Science & Medicine, 65(9), 1853–1866. https://doi.org/10.1016/j.socscimed.2007.05.038
Morenoff et al. (2007)
[Appendix A: Supplemental Material]

Parks (NaNDa)

le_l_parks

tabulated

Measure description: Parks (per census tract and per county):

Total number of parks per census tract (top coded at 10)
Total number of open parks per county
Proportion of open park land within census tract

Key reference: Clarke, P., Melendez, R., & Chenoweth, M. (2020). https://doi.org/10.3886/E117921V1

Religious/Civic Organizations (NaNDa)

le_l_relciv

tabulated

Measure description: - Religious/Civic Organizations (per census tract and per county):

Total count of religious organizations
Number of all religious organizations per 1000 people
Total count of civic/social organizations
Number of all civic/social organizations per 1000 people

Key reference: Finlay, J., Li, M., Esposito, M., Gomez-Lopez, I., Khan, A., Clarke, P., & Chenoweth, M. (2020). https://doi.org/10.3886/E117163V2

Social Services (NaNDa)

le_l_socsrv

tabulated

Measure description: - Measure includes senior centers, youth centers, food banks, job training programs, and day care centers (per census tract and per county):

Total count of social services
Number of all social services per 1000 people

Key reference: Finlay, J., Li, M., Esposito, M., Gomez-Lopez, I., Khan, A., Clarke, P., & Chenoweth, M. (2020). https://doi.org/10.3886/E117163V2

Built Environment

Crime (ICPSR)

le_l_crime

tabulated

Measure description: The description for this database can be found here. To maintain a stability on the crime estimates, we used three-year averages, 2010 to 2012. The resolution of this database is at the county level.

Key reference: United States Department of Justice, Office of Justice Programs, & Federal Bureau of Investigation. (2014). https://doi.org/10.3886/ICPSR33523.v2

Responsible use consideration: Crime

Crime variable is a count variable at the county level, however counties are not homogeneous, which may impact interpretation of findings..

Building Density (EPA)

le_l_densbld

tabulated

Measure description: The description for the smart location database can be found here. We used estimates based on 2010 census. The resulting variables include gross residential density (i.e., building density). The resolution of this database is at the census tract level.

Key reference: EPA. (2014).

Population Density (EPA)

le_l_denspop

tabulated

Measure description: The estimation is based on the 2010 census tract while adjusted based on potential under-reporting across the world (UN adjusted). Data are from NASA Socioeconomic Data and Applications Center / Earthdata Search (SEDAC).

Key reference: NASA. (2025). Your Gateway to NASA Earth Observation Data.

Vehicle Density (ACS)

le_l_densveh

tabulated

Measure description: Vehicle density was calculated using data from the 2014-2018 American Community Survey 5-year estimates for primary, secondary, and tertiary addresses at the census tract level. Vehicle density was calculated in two ways: 1) As an area estimate (aggregate number of variables in a census tract per square mile of land area), and 2) as a population density (aggregate number of vehicles available in a census tract per individual). Vehicle density may be associated with neighborhood residents’ levels of exposure to noxious chemicals and their vulnerability to vehicle-related injuries or fatalities.

Code and source files used to derive these measures can be found here.

Key reference: Abad, S., Schachner, J., Cardenas-Iniguez, C., Ip, K. I., & Herting, M. (2023). https://doi.org/10.17605/OSF.IO/NMDTG

Lead Risk (Vox)

le_l_leadrisk

tabulated

Measure description: All participants’ primary, secondary, and tertiary addresses were, if valid, geocoded at the census tract level with respect to an estimated risk of lead (Pb) exposure. These data are from Vox.com (https://www.vox.com/a/lead-exposure-risk-map). These risk scores (deciles, 1-10, 10 being the most at risk) were calculated for each census tract and reflect a weighted sum between two summary measures of that census tract: the age of homes and the rate of poverty. The lead risk score generally reflects an estimated probability of lead exposure given the age of homes (i.e., older homes are more likely to contain lead hazards) and the proportion of individuals living in poverty (-125% of poverty level). The housing-age and poverty-rate scores for valid primary, second, and tertiary addresses are also included.

Key reference: Vox. (2016).

Responsible use consideration: Lead risk

Lead risk variable is not validated with biological measurements, but is a proxy measure based on neighborhood SES (i.e. poverty) and age of housing.

Road Proximity (Kalibrate)

le_l_roadprox

tabulated

Measure description: Road proximity was derived for primary, secondary, and tertiary address at 1km x 1km resolution using data from Kalibrate, Lei et al. (2022). These measures capture the number of meters participants’ addresses were from major roads.

Key references:

Lei, S. Z., Amica-Carpenter, G., Bowler, C., & Michie, E. (2022).
Lei et al. (2022)
ESRI. (2025).
ESRI (2025)

Traffic Density (Kalibrate)

le_l_traffic

tabulated

Measure description: Traffic counts were derived for primary, secondary, and tertiary address at 1km x 1km resolution using data from Kalibrate. These measures indicate the total volume of cars passing through a given area across the entirety of 2016.

Key references:

Lei, S. Z., Amica-Carpenter, G., Bowler, C., & Michie, E. (2022).
Lei et al. (2022)
ESRI. (2025).
ESRI (2025)

Urban/Rural Area (Census)

le_l_urban

tabulated

Measure description: Urban area is a categorical variable indicating if participants’ addresses were in census tracts considered to be urban , urban clusters , or rural. These measures were derived from publicly available census data from 2010.

List of values: 1: Urbanized Areas (UAs) - 50,000 or more people; 2: Urban Clusters (UCs) - at least 2,500 and less than 50,000 people; 3: “Rural” - all population, housing, and territory not included within an urban area or cluster..

Key reference: Bureau. (2010). Census.gov.

Walkability (EPA)

le_l_walk

tabulated

Measure description: The description for the smart location database can be found here. Walkability index estimates were based on 2010 census at the census tract level.

Key reference: EPA. (2014).

Community Health Burden

Behavioral Health Measures (PLACES)

le_l_places

tabulated

Measure description: Measures from the PLACES dataset are available for participants’ primary, secondary, and tertiary addresses at the census tract level. The PLACES dataset is an expansion of the 500 Cities Project and is available from the Center for Disease and Control and Prevention (CDC), the Robert Wood Johnson Foundation (RWJF) and CDC Foundation. The data linked to ABCD are from the PLACES 2020 release, which reflect measures derived using the 2017/2018 Behavioral Risk Factor Surveillance System (BRFSS) data, and reflect years 2014-2018. Included are 27 measures for the entire United States: * 5 chronic disease-related unhealthy behaviors, * 13 health outcomes, and * 9 on use of preventive services. Additional technical information about PLACES (including information about downloading the data) can be found here.

Code and source files used to derive these measures can be found here: Abad et al. (2023)

Key references:

CDC. (2025). Smoking and Tobacco Use.
CDC (2025)
Abad, S., Schachner, J., Cardenas-Iniguez, C., Ip, K. I., & Herting, M. (2023). https://doi.org/10.17605/OSF.IO/NMDTG
Abad et al. (2023)

Meteorology and Exposures

Elevation of Address (Google API)

le_l_elevation

tabulated

Measure description: This is based on a direct query to the Google map, which contains elevations given where participants live. All variables are truncated to avoid potential identification while maintaining overall variations for analytical purposes.

Estimates of Environmental Noise (Harvard)

le_l_noise

tabulated

Measure description: The spatially varying noise data were obtained from a georeferenced noise model of expected environmental sound levels created by Dr. Peter James at Harvard. The model capitalized on acoustical data from 1.5 million hours of long-term measurements from 492 urban and rural sites located across the contiguous United States during 2000-2014. Geospatial sound models were developed based on these measurements to interpret and predict sound levels across the contiguous United States (D. Mennitt et al. (2013); D. J. Mennitt & Fristrup (2016)). This geospatial sound model formulated relationships between sparsely measured acoustical metrics and nonacoustic environmental factors such as topography, climate, hydrology, and anthropogenic activity. The method utilized random forest, a tree-based machine learning algorithm, to perform the regression. The resulting non-time-varying geospatial sound model enabled mapping of sound levels at 270m resolution.

All noise metrics are A-weighted. A-weighting is an adjustment that reflects how the human ear perceives sound across the frequency spectrum.

The L10 is an exceedance metric that corresponds to the sound pressure level exceeded 10% of the time.
The L50 is an exceedance metric that corresponds to the sound pressure level exceeded 50% of the time.
The L90 is an exceedance metric that corresponds to the sound pressure level exceeded 90% of the time.
Leq is the average sound level in decibels equivalent to the total sound energy measured in a 24-hour period;
LeqNight is the average sound level in from the hours of 10p-7a; and
Ldn, is the day-night average sound level, which is the average sound level over a 24-hour period where sound from 10pm-7am is upweighted by 10 dB.
Clarification: The noise level measurements between the hours of 10pm and 7am are artificially increased by 10 dB before averaging. This noise is weighted to take into account the decrease in community background noise of 10 dB during this period. For more information about similar metrics of sound day-evening-night average sound level (DENL) commonly used in Europe or community noise exposure level (CNEL) used in California legislation- that is, the DNL with the addition of an evening period from 7 PM to 10 PM when noise level measurements are boosted 5 dB to account for the approximate decrease in background community noise by 5 dB during this period.
Variables ending in _ant ant is anthropogenic, and _exi is total noise (anthropogenic and natural) sources. Natural sources alone were not included given the limited variability seen within largely urban locations in which most participants reside.

Notes and Special Considerations: A number of assumptions and limitations of these data should be noted and highlighted in research and publications that use these derived ABCD data.

Sources of the noise are not identifiable based on these models, since the source and sound level is assumed to be homogenous.
All levels were projected for the summer season.

Key references:

Mennitt, D., Fristrup, K., Nelson, L., & McKenna, M. (2013). The Journal of the Acoustical Society of America, 134(5\_Supplement), 4159–4159. https://doi.org/10.1121/1.4831244
D. Mennitt et al. (2013)
Mennitt, D. J. & Fristrup, K. M. (2016). Noise Control Engineering Journal, 64(3), 342–353. https://doi.org/10.3397/1/376384
D. J. Mennitt & Fristrup (2016)

National Air Toxics Assessment (NATA) Measures

le_l_nata

tabulated

Measure description: Three measures from the National Air Toxics Assessment (NATA) are available for participants’ primary, secondary, and tertiary addresses at the census tract level. These data were released in 2018, based on 2014 emissions data. Three measures include: 1) Diesel particulate matter level in the air, in g/m3; 2) lifetime cancer risk from inhalation of air toxics; and 3) ratio of exposure concentration to health-based reference concentration. Code and source files used to derive these measures can be found here: Abad et al. (2023)

Key references:

Abad, S., Schachner, J., Cardenas-Iniguez, C., Ip, K. I., & Herting, M. (2023). https://doi.org/10.17605/OSF.IO/NMDTG
Abad et al. (2023)
EPA, O. (2015).
O. EPA (2015)

Temperature Estimates (PRISM)

le_l_temp

tabulated

Measure description: Daily maximum modeled temperature (“Tmax”) for each of seven days preceding and including each participant’s baseline interview date was linked to the participants’ primary, secondary, and tertiary residential addresses. Meteorologic data provided for linkage ranged from January 1, 2016 to December 31, 2018 and is modeled to 800m spatial resolution from the PRISM Climate Group at Oregon State University (Spangler et al. (2019)). Tmax data is in degrees C.

Notes and Special Considerations: It is important to note that the days preceding and the day of baseline interview were linked based on ‘interview_date’, which depending on the individual may or may not include the same tasks/scans across all participants. Users should confirm interview_date matches the date of any assessment (i.e. scan session day 1 or 2, or behavioral tests) at an individual level in order to accurately use these data.

It is important to note that not all participants currently have temperature data as part of 5.0 for baseline visit dates. Investigators should consider any possible bias in who does and does not have maximum temperature if using these data.

Errors were identified in the 4.0 temperature linkages and should not be used given these issues. These errors were minimized for the provided data since the 5.0 release.

Key reference: Spangler, K. R., Weinberger, K. R., & Wellenius, G. A. (2019). Journal of Exposure Science & Environmental Epidemiology, 29(6), 777–789. https://doi.org/10.1038/s41370-018-0105-2

Vapor Pressure Deficit (VPD) Estimates (PRISM)

le_l_vpd

tabulated

Measure description: Daily maximum vapor pressure deficit (“VPDmax”) for each of seven days preceding and including each participant’s baseline interview date was linked to the participants’ residential address. Meteorologic data provided for linkage ranged from January 1, 2016 to December 31, 2018 and is modeled to 800m spatial resolution from the PRISM Climate Group at Oregon State University. VPDmax data is in hPA.VPDmax data can be used to calculate daily relative humidity and maximum heat index; Spangler et al. (2019).

It is important to note that not all participants currently have vapor pressure deficit data as part of 6.0 for baseline visit dates. Investigators should consider any possible bias in who does and does not have maximum vapor pressure deficit values if using these data.

Errors were identified in the 4.0 VPD linkages and should not be used given these issues. These errors were minimized for the provided data since the 5.0 release.

Key reference: Spangler, K. R., Weinberger, K. R., & Wellenius, G. A. (2019). Journal of Exposure Science & Environmental Epidemiology, 29(6), 777–789. https://doi.org/10.1038/s41370-018-0105-2

Natural Space and Satellite

Land-Use Measures (NLT)

le_l_urbsat

tabulated

Measure description: Here we include 11 variables comprising an “Urban-Satellite” dataset within ABCD which quantify the percentage or quantity of urbanicity measures such as nighttime lighting, population, vegetation, water, and others, at a 1km resolution.

Key references:

Xu, J., Liu, X., Li, Q., Goldblatt, R., Qin, W., Liu, F., Chu, C., Luo, Q., Ing, A., Guo, L., Liu, N., Liu, H., Huang, C., Cheng, J., Wang, M., Geng, Z., Zhu, W., Zhang, B., Liao, W., Qiu, S., Zhang, H., Xu, X., Yu, Y., Gao, B., Han, T., Cui, G., Chen, F., Xian, J., Li, J., Zhang, J., Zuo, X., Wang, D., Shen, W., Miao, Y., Yuan, F., Lui, S., Zhang, X., Xu, K., Zhang, L., Ye, Z., Banaschewski, T., Barker, G. J., Bokde, A. L. W., Flor, H., Grigis, A., Garavan, H., Gowland, P., Heinz, A., Brühl, R., Martinot, J., Artiges, E., Nees, F., Orfanos, D. P., Lemaitre, H., Paus, T., Poustka, L., Robinson, L., Hohmann, S., Fröhner, J. H., Smolka, M. N., Walter, H., Whelan, R., Winterer, J., Patrick, K., Calhoun, V., Li, M. J., Liang, M., Gong, P., Barker, E. D., Clinton, N., Marquand, A., Yu, L., Yu, C., Schumann, G., the CHIMGEN, & IMAGEN Consortia. (2021). Nature Human Behaviour, 6(2), 279–293. https://doi.org/10.1038/s41562-021-01204-7
Xu et al. (2021)
Goldblatt, R., Holz, N., Tate, G., Sherman, K., Ghebremicael, S., Bhuyan, S. S., Al-Ajlouni, Y., Santillanes, S., Araya, G., Abad, S., Herting, M. M., Thompson, W., Thapaliya, B., Sapkota, R., Xu, J., Liu, J., the environMENTAL consortium, Schumann, G., & Calhoun, V. D. (2024). https://doi.org/10.1101/2023.11.06.23298044
Goldblatt et al. (2024)

Measure of Land Cover and Tree Canopy (NLCD)

le_l_nlcd

tabulated

Measure description: Measures of land cover (e.g., low-, medium-, or high-density development, forest, wetland) and tree canopy derived from the National Land Cover Database (NLCD) are available for participants’ primary, secondary, and tertiary addresses at the census tract and county levels of aggregation. Land cover is measured as a percentage of all land within the tract/county. Tracts/counties only include the mainland US (i.e., Alaska and Hawaii are not included). Find here.

Measures provided here use the 2019 iteration of 2016 data. Shapefiles for census tracts were obtained from the US Census bureau TIGER/Line shapefiles for 2010 Find here.

Code used to derive county and census tract levels are here.

Key references:

Schertz, K., Kardan, O., Rosenberg, M., & Berman, M. (2022). https://doi.org/10.17605/OSF.IO/HXMWJ
Schertz et al. (2022)
Multi-Resolution Land Characteristics (MRLC) Consortium. (2025).
Multi-Resolution Land Characteristics (MRLC) Consortium (2025)
US Census Bureau. (2010).
U. C. Bureau (2010)

Neighborhood Composite Measures

Area Deprivation Index (ADI)

le_l_adi

tabulated

Measure description: The Area Deprivation Index (ADI) was calculated based on Kind et al. (2014). The database we queried is the 2011 - 2015 American Community Survey 5-year summary. Although the area deprivation index has 18 different sub-scores, the recommendation is to use the national percentiles. The resolution of this is at the census tract level.

Key reference: Kind, A. J., Jencks, S., Brock, J., Yu, M., Bartels, C., Ehlenbach, W., Greenberg, C., & Smith, M. (2014). Annals of Internal Medicine, 161(11), 765–774. https://doi.org/10.7326/M13-2946

Responsible use consideration: Area deprivation index

Users should carefully consider the component variables and constructs of any composite measure, and choose the most appropriate measure(s) for their research questions. Composite measures may have overlapping variables, and researchers should consider issues of collinearity when using multiple composite measures. Additionally, certain indices—such as the ADI—may have documented issues with skewness and overrepresentation of particular component variables that may not be documented here. Additional responsible use considerations can be found here.

Child Opportunity Index 2.0 (COI)

le_l_coi

tabulated

Measure description: The COI 2.0 is a composite index derived at the census tract level that measures neighborhood resources and conditions relevant to children’s healthy development. In addition to the COI index, there are 3 domain composite scores available: education, health and environment, and social and economic. There are also scores available for the 29 indicators comprising the composite scores. Raw indicator scores and z-scores are provided. The COI 2.0 data is derived from U.S. census tracts for 2010 and 2015.

See here for COI 2.0 technical documentation.

Key reference: Child Opportunity Index. (2023).

Responsible use consideration: Child opportunity index

Social Vulnerability Index (SVI)

le_l_svi

tabulated

Measure description: The Social Vulnerability Index (SVI) refers to a 15-variable composite metric that quantifies how vulnerable different communities are to external stresses on human health, such as natural disasters, human-caused disasters, and disease outbreaks, made available from the Centers for Disease Control and Prevention. Higher values reflect more vulnerable communities at the census-tract level. Both the composite and individual metrics (i.e., national percentiles and raw percentage scores) are available in the ABCD dataset. Data were derived from the 2014-2018 American Community Survey (ACS) 5-year summary.

Notes and Special Considerations: Since the 5.0 release, SVI measures for ACS 2014-2018 have also been computed based on percentage scores for each census tract so as to increase compatibility with ADI and COI census-based measures.

Key references:

CDC. (2024). Place and Health - Geospatial Research, Analysis, and Services Program (GRASP).
CDC (2024a)
Bureau. (2021). Census.gov.
Bureau (2021)

Responsible use consideration: Social vulnerability index

Supplemental measures to the Social Vulnerability Index (SSVI)

le_l_ssvi

tabulated

Measure description: These measures expand on the original SVI measures, with additional measures of sociodemographic composition, health care infrastructure & access, and medical vulnerability, which have been linked to participants’ primary, secondary, and tertiary addresses.

The following variables from the SVI, derived from the 2014-2018 ACS 5-year estimates, Health Resources & Services Area Health Resource Files, Homeland Infrastructure Foundation-Level Data, and Rx Open were included:

Percentage population estimates for white, American Indian and Alaska Native, Black or African American, Asian, Hispanic or Latino/a, Native Hawaiian/Pacific Islander (census tract level)
Percentage population estimates of Spanish/Chinese/Vietnamese/Korean/Russian speakers who “speak English less than ‘very well’” (county level)
Hospitals per 100,000 people (county level)
Urgent care clinics per 100,000 people (county level)
Pharmacies per 100,000 (county level)
Primary care physicians per 100,000 (care, non-federal; county level)
Percentage population estimates for people without health insurance (census tract level)
Percentage population estimates for people with no internet access (census tract level)
Percentage population estimates for people without no computer access (census tract level)

Neighborhood Social Factors

Alcohol Outlet Density (Census 2016)

le_l_densalc

tabulated

Measure description:A measure of alcohol outlet density was derived using Census 2016 data containing ZIP Code Business Patterns (ZIP-BP) for the number of establishments with potential to serve or sell alcohol by Industry and Employment Size of Establishment (e.g., drinking places & full service restaurants).This measure was linked to each participant’s address at the resolution of ZIP codes.ZIP-BP has a High correlation with state/local data sources on alcohol outlet density; Matthews et al. (2011).

The total number of establishments from the following North American Industry Classification System (NAICS) codes were summed:

445110 (Supermarkets and grocery stores except convenience)
445120 (Convenience stores)
445310 (Beer, wine and liquor stores)
447110 (Gasoline stations with convenience stores)
722511 (Restaurants – full-service)
722410 (Drinking places – alcoholic beverages)

Constructed Variables Contained in Analytic Files

Variable Name	Description
`le_l_densalc__addr1__alc_density__2016`	Estimate of alcohol outlet density (Number of Establishments where alcohol could be purchased and/or consumed based on North American Industry Classification System (NAICS) from ZIP Code Business Patterns from Census 2016)

Key reference: Matthews, S. A., McCarthy, J. D., & Rafail, P. S. (2011). Addictive Behaviors, 36(7), 777–780. https://doi.org/10.1016/j.addbeh.2011.02.009

Census Return

le_l_censusret

tabulated

Measure description: 2010 Decennial Census mail return rates and American Community Survey 2014-2018. 5-year self-response rates are available for participants’ primary, secondary, and tertiary addresses at the census tract level. Census response/return rates have been previously associated with social capital and civic engagement, and anomie.

Code and source files used to derive these measures can be found here (Abad et al., 2023).

Key reference: Abad, S., Schachner, J., Cardenas-Iniguez, C., Ip, K. I., & Herting, M. (2023). https://doi.org/10.17605/OSF.IO/NMDTG

Number of Jobs and Job Density (LODES)

le_l_lodes

tabulated

Measure description: Number of jobs and job density (number of jobs per square mile of land area) are available at the census tract level for participants’ primary, secondary and tertiary addresses. Employment statistics are also available, broken down by race and ethnicity. Employment information was derived from the LEHD Origin-Destination Employment Statistics (LODES) dataset for the year 2016, and are available for download here.

Code and source files used to derive these measures can be found here (Abad et al., 2023).

Key reference: US Census Bureau Center for Economic Studies. (2025).

Opportunity Zones and Investment Scores

le_l_oz

tabulated

Measure description: Opportunity Zone (OZ) designations, compiled by the Urban Institute, are available for participants’ primary, secondary, and tertiary addresses. The Tax Cuts and Jobs Act of 2017 included the designation of Opportunity Zones as a federal incentive to spur investment in low-income and undercapitalized communities. Census tracts were eligible for selection as an OZ if they were considered “low-income communities” as determined by the New Market Tax Credit program, although official designations of OZ status were determined by state governors (and mayor of District of Columbia). In addition, Investment Scores, also calculated by the Urban Institute, are available as deciles scores and represent investment flow to census tracts over the years 2011-2015 based on four components: commercial lending, multifamily lending, single-family lending, and small business lending. Additional documentation for these variables can be found from Theodos et al. (2018), here.

Code and source files used to derive these measures can be found here (Abad et al., 2023).

Key reference: Theodos, B., Meixell, B., & Hedman, C. (2018).

Rent and Mortgage Statistics

le_l_rentmort

tabulated

Measure description: In order to approximate cost of living that may be associated with housing, the following variables from the 2014-2018 American Community Survey (ACS) 5-year estimates have been linked to participants’ primary, secondary, and tertiary addresses at the census tract level:

Percent home ownership, percent of households living with rent burden (rent is at least 30% of income), percent of households living with severe rent burden (rent is at least 50% of income), median monthly gross rent, and median monthly mortgage payments.

Code and source files used to derive these measures can be found here (Abad et al., 2023).

Social Mobility (Opportunity Atlas)

le_l_socmob

tabulated

Measure description: The Opportunity Atlas uses anonymous data based on 20 million Americans followed from childhood to their mid-30s to provide outcomes for adults who grew up in each census tract. Percentile household incomes correspond to outcomes in adulthood of people who grew up in each census tract and were born between 1978 and 1983.

The variable corresponding to the 25th percentile is typically the main index of upward social mobility for each census tract, as it captures the mean income rank in adulthood for children who grew up in low-income families in this census tract. See the US Census Bureau website for more details.

Key reference: US Census Bureau. (2025a).

Residential Segregation

Dissimilarity Index (ACS)

le_l_dissim

tabulated

Measure description: The Dissimilarity Index measures the fraction of one group that would have to move to another area, in order to equalize the population distribution. While this measure is computed comparing the ratio of two racial/ethnic groups to each other within a census tract (derived from 2014-2018 ACS estimates), these values are aggregated for a given Metropolitan Service Area (MSA). Thus, we have linked Dissimilarity Indices for the MSA in which a participant may live, and not the census tract. The following contrasts were computed: Black vs. non-Hispanic white, Asian vs. non-Hispanic white, Hispanic vs. non-Hispanic white, nonwhite vs. non-Hispanic white.

Code and source files used to derive these measures can be found here (Abad et al., 2023).

Exposure/Interaction Index

le_l_expint

tabulated

Measure description: Exposure is measured by the Indices of Isolation and Interaction. The Index of interaction measures the likelihood of population subgroups interacting with one another. The Index of Isolation measures how likely it is that one group is isolated, or only surrounded by other members of the same group. We have linked a version of the Interaction Index that indicates the probability that one member of a group may interact with a member of a contrast group. While the measure is computed at the census tract (derived from 2014-2018 ACS estimates), these values are aggregated for a given Metropolitan Service Area (MSA). Thus, we have linked Interaction Indices for the MSA in which a participant may live, and not the census tract. The following contrasts were computed: Black vs. non-Hispanic white, Asian vs. non-Hispanic white, Hispanic vs. non-Hispanic white, nonwhite vs. non-Hispanic white.

Code and source files used to derive these measures can be found here (Abad et al., 2023).

Getis-Ord Gi* Statistics (ICPSR)

le_l_gi

tabulated

Measure description: From ICPSR Documentation:

Measures were calculated at the census tract level on the proportion of non-Hispanic white, non-Hispanic Black, non-Hispanic Asian and Pacific Islander for 1990 and 2000 census, and Hispanic persons per census tract. Gi* statistics are Z-scores that compare the proportion of the population in the focal tract and its neighboring tracts, to the average proportion of a larger geographic unit. For the majority of tracts, the larger geographic unit was the Core-Based Statistical Area (CBSA) these tracts belonged to, and for the minority of tracts that fell outside the boundaries of a CBSA, the County was used as the larger unit.

Data for the measures were obtained from the IPUMS National Historical Geographic Information System (NHGIS) data finder; NHGIS Data Finder (2025). Data were downloaded for the 1990 and 2000 census, and the 2006-2009, 2010-2014, and 2015-2019 5-year American Community Survey (ACS) estimates. Geographically standardized time series tables were used for 1990 and 2000 census data. All other ACS data were standardized to 2010 census tract boundaries.

Gi* statistics were calculated using both Rook and Queen conceptualization of spatial relationships. With Rook contiguity, neighbors are determined by those that share a common edge only, while Queen contiguity neighbors are those that share both an edge or a “corner” (common vertex). See detailed documentation for further details.

Code and source files used to derive these measures can be found here (Abad et al., 2023).

Index of Concentration at the Extremes (ACS)

le_l_eci

tabulated

Measure description: Index of Concentration at the Extremes (ICE) quantifies how persons in a specified area are concentrated into the top vs bottom of a specified societal distribution. We’ve linked two versions of the ICE: one in which we compare the top income quintile to lowest income quintile (income ICE) as a measure of economic segregation, and one in which we compare the top income quintile for non-Hispanic white to lowest income quintile of non-Hispanic Black individuals in a census tract (income + BW ICE) as a measure of racialized economic segregation.

ABCD Subdomain: Residential Segregation

Code and source files used to derive these measures can be found here (Abad et al., 2023).

Key reference: Krieger, N., Kim, R., Feldman, J., & Waterman, P. D. (2018). International Journal of Epidemiology, 47(3), 788–819. https://doi.org/10.1093/ije/dyy004

Multi-group Entropy Index

le_l_entropy

tabulated

Measure description: Both the Index of Dissimilarity and Interaction Index can only measure the segregation of two groups compared to each other. The Multi-Group Entropy Index measures the spatial distribution of multiple groups simultaneously (see here). This measure is dependent on the number of categories included in the computation, and for the linked measures.

Three measures are available as of this release: Multi-Group Entropy Scores at the census tract and metro-level, and a Multi-Group Entropy Index that standardizes the sum of the census-tract scores within a metro by the standard deviation of the metro-level distribution.

Code and source files used to derive these measures can be found here (Abad et al., 2023).

Key reference: Iceland, J. (2004).

Policy

Affordable Care Act Medicaid Expansion Data (KFF)

le_l_aca

tabulated

Measure description: Information on state adoption of statewide Medicaid expansion covering the period 2010-2020 was obtained from two sources:

Kaiser Family Foundation (2017): This webpage was updated in May 29, 2020 to reflect state Medicaid expansions through February 19, 2020.
Congressional Research Services (2014): The ACA Medicaid Expansion, December 30, 2014

From these two sources we constructed the following variable which indicated whether a state passed a law to expand Medicaid statewide (not just for selected subgroups) or if a Section 1115 waiver was obtained to expand Medicaid statewide prior to the 2014 federal allowance under the ACA. These data were mapped into the ABCD data based on (1) the date/month/year (DMY) of the Medicaid expansion policy adoption and (2) the interview date and state of residence of the ABCD resident.

Constructed Variables Contained in Analytic Files

Variable Name	Description
ACAexpand	Residential history derived -dichotomous indicator = 1 if the resident lives in a state at the time of the interview where Medicaid was expanded as a result of the ACA; 0 otherwise

ACA Policy last updated June 29, 2020

Key references:

KFF. (2025). Kaiser Family Foundation.
KFF (2025)
Mitchell, A. (2014).
Mitchell (2014)

Immigration Bias Measures (Hatzenbuehler)

le_l_attimm

tabulated

Measure description: State-level measures of implicit and explicit attitudes towards Latinx individuals based on state-level policies on immigration, recognizing that many Latinx individuals are not immigrants but that such state-level policies likely influence the experience of all individuals in the community with that identity.

Key reference: Hatzenbuehler, M. L., Weissman, D. G., McKetta, S., Lattanner, M. R., Ford, J. V., Barch, D. M., & McLaughlin, K. A. (2022). Journal of the American Academy of Child & Adolescent Psychiatry, 61(6), 809–819. https://doi.org/10.1016/j.jaac.2021.08.017

Cannabis Legalizations Categories by State (NCSL and MPP)

le_l_lawsmj

tabulated

Measure description: Cannabis legalization categories were assigned to participants addresses based on their state of residence. The four cannabis legalization categories are:

Recreational – allows adults to use cannabis for recreational purposes
Medical – allows adults to use cannabis for medical conditions
Low THC/CBD – allows adults to use cannabis that is low in THC and high in CBD for medical conditions
No legal access to cannabis – forbids access to cannabis.

Information about states current cannabis laws were obtained from Project (2025) and National Conference of State Legislatures (2025).

Key references:

Marijuana Policy Project. (2025). Marijuana Policy Project.
Project (2025)
National Conference of State Legislatures. (2025). National Conference of State Legislatures.
National Conference of State Legislatures (2025)

CDC Opioid Prescription Dispensing Data per 100k Residents (CDC)

le_l_rxopioid

tabulated

Measure description:The CDC makes downloadable data on the rate of opioid prescriptions dispensed in each US state using data from IQVIA Xponent. IQVIA Xponent is based on a sample of approximately 49,900 retail (non-hospital) pharmacies, which dispense nearly 92% of all retail prescriptions in the United States. For this database, a prescription is a new or refill prescription dispensed at a retail pharmacy in the sample and paid for by commercial insurance, Medicaid, Medicare, cash or its equivalent, or other third-party coverage. This database does not include mail order prescriptions. For the calculation of dispensing rates, numerators are the projected total number of opioid prescriptions dispensed annually at the county level. Annual resident population denominators are from the U.S. Census Bureau.

Opioid prescriptions, including buprenorphine, codeine, fentanyl, hydrocodone, hydromorphone, methadone, morphine, oxycodone, oxymorphone, propoxyphene, tapentadol, and tramadol are included and identified using National Drug Codes. All prescriptions with days’ supply between 1 and 365 days and a known strength description are included. Cough and cold formulations containing opioids and buprenorphine products typically used to treat opioid use disorder are not included. In addition, methadone dispensed through methadone treatment programs is not included in the IQVIA Xponent data.

Variable Names	Description
`opioid_Rxper100K_0`	Residential history derived measure of opioid prescriptions dispensed per 100K in the county in which the respondent resides in year of baseline visit
`opioid_Rxper100K_1`	Residential history derived measure of opioid prescription dispensed per 100K in the county in which the respondent resides one year prior to baseline visit
`opioid_Rxper100K_2`	Residential history derived measure of opioid prescription dispensed per 100K in the county in which the respondent resides two years prior to baseline visit
`opioid_Rxper100K_3`	Residential history derived measure of opioid prescription dispensed per 100K in the county in which the respondent resides three years prior to baseline visit
`opioid_Rxper100K_4`	Residential history derived measure of opioid prescription dispensed per 100K in the county in which the respondent resides four years prior to baseline visit
`opioid_Rxper100K_5`	Residential history derived measure of opioid prescription dispensed per 100K in the county in which the respondent resides five years prior to baseline visit

Notes and special considerations: In January 2019, IQVIA changed the frame of measurement in their projected prescription services from reflecting prescription demand to be “number of prescriptions dispensed to bin” to reflect total prescriptions actually “sold to the patient.” To do this, IQVIA eliminated the effects of voided and reversed prescriptions (prescriptions that were never received by the patient) beginning in 2017. This change in measurement frame resulted in a 1.9% downward shift in the measured opioid prescriptions dispensed. This enhancement was applied to data from 2017 and thereafter. Thus, caution should be exercised when examining trends during this time period. Furthermore, starting in 2019, prescriptions were based on the location of the prescriber, rather than the location of the pharmacy.

Key reference: CDC. (2024). Overdose Prevention.

OPTIC-Vetted Co-prescribing Naloxone Policy Data (Optic)

le_l_rxnalox

tabulated

Measure description: Co-prescribing naloxone policies captured here represent state policies requiring physicians to co-prescribe naloxone with opioid and/or benzodiazepine prescriptions. Specific dimensions of Co-Prescribing Policy Data included in this public version of the data are based on a review of the legal statutes conducted by OPTIC investigators of laws in effect as of September 2019 and reported in Table 1 of Haffajee et al. (2020), as well as additional information contained in the original legal database. Data is from RAND-USC Schaeffer Opioid Policy Tools and Information Center; OPTIC-Vetted PDMP Policy Data.

Constructed Variables Contained in Analytic Files

Variable Name	Description
`eff_nalcop`	Residential history derived -dichotomous indicator = 1 if any state co-prescribing naloxone policy was effective at date of baseline visit (even if policy restricted to a small subset of patients); 0 otherwise.
`eff_al`	Residential history derived - dichotomous indicator = 1 if state co-prescribing naloxone policy was effective for all patients at date of baseline visit; 0 otherwise.

Last Updated: September 28, 2020

Key references:

RAND Corporation. (2025). RAND Corporation.
Corporation (2025)
Haffajee, R. L., Cherney, S., & Smart, R. (2020). Drug and Alcohol Dependence, 209, 107896. https://doi.org/10.1016/j.drugalcdep.2020.107896
Haffajee et al. (2020)

OPTIC-Vetted Good Samaritan Policy Data (Optic)

le_l_goodsam

tabulated

Measure description: Good Samaritan laws are state laws that provide limited civil and/or criminal immunity to individuals who request assistance from authorities during an overdose. Specific dimensions of Good Samaritan policy data included in this public version of the data are based on a review of relevant protections granted through different variations of these laws as described in C. S. Davis & Carr (2015) and C. Davis & Carr (2017)

Description

Variable Name	Description
`any_gsl`	Residential history derived - dichotomous indicator = 1 if any type of Good Samaritan law was effective at date of baseline interview; 0 otherwise.
`gsl_arrest`	Residential history derived - dichotomous indicator = 1 if Good Samaritan law that provides protection from arrest for controlled substance possession was effective at date of baseline interview; 0 otherwise.

Key reference:

Davis, C. S. & Carr, D. (2015). Drug and Alcohol Dependence, 157, 112–120. https://doi.org/10.1016/j.drugalcdep.2015.10.013
C. S. Davis & Carr (2015)
Davis, C. & Carr, D. (2017). Journal of the American Pharmacists Association, 57(2), S180–S184. https://doi.org/10.1016/j.japh.2016.11.007
C. Davis & Carr (2017)

OPTIC-Vetted Medical Marijuana Policy Data (Optic)

le_l_medmj

tabulated

Measure description: Specific dimensions of marijuana policy data included in this public version of the data are based on variables frequently considered in marijuana policy research. The primary data come from legal review of laws conducted at RAND with funding from NIDA (PI: Rosalie Pacula) by Anne Boustead (JD, PhD), in consultation with Rosalie Pacula and Rosanna Smart (Pacula et al. (2014)).

Updates since December 2017 and March 2019 drew on data reported by National Conference of State Legislatures (2025) in March 2019 with subsequent updates through December 2019 conducted by Jason Blanchard (JD, Boston University) as part of a Boston University-USC-RAND funded project by NIAAA. The early work is described in Powell et al. (2018) and Pacula et al. (2015).

Constructed Variables Contained in Analytic Files

Variable Name	Description
`effMML`	Residential history derived dichotomous indicator = 1 if the individual lives in a state with a medical marijuana law (MML) (i.e. patients have legal protection to possess MJ containing more than trace amounts of THC for medicinal purposes) that was effective on date of baseline visit; 0 otherwise.
`effREC`	Residential history derived dichotomous indicator = 1 if the individual lives in aState with a recreational adult-use cannabis law in effect at date of baseline visit; 0 otherwise.
`active_medlegdisp`	Residential history derived dichotomous indicator = 1 if the individual lives in a state where medical cannabis dispensaries were a legal source of supply as of the date of baseline visit; 0 otherwise.
`active_dispREC`	Residential history derived dichotomous indicator = 1 if the individual lives in a state where a rec store was opened in the state at date of baseline visit; 0 otherwise.
`decrimmmj_ind`	Residential history derived dichotomous indicator = 1 if the individual lives in a state that reduced the status offense for possessing up to one ounce of MJ to a non-criminal offense (civil offense / infraction / petty offense) as of the date of baseline visit; 0 otherwise.

Last Updated: July 7, 2021

Key references:

Pacula, R. L., Hunt, P., & Boustead, A. (2014). Journal of Drug Policy Analysis, 7(1), 1–19. https://doi.org/10.1515/jdpa-2014-0001
Pacula et al. (2014)
National Conference of State Legislatures. (2025). National Conference of State Legislatures.
National Conference of State Legislatures (2025)
Pacula, R. L., Powell, D., Heaton, P., & Sevigny, E. L. (2015). Journal of Policy Analysis and Management, 34(1), 7–31. https://doi.org/10.1002/pam.21804
Pacula et al. (2015)
Powell, D., Pacula, R. L., & Jacobson, M. (2018). Journal of Health Economics, 58, 29–42. https://doi.org/10.1016/j.jhealeco.2017.12.007
Powell et al. (2018)

OPTIC-Vetted Naloxone Policy Data (Optic)

le_l_polnalox

tabulated

Measure description:Dimensions of naloxone policy data included in this data release are based on a review of relevant protections granted through different variations of these laws as described in C. S. Davis & Carr (2015) and C. Davis & Carr (2017).

Information was obtained from PDAPS (originally downloaded May 27, 2016, and redownloaded March 4, 2020), with a few modifications made in consultation with Corey Davis (who was also consulting with PDAPS to update their laws).

Constructed Variables Contained in Analytic Files

Variable Name	Description
`any_nal`	Residential history derived dichotomous indicator = 1 if any type of naloxone law was effective at date of baseline visit; 0 otherwise.
`nal_protocol_standing`	Residential history derived dichotomous indicator = 1 if a naloxone law allowing distribution through a standing or protocol order was effective at date of baseline visit; 0 otherwise.
`nal_Rx_prescriptive_auth`	Residential history derived dichotomous indicator = 1 if naloxone law allowing pharmacists prescriptive authority was effective at date of baseline visit; 0 otherwise

Last Updated: July 29, 2020

Notes and special considerations:

In Iowa, the legislature adopted two different bills (one House bill and one Senate bill) regarding this section, both with an effective date of May 27, 2016. However, one amended the section and made those amendments retroactive to April 6, 2016. However, given no action could be taken on the retroactive date, we assume the PDAPS effective date of May 27,2016; thus, it is coded as June 2016.

In Jan 2008, California piloted naloxone programs in several counties (including the most populous LA and SF). However, this was not expanded statewide until January 2014. PDAPS uses the pilot date for the first law (2008), which is what we use here for “any NAL.” However, one could make the argument that the 2014 date is preferable.

Key references:

Davis, C. S. & Carr, D. (2015). Drug and Alcohol Dependence, 157, 112–120. https://doi.org/10.1016/j.drugalcdep.2015.10.013
C. S. Davis & Carr (2015)
Davis, C. & Carr, D. (2017). Journal of the American Pharmacists Association, 57(2), S180–S184. https://doi.org/10.1016/j.japh.2016.11.007
C. Davis & Carr (2017)

OPTIC-Vetted Prescription Drug Monitoring Program (PDMP) Policy Data (Optic)

le_l_rxmonit

tabulated

Measure description: Information for three variables included in the OPTIC Vetted prescription drug monitoring policy data set have been imported into the ABCD data. These data were based on the following information:

PDAPS (policy data last reported on June 1, 2017, but last downloaded by us and checked in September 19, 2019). These data were used primarily for the construction of (1) AnyPDMP_partial / date_AnyPDMP for new first-time PDMP laws passed after 1/1/1998; (2) electronic_partial / date_electronic; and (3) Must Access_partial / date_prescriber_mustaccess.
Use of Brandeis University PDMP Training and Technical Assistance Center (TTAC) to obtain dates on first PDMP policy (including paper-based systems) that were adopted prior to January 1, 1998, (for AnyPDMP_partial / date_AnyPDMP).
Use of Horwitz et al. (2018) for two variables: (1) Any_PMP_Horowitz_Partial / date_Any_PMP_Horowitz (source: Table 2, Column 1 of their table) and (2) Op_PDMP_partial / date_Op_PDMP (source: Table 2, Column 4 of their table).
Consultation with Corey Davis who was also consulting with PDAPS to update their data.

Variable Name	Description
`mustaccess`	Residential history derived dichotomous indicator = 1 if legislation requiring prescribers to access PDMP before prescribing was effective at date of baseline visit; 0 otherwise.
`AnyPDMP`	Residential history derived dichotomous indicator = 1 if PDMP enabling legislation for any type of PDMP is in effect (including paper-based systems) at the time of baseline visit; 0 otherwise.
`Op_PDMP`	Residential history derived dichotomous indicator = 1 if a “modern PDMP system” is in effect at the time of baseline visit; 0 otherwise.

Notes and special considerations: Regarding “must access” provisions, various data sources use the term “mandatory use” to mean different things. Mandatory use, which differs from mandatory registration, may simply mean that prescribers must have an account on the system; that they must input their patient’s information in the system, or that they must check the system before prescribing an opioid under different circumstances (see here). These are ultimately different levels of specificity of the law, and in the spirit of Horwitz et al. (2018), which encourages greater specificity in how these laws are defined. It is appropriate to use the term “physician must access” only when prescribers are mandated to access the database (regardless of the circumstance, which could differ if it is new patient, existing patient, and so on).

Key references:

Horwitz, J., Davis, C., McClelland, L., Fordon, R., & Meara, E. (2018). https://doi.org/10.3386/w24947
Horwitz et al. (2018)
(2018). PEW.
“When Are Prescribers Required to Use Prescription Drug Monitoring Programs?” (2018)
Bureau of Justice Assurance. (2025).
Justice Assurance (2025)

File-based data

SEDA

file-based concatenated

abcd
└── concatenated
    └── linked_external
        └── seda
            ├── seda_mo.<csv|parquet>
            ├── seda_schlyr.<csv|parquet>
            ├── sedarec_mo.<csv|parquet>
            └── sedarec_schlyr.<csv|parquet>

File details

Four file-based datasets are provided in csv and parquet format:

seda_mo: School history by month linked to SEDA data
sedarec_mo: School history by month linked to SEDA Recovery data
seda_schlyr: School history by school year linked to SEDA data
sedarec_schlyr: School history by school year linked to SEDA Recovery data

To download these data, follow the instructions on the ‘Access & Download’ page.

File formats

The SEDA file-based data are made available as both csv and parquet files to support a range of tools and user preferences. However, since the parquet format ensures that data is imported with correctly specified data types and facilitates faster loading speeds, we recommend using parquet files over csv files whenever possible (see here).

Overview

SEDA-derived data are released as file-based data in both monthly and school-year formats, linked to either SEDA 5.0 or SEDA Recovery data. These file-based releases complement the tabulated SEDA tables described in the SEDA section, which link SEDA measures to ABCD assessment waves; the file-based datasets retain the full longitudinal resolution (monthly or by school year) for analyses that require finer temporal granularity.

Linkage and derivation

Monthly files

Monthly longitudinal data (seda_mo, sedarec_mo) were linked to SEDA data using NCES school IDs. Each participant-month was assigned SEDA measures based on the reported school or district. A homeschool indicator reflects whether the reported school name includes terms such as homeschool, unschool, virtual, remote, cyber, or online. School grade is also provided at the month level.

School-year files

To derive school-year data (seda_schlyr, sedarec_schlyr), monthly data were restricted to September through May, with each September–May period defined as a single school year to avoid summer gaps. For each school year, the school or district in which the participant spent the greatest number of months was selected. If multiple schools tied, no values were assigned. The number of schools/districts is reported for each school year, and the homeschool indicator reflects whether any school record included homeschooling terms.

Metadata information

The metadata (data dictionary and categorical levels tables) for each of the SEDA files are provided below. The data dictionary provides detailed information about the variables included in each file, including variable names, labels, and data types; the categorical levels table provides the mapping between values provided in the data and corresponding labels for categorical variables. Users are encouraged to consult the metadata to understand the structure and content of the data before working with it. Select the table of interest and switch between the Data Dictionary and Categorical Levels tabs.

References

Abad, S., Schachner, J., Cardenas-Iniguez, C., Ip, K. I., & Herting, M. (2023). https://doi.org/10.17605/OSF.IO/NMDTG

Amini, H., Danesh-Yazdi, M., Di, Q., Requia, W., Wei, Y., Abu-Awad, Y., Shi, L., Franklin, M., Kang, C.-M., Wolfson, J., James, P., Habre, R., Zhu, Q., Apte, J., Andersen, Z., kloog, I., Dominici, F., Koutrakis, P., & Schwartz, J. (2022). https://doi.org/10.21203/rs.3.rs-1745433/v1

Bureau. (2010). In Census.gov. https://www.census.gov/programs-surveys/geography/about/faq/2010-urban-area-faq.html.

Bureau. (2021). In Census.gov. https://www.census.gov/programs-surveys/acs/technical-documentation/table-and-geography-changes/2018/5-year.html.

Bureau, U. C. (2025a). https://opportunityatlas.org/.

Bureau, U. C. (2010). https://www.census.gov/cgi-bin/geo/shapefiles/index.php.

Cardenas-Iniguez, C., Schachner, J. N., Ip, K. I., Schertz, K. E., Gonzalez, M. R., Abad, S., & Herting, M. M. (2024). Developmental Cognitive Neuroscience, 65, 101338. https://doi.org/10.1016/j.dcn.2023.101338

CDC. (2024a). In Place and Health - Geospatial Research, Analysis, and Services Program (GRASP). https://www.atsdr.cdc.gov/place-health/php/svi/index.html.

CDC. (2024b). In Overdose Prevention. https://www.cdc.gov/overdose-prevention/data-research/facts-stats/us-dispensing-rate-maps.html.

CDC. (2025). In Smoking and Tobacco Use. https://www.cdc.gov/tobacco/other-tobacco-products/cigars.html.

Clarke, P., Melendez, R., & Chenoweth, M. (2020). Inter-university Consortium for Political and Social Research (ICPSR). https://doi.org/10.3886/E117921V1

Corporation, R. (2025). In RAND Corporation. https://www.rand.org/health-care/centers/optic/resources/datasets.html.

Davis, C. S., & Carr, D. (2015). Drug and Alcohol Dependence, 157, 112–120. https://doi.org/10.1016/j.drugalcdep.2015.10.013

Davis, C., & Carr, D. (2017). Journal of the American Pharmacists Association, 57(2), S180–S184. https://doi.org/10.1016/j.japh.2016.11.007

Di, Q., Amini, H., Shi, L., Kloog, I., Silvern, R., Kelly, J., Sabath, M. B., Choirat, C., Koutrakis, P., Lyapustin, A., Wang, Y., Mickley, L. J., & Schwartz, J. (2019). Environment International, 130, 104909. https://doi.org/10.1016/j.envint.2019.104909

Di, Q., Amini, H., Shi, L., Kloog, I., Silvern, R., Kelly, J., Sabath, M. B., Choirat, C., Koutrakis, P., Lyapustin, A., Wang, Y., Mickley, L. J., & Schwartz, J. (2020). Environmental Science & Technology, 54(3), 1372–1384. https://doi.org/10.1021/acs.est.9b03358

EPA. (2014).

EPA, O. (2015). https://www.epa.gov/national-air-toxics-assessment.

ESRI. (2025). https://doc.arcgis.com/en/esri-demographics/latest/esri-demographics/kalibrate.htm.

Fahle, E. M., Chavez, B., Kalogrides, D., Shear, B. R., Reardon, S. F., & Ho, A. D. (2021). Stanford University.

Fan, C. C., Marshall, A., Smolker, H., Gonzalez, M. R., Tapert, S. F., Barch, D. M., Sowell, E., Dowling, G. J., Cardenas-Iniguez, C., Ross, J., Thompson, W. K., & Herting, M. M. (2021). Developmental Cognitive Neuroscience, 52, 101030. https://doi.org/10.1016/j.dcn.2021.101030

Finlay, J., Li, M., Esposito, M., Gomez-Lopez, I., Khan, A., Clarke, P., & Chenoweth, M. (2020). Inter-university Consortium for Political and Social Research (ICPSR). https://doi.org/10.3886/E117163V2

Goldblatt, R., Holz, N., Tate, G., Sherman, K., Ghebremicael, S., Bhuyan, S. S., Al-Ajlouni, Y., Santillanes, S., Araya, G., Abad, S., Herting, M. M., Thompson, W., Thapaliya, B., Sapkota, R., Xu, J., Liu, J., the environMENTAL consortium, Schumann, G., & Calhoun, V. D. (2024). https://doi.org/10.1101/2023.11.06.23298044

Haffajee, R. L., Cherney, S., & Smart, R. (2020). Drug and Alcohol Dependence, 209, 107896. https://doi.org/10.1016/j.drugalcdep.2020.107896

Hatzenbuehler, M. L., Weissman, D. G., McKetta, S., Lattanner, M. R., Ford, J. V., Barch, D. M., & McLaughlin, K. A. (2022). Journal of the American Academy of Child & Adolescent Psychiatry, 61(6), 809–819. https://doi.org/10.1016/j.jaac.2021.08.017

Herting, M. M., Burnor, E., Ahmadi, H., Eckel, S. P., Gauderman, W., Schwartz, J., Berhane, K. T., McConnell, R., & Chen, J.-C. (2025). Research Reports: Health Effects Institute, 2025, 225. https://doi.org/https://pmc.ncbi.nlm.nih.gov/articles/PMC12093475/

Horwitz, J., Davis, C., McClelland, L., Fordon, R., & Meara, E. (2018). w24947. https://doi.org/10.3386/w24947

Iceland, J. (2004).

Index, C. O. (2023). https://data.diversitydatakids.org/dataset/coi20-child-opportunity-index-2-0-database.

Jin, T., Amini, H., Kosheleva, A., Danesh Yazdi, M., Wei, Y., Castro, E., Di, Q., Shi, L., & Schwartz, J. (2022). Environmental Health, 21(1), 96. https://doi.org/10.1186/s12940-022-00907-2

Justice Assurance, B. of. (2025). https://www.pdmpassist.org/Policies/Legislative.

KFF. (2025). In Kaiser Family Foundation. https://www.kff.org/affordable-care-act/state-indicator/state-activity-around-expanding-medicaid-under-the-affordable-care-act/.

Kind, A. J. H., Jencks, S., Brock, J., Yu, M., Bartels, C., Ehlenbach, W., Greenberg, C., & Smith, M. (2014). Annals of Internal Medicine, 161(11), 765–774. https://doi.org/10.7326/M13-2946

Krieger, N., Kim, R., Feldman, J., & Waterman, P. D. (2018). International Journal of Epidemiology, 47(3), 788–819. https://doi.org/10.1093/ije/dyy004

Lei, S. Z., Amica-Carpenter, G., Bowler, C., & Michie, E. (2022).

Matthews, S. A., McCarthy, J. D., & Rafail, P. S. (2011). Addictive Behaviors, 36(7), 777–780. https://doi.org/10.1016/j.addbeh.2011.02.009

Melendez, R., Clarke, P., Khan, A., Gomez-Lopez, I., Li, M., & Chenoweth, M. (2020). Inter-university Consortium for Political and Social Research (ICPSR). https://doi.org/10.3886/E119451V2

Mennitt, D. J., & Fristrup, K. M. (2016). Noise Control Engineering Journal, 64(3), 342–353. https://doi.org/10.3397/1/376384

Mennitt, D., Fristrup, K., Nelson, L., & McKenna, M. (2013). The Journal of the Acoustical Society of America, 134(5_Supplement), 4159–4159. https://doi.org/10.1121/1.4831244

Mitchell, A. (2014).

Morenoff, J. D., House, J. S., Hansen, B. B., Williams, D. R., Kaplan, G. A., & Hunte, H. E. (2007). Social Science & Medicine, 65(9), 1853–1866. https://doi.org/10.1016/j.socscimed.2007.05.038

Multi-Resolution Land Characteristics (MRLC) Consortium. (2025). https://www.mrlc.gov/data.

NASA. (2025). In Your Gateway to NASA Earth Observation Data. https://www.earthdata.nasa.gov/; Earth Science Data Systems, NASA.

National Conference of State Legislatures. (2025). In National Conference of State Legislatures. https://www.ncsl.org/health/state-medical-cannabis-laws.

NHGIS Data Finder. (2025). https://data2.nhgis.org/.

Pacula, R. L., Hunt, P., & Boustead, A. (2014). Journal of Drug Policy Analysis, 7(1), 1–19. https://doi.org/10.1515/jdpa-2014-0001

Pacula, R. L., Powell, D., Heaton, P., & Sevigny, E. L. (2015). Journal of Policy Analysis and Management, 34(1), 7–31. https://doi.org/10.1002/pam.21804

Powell, D., Pacula, R. L., & Jacobson, M. (2018). Journal of Health Economics, 58, 29–42. https://doi.org/10.1016/j.jhealeco.2017.12.007

Project, M. P. (2025). In Marijuana Policy Project. https://www.mpp.org/states/.

Reardon, S., Andrew, H., Shear, B., Fahle, E., Kalogrides, D., & Saliba, J. (2025). Stanford Digital Repository. https://doi.org/10.25740/CS829JN7849

Requia, W. J., Di, Q., Silvern, R., Kelly, J. T., Koutrakis, P., Mickley, L. J., Sulprizio, M. P., Amini, H., Shi, L., & Schwartz, J. (2020). Environmental Science & Technology, 54(18), 11037–11047. https://doi.org/10.1021/acs.est.0c01791

Schertz, K., Kardan, O., Rosenberg, M., & Berman, M. (2022). https://doi.org/10.17605/OSF.IO/HXMWJ

Spangler, K. R., Weinberger, K. R., & Wellenius, G. A. (2019). Journal of Exposure Science & Environmental Epidemiology, 29(6), 777–789. https://doi.org/10.1038/s41370-018-0105-2

Theodos, B., Meixell, B., & Hedman, C. (2018). Urban Institute.

United States Department of Justice, Office of Justice Programs, & Federal Bureau of Investigation. (2014). Inter-university Consortium for Political and Social Research. https://doi.org/10.3886/ICPSR33523.v2

US Census Bureau Center for Economic Studies. (2025). https://lehd.ces.census.gov/data/.

Vox. (2016).

When Are Prescribers Required to Use Prescription Drug Monitoring Programs? (2018). In PEW. http://pew.org/2Djqkky.

Xu, J., Liu, X., Li, Q., Goldblatt, R., Qin, W., Liu, F., Chu, C., Luo, Q., Ing, A., Guo, L., Liu, N., Liu, H., Huang, C., Cheng, J., Wang, M., Geng, Z., Zhu, W., Zhang, B., Liao, W., … IMAGEN Consortia. (2021). Nature Human Behaviour, 6(2), 279–293. https://doi.org/10.1038/s41562-021-01204-7