Metadata
NBDC data dictionary & levels table
Background
ABCD’s tabulated data resource is organized into tables, each of which contains a set of related variables (e.g., all items as well as summary scores and administrative variables for a given assessment instrument; scores for a given imaging measure for all regions of interest for a given brain atlas; etc.). The tables are accompanied with metadata that describe the variables and provide essential information for understanding the data and how to use them in analyses.
Data dictionary and levels table
The metadata for studies released on the NBDC datahub consists of two elements—the data dictionary and an accompanying levels table—which serve the following purpose.
- Data dictionary: Provides detailed information about the variables in the tabulated data resource, with one row per variable (
name
being the unique identifier) and one column per characteristic of the variable (e.g., variable label, data type, unit, etc.). - Levels table: Provides information about the levels of categorical variables in the tabulated data resource, with one row per level within a given variable (
name
andvalue
being the unique identifiers) and one column per characteristic (e.g., label, order, etc.).
The tables below provide information about the columns of the data dictionary and levels table1, with one row per column of the respective metadata table.
Additional information
Metadata exploration: Both DEAP and Lasso (for an overview of the data tools available in the NBDC datahub, see here) can be used to download the metainformation or explore it directly in the respective platform. The tooltipsfwarni in the tables above point out some differences between the two platforms with regard to how this information is presented.
BIDS JSON sidecar files: In addition to the complete data dictionary and levels tables that contain information for all tables of the tabulated data resource in tabular format, the metadata is also provided as BIDS JSON sidecar files in the rawdata/phenotype/
directory as part of the file-based data, with one JSON file per table. The tooltips in the tables above point out which BIDS JSON element corresponds to (or is derived from) which data dictionary column.
Mapping to previous releases: For the 6.0 data release, the complete ABCD tabulated data resource was renamed using a standardized naming scheme (for details, see here). To relate the new variable and table names to previously used names, e.g., to update existing analysis code, historic variable and table names for a given variable are provided as metadata in the data dictionary. The data dictionary columns providing this information are marked in red
above.
Data/responsible use warnings & documentation: This data documentation website includes warnings that provide critical context for interpreting the data, e.g., information about data quality issues and guidance on appropriate use of data (for details, see here). These warnings, which may apply at the variable or table level, are included directly in the data dictionary as hyperlinks to the documentation website2. Furthermore, the data dictionary provides hyperlinks to the documentation of data tables and, where applicable, to the documentation of summary score algorithms. The data dictionary columns providing this information are marked in blue
above.