Data Dictionary/Codebook

Best Practice

Document Number: BP_1240
Date Last Updated: 4/2/2020

General Description

Category: Data Management Planning

Purpose of this best practice:

A data dictionary defines every data item (variable) collected for a study. It is an essential part of successful data management and should be updated whenever a variable is added, modified, or deleted. Changes to any aspect of a data field should be recorded, including the field name, data type, menu definition (when appropriate), the wording of the question, and validation requirements.
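
As an illustration only, the sketch below shows one way a data dictionary entry could be represented, and how two versions of an entry could be compared to see what changed. The class name `Variable`, its attribute names, the helper `changed_attributes`, and the example `age` variable are all hypothetical and are not prescribed by this best practice; a real study would use whatever structure its data capture system exports.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class Variable:
    """One data dictionary entry. All attribute names are illustrative."""
    name: str                                     # field name as stored in the database
    label: str                                    # wording of the question
    data_type: str                                # e.g. "integer", "text", "date"
    choices: dict = field(default_factory=dict)   # menu definition: code -> label
    minimum: Optional[float] = None               # validation: lowest allowed value
    maximum: Optional[float] = None               # validation: highest allowed value


def changed_attributes(old: Variable, new: Variable) -> list:
    """Return the names of attributes that differ between two versions of an entry."""
    return [k for k in vars(old) if getattr(old, k) != getattr(new, k)]


# Hypothetical entry and a later revision that widens the allowed range.
age_v1 = Variable("age", "Age at enrollment (years)", "integer", minimum=18, maximum=90)
age_v2 = Variable("age", "Age at enrollment (years)", "integer", minimum=18, maximum=110)

print(changed_attributes(age_v1, age_v2))  # ['maximum']
```

Comparing versions in this way is one possible aid for keeping the dictionary current: any attribute reported as changed is a prompt to record the revision in the data dictionary.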
Categories
  • Best Practice
  • Data Management
  • Data Management Planning
  • Data Sharing
Recommendations and Considerations
Resources and Examples

Topic Experts:

Johns Hopkins School of Public Health Biostatistics Center
ICTR
Oncology Clinical Research Office
Responsibilities

Position: Principal Investigator
Responsibility: The Principal Investigator (PI) should participate in developing the database. At a minimum, the PI should review the final data dictionary before the database goes live.

Position: Data Manager, Study Coordinator
Responsibility: The Data Manager and Study Coordinator are responsible for generating the data dictionary and updating it as needed.

Position: Data Analyst / Biostatistician
Responsibility: Because the data dictionary will be a primary reference during data analysis, the Data Analyst / Biostatistician should review it to ensure that variables are defined appropriately for analysis.