Data Dictionary

Purpose

Define the variables, types, and expected values for the datasets used in this project.

Instructions

The structured definitions for this project’s variables can be found in data/codebook.csv.

Contextual Notes

  • deprivation_quintile: In this synthetic dataset, the quintiles were generated uniformly. In a real-world scenario, this might be derived from a patient’s postcode linked to a national deprivation index.
  • previous_missed: This is a count variable simulated using a Poisson distribution.

NoteAI Capability Checkpoint

Awareness & Orientation: This data dictionary format was populated with human understanding of common health services data structures. No AI was used to derive the definitions.