Modernizing Data Systems in Environmental Public Health: A Blueprint for Action
Metadata Equally important as the data itself is the presence of metadata, the descrip - tive information that explains how the data was collected, its source, limitations, structure, and update frequency. Metadata allow analysts to understand the context and make informed decisions about appropriate uses of the data. Com- prehensive metadata support data transparency, accuracy, interoperability, and reusability, enabling EPH professionals to use and share data confidently. Metadata are often described as “data about data.” It provides critical details about the origin, content, format, and evolution of a dataset. Well-documented metadata helps users to understand what the data represents, how the data was created, its current relevance, and how the data should or should not be used. Different types of metadata serve various purposes: Descriptive metadata identify and summarize the dataset, which includes el - ements such as the dataset title, a summary, and relevant tags that help users understand the general content and context. Structural metadata explain how the data are organized. This includes details such as field names, data types, file formats, and the overall layout of the dataset, making it easier to navigate and interpret. Administrative metadata provide information about the creation, ownership, and access rights of the dataset. Examples include the author’s name, version history, and any applicable licenses or usage permissions. Provenance metadata document the origin and history of the dataset. It de- scribes how the data were collected, processed, or transformed, such as data cleaning steps or quality control checks that were performed. Statistical metadata outline the methods used during data collection or anal - ysis, which might involve sampling strategies, aggregation techniques, or any statistical procedures that influence how the data can be interpreted.
Table 6 provides a simple example of metadata for an EPH dataset.
Table 6
METADATA FIELD EXAMPLE Dataset Title
2022 Food Safety Inspections: County A
Description
All restaurant inspection results in County A for 2022
Collected By
County A Environmental Health
Date Range
January 1, 2022–December 31, 2022
Last Updated
February 15, 2023
Format
CSV
Variables
Facility_ID, Inspection_Score, Violation_Code, Date
Confidential?
No (public dataset)
9
Powered by FlippingBook