Evaluating data quality for blended data using a data quality framework
-
March 15, 2024
-
Source: Stat J IAOS. 40(1):125-136
Details:
-
Alternative Title: Stat J IAOS
-
Description: In 2020 the U.S. Federal Committee on Statistical Methodology (FCSM) released "A Framework for Data Quality", organized around 11 dimensions of data quality grouped into three domains of quality (utility, objectivity, integrity). This paper addresses the use of the FCSM Framework for data quality assessments of blended data. The FCSM Framework applies to all types of data; however, best practices for implementation have not been documented. We applied the FCSM Framework to three health-research-related case studies. For each case study, assessments of data quality dimensions were performed to identify threats to quality, possible mitigations of those threats, and trade-offs among them. From these assessments the authors concluded: 1) data quality assessments are more complex in practice than anticipated, and expert guidance and documentation are important; 2) the dimensions may not be equally important for different data uses; 3) data quality assessments can be subjective, and a quantitative tool could help explain the results, although quantitative assessments may be closely tied to the intended use of the dataset; 4) there are common trade-offs and mitigations for some threats to quality among dimensions. This paper is one of the first to apply the FCSM Framework to specific use cases and illustrates a process for similar data uses.
-
PubMed ID: 38800620
-
PubMed Central ID: PMC11117461
-
Supporting Files: No Additional Files