Insights from Adopting a Data Commons Approach for Large-Scale Observational Cohort Studies: The California Teachers Study
Supporting Files
-
4 2020
-
File Language:
English
Details
-
Alternative Title:Cancer Epidemiol Biomarkers Prev
-
Personal Author:
-
Description:Background:
Large-scale cancer epidemiology cohorts (CECs) have successfully collected, analyzed, and shared patient-reported data for years. CECs increasingly need to make their data more Findable, Accessible, Interoperable, and Reusable, or FAIR. How CECs should approach this transformation is unclear.
Methods:
The California Teachers Study (CTS) is an observational CEC of 133,477 participants followed since 1995–1996. In 2014, we began updating our data storage, management, analysis, and sharing strategy. With the San Diego Supercomputer Center, we deployed a new infrastructure based on a Data Warehouse, to integrate and manage data; and a secure and shared workspace with documentation, software, and analytic tools that facilitate collaboration and accelerate analyses.
Results:
Our new CTS infrastructure includes a Data Warehouse and data marts, which are focused subsets from the Data Warehouse designed for efficiency. The secure CTS workspace utilizes a Remote Desktop service that operates within a HIPAA and FISMA compliant platform. Our infrastructure offers broad access to CTS data; includes statistical analysis and data visualization software and tools; flexibly manages other key data activities (e.g., cleaning, updates, & data sharing); and will continue to evolve to advance FAIR principles.
Conclusion:
Our scalable infrastructure provides the security, authorization, data model, metadata, and analytic tools needed to manage, share, and analyze CTS data in ways that are consistent with the NCI’s Cancer Research Data Commons Framework.
Impact:
The CTS’s implementation of new infrastructure in an ongoing CEC demonstrates how population sciences can explore and embrace new cloud-based and analytics infrastructure to accelerate cancer research and translation.
-
Subjects:
-
Source:Cancer Epidemiol Biomarkers Prev. 29(4):777-786
-
Pubmed ID:32051191
-
Pubmed Central ID:PMC9005205
-
Document Type:
-
Funding:HHSN261201800032C/CA/NCI NIH HHSUnited States/ ; HHSN261201800009C/CA/NCI NIH HHSUnited States/ ; NU58DP006344/DP/NCCDPHP CDC HHSUnited States/ ; HHSN261201800015I/CA/NCI NIH HHSUnited States/ ; HHSN261201800032I/CA/NCI NIH HHSUnited States/ ; P30 CA033572/CA/NCI NIH HHSUnited States/ ; HHSN261201800015C/CA/NCI NIH HHSUnited States/ ; HHSN261201800009I/CA/NCI NIH HHSUnited States/ ; U01 CA199277/CA/NCI NIH HHSUnited States/ ; UM1 CA164917/CA/NCI NIH HHSUnited States/ ; P30 CA023100/CA/NCI NIH HHSUnited States/ ; R01 CA077398/CA/NCI NIH HHSUnited States/
-
Place as Subject:
-
Volume:29
-
Issue:4
-
Collection(s):
-
Main Document Checksum:urn:sha256:c526ddc132e2300399043e11b27a891294845d3915ce35cf7d08bee3d052f319
-
Download URL:
-
File Type:
Supporting Files
File Language:
English
ON THIS PAGE
CDC STACKS serves as an archival repository of CDC-published products including
scientific findings,
journal articles, guidelines, recommendations, or other public health information authored or
co-authored by CDC or funded partners.
As a repository, CDC STACKS retains documents in their original published format to ensure public access to scientific information.
As a repository, CDC STACKS retains documents in their original published format to ensure public access to scientific information.
You May Also Like
COLLECTION
CDC Public Access