Your browser doesn't support javascript.
loading
Harmonizing Norwegian registries onto OMOP common data model: Mapping challenges and opportunities for pregnancy and COVID-19 research.
Trinh, Nhung Th; Houghtaling, Jared; Bernal, Fabian Lm; Hayati, Saeed; Maglanoc, Luigi A; Lupattelli, Angela; Halvorsen, Lars; Nordeng, Hedvig Me.
Afiliación
  • Trinh NT; PharmacoEpidemiology and Drug Safety Research Group, Department of Pharmacy, Faculty of Mathematics and Natural Sciences, University of Oslo, Oslo, Norway. Electronic address: nhung.trinh@farmasi.uio.no.
  • Houghtaling J; EdenceHealth NV, Belgium.
  • Bernal FL; Division for Research, Dissemination and Education, IT-department, University of Oslo, Oslo, Norway.
  • Hayati S; PharmacoEpidemiology and Drug Safety Research Group, Department of Pharmacy, Faculty of Mathematics and Natural Sciences, University of Oslo, Oslo, Norway.
  • Maglanoc LA; Division for Research, Dissemination and Education, IT-department, University of Oslo, Oslo, Norway.
  • Lupattelli A; PharmacoEpidemiology and Drug Safety Research Group, Department of Pharmacy, Faculty of Mathematics and Natural Sciences, University of Oslo, Oslo, Norway.
  • Halvorsen L; EdenceHealth NV, Belgium.
  • Nordeng HM; PharmacoEpidemiology and Drug Safety Research Group, Department of Pharmacy, Faculty of Mathematics and Natural Sciences, University of Oslo, Oslo, Norway; Department of Child Health and Development, Norwegian Institute of Public Health, Oslo, Norway.
Int J Med Inform ; 191: 105602, 2024 Aug 14.
Article en En | MEDLINE | ID: mdl-39153282
ABSTRACT

OBJECTIVE:

Norwegian health registries covering entire population are used for administration, research, and emergency preparedness. We harmonized these data onto the Observational Medical Outcomes Partnership common data model (OMOP CDM) and enrich real-world data in OMOP format with COVID-19 related data.

METHODS:

Data from six registries (2018-2021) covering birth registrations, selected primary and secondary care events, vaccinations, and communicable disease notifications were mapped onto the OMOP CDM v5.3. An Extract-Transform-Load (ETL) pipeline was developed on simulated data using data characterization documents and scanning tools. We ran dashboard quality checks, cohort generations, investigated differences between source and mapped data, and refined the ETL accordingly.

RESULTS:

We mapped 1.5 billion rows of data of 5,673,845 individuals. Among these, there were 804,277 pregnancies, 483,585 mothers together with 792,477 children, and 472,948 fathers. We identified 382,516 positive tests for COVID-19 in 380,794 patients. These figures are consistent with results from source data. In addition to 11 million source codes mapped automatically, we mapped 237 non-standard codes to standard concepts and introduced 38 custom concepts to accommodate pregnancy-related terminologies that were not supported by OMOP CDM vocabularies. A total of 3,700/3,705 (99.8%) checks passed. The 5 failed checks could be explained by the nature of the data and only represent a small number of records. DISCUSSION AND

CONCLUSION:

Norwegian registry data were successfully harmonized onto OMOP CDM with high level of concordance and provides valuable source for federated COVID-19 related research. Our mapping experience is highly valuable for data partners with Nordic health registries.
Palabras clave

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Idioma: En Revista: Int J Med Inform Asunto de la revista: INFORMATICA MEDICA Año: 2024 Tipo del documento: Article

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Idioma: En Revista: Int J Med Inform Asunto de la revista: INFORMATICA MEDICA Año: 2024 Tipo del documento: Article