Search | VHL Regional Portal

Transforming Estonian health data to the Observational Medical Outcomes Partnership (OMOP) Common Data Model: lessons learned.

Oja, Marek; Tamm, Sirli; Mooses, Kerli; Pajusalu, Maarja; Talvik, Harry-Anton; Ott, Anne; Laht, Marianna; Malk, Maria; Lõo, Marcus; Holm, Johannes; Haug, Markus; Suvalov, Hendrik; Särg, Dage; Vilo, Jaak; Laur, Sven; Kolde, Raivo; Reisberg, Sulev.

JAMIA Open ; 6(4): ooad100, 2023 Dec.

Article in English | MEDLINE | ID: mdl-38058679

ABSTRACT

Objective: To describe the reusable transformation process of electronic health records (EHR), claims, and prescriptions data into Observational Medical Outcome Partnership (OMOP) Common Data Model (CDM), together with challenges faced and solutions implemented. Materials and Methods: We used Estonian national health databases that store almost all residents' claims, prescriptions, and EHR records. To develop and demonstrate the transformation process of Estonian health data to OMOP CDM, we used a 10% random sample of the Estonian population (n = 150 824 patients) from 2012 to 2019 (MAITT dataset). For the sample, complete information from all 3 databases was converted to OMOP CDM version 5.3. The validation was performed using open-source tools. Results: In total, we transformed over 100 million entries to standard concepts using standard OMOP vocabularies with the average mapping rate 95%. For conditions, observations, drugs, and measurements, the mapping rate was over 90%. In most cases, SNOMED Clinical Terms were used as the target vocabulary. Discussion: During the transformation process, we encountered several challenges, which are described in detail with concrete examples and solutions. Conclusion: For a representative 10% random sample, we successfully transferred complete records from 3 national health databases to OMOP CDM and created a reusable transformation process. Our work helps future researchers to transform linked databases into OMOP CDM more efficiently, ultimately leading to better real-world evidence.

Information Extraction from Medical Texts with BERT Using Human-in-the-Loop Labeling.

Suvalov, Hendrik; Laur, Sven; Kolde, Raivo.

Stud Health Technol Inform ; 302: 831-832, 2023 May 18.

Article in English | MEDLINE | ID: mdl-37203510

ABSTRACT

Neural network language models, such as BERT, can be used for information extraction from medical texts with unstructured free text. These models can be pre-trained on a large corpus to learn the language and characteristics of the relevant domain and then fine-tuned with labeled data for a specific task. We propose a pipeline using human-in-the-loop labeling to create annotated data for Estonian healthcare information extraction. This method is particularly useful for low-resource languages and is more accessible to those in the medical field than rule-based methods like regular expressions.

Subject(s)

Information Storage and Retrieval , Natural Language Processing , Humans , Neural Networks, Computer , Language , Health Facilities

ABSTRACT

ABSTRACT

Subject(s)

SEND TO:

SELECTION OF CITATIONS

SEARCH DETAIL