Int J Med Inform; 168: 104880, 2022 Dec.
Article in English | MEDLINE | ID: mdl-36272315

ABSTRACT

BACKGROUND: Electronic medical records (EMRs) contain valuable information for clinical research, but the presence of personally identifying information (PII) restricts their use. Anonymising PII in EMRs enables clinical information to be shared for research purposes. Because there is limited research on the anonymisation of Australian EMRs, the performance of Microsoft Presidio, with customisation, was evaluated on clinical documents from an Australian radiation oncology information system (OIS).

METHODS: A random sample of 300 unstructured free-text clinical documents was extracted from the Prince of Wales Cancer Centre OIS for patients diagnosed with cancer of the head and neck between 2000 and 2017. Clinical text was anonymised using Microsoft Presidio, implemented in Python. Each clinical document was manually compared before and after anonymisation to check the identification and redaction of 13 types of PII. Each PII instance was classified as correct, partial, or missed, and these classifications were used to calculate recall, precision, and F1-score. The metrics were calculated under relaxed conditions, where partial classifications were counted as correct, and under strict conditions, where only correct classifications were counted as correct.

RESULTS: A total of 8,713 PII instances were identified, of which 7,026 (81%) were classified as correct, 850 (10%) as partial, and 837 (9%) as missed. There were 245 incorrect classifications. Evaluation of the model demonstrated an average precision of 0.8921, recall (strict) of 0.8064, F1-score (strict) of 0.8471, recall (relaxed) of 0.9039, and F1-score (relaxed) of 0.8980.

CONCLUSION: This is the first example of an open-source anonymisation model to be customised and tested on clinical documents from an Australian radiation oncology EMR. These findings support the use of Presidio for the safe use and sharing of cancer data within Australia for certain PII; however, additional checks are required to ensure that person names are successfully anonymised.
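
The strict and relaxed figures in the RESULTS can be checked against the reported counts. The following is a minimal sketch, not the authors' evaluation code: it assumes recall is the number of PII instances counted as correct divided by all PII instances identified, and that F1 is the harmonic mean of precision and recall. The abstract does not say how the average precision was computed, so the reported value (0.8921) is taken as an input rather than derived. The function and variable names are illustrative.

```python
def recall(correct: int, partial: int, missed: int, relaxed: bool = False) -> float:
    """Recall over all PII instances.

    Strict: only fully correct redactions count as hits.
    Relaxed: partial redactions also count as hits.
    """
    hits = correct + partial if relaxed else correct
    return hits / (correct + partial + missed)


def f1_score(precision: float, recall_value: float) -> float:
    """Harmonic mean of precision and recall."""
    return 2 * precision * recall_value / (precision + recall_value)


# Counts reported in the abstract.
CORRECT, PARTIAL, MISSED = 7026, 850, 837

# Assumption: precision is taken as reported, since the abstract
# does not describe how it was averaged.
REPORTED_PRECISION = 0.8921

strict_recall = recall(CORRECT, PARTIAL, MISSED, relaxed=False)
relaxed_recall = recall(CORRECT, PARTIAL, MISSED, relaxed=True)
strict_f1 = f1_score(REPORTED_PRECISION, strict_recall)
relaxed_f1 = f1_score(REPORTED_PRECISION, relaxed_recall)

print(f"recall (strict):  {strict_recall:.4f}")
print(f"recall (relaxed): {relaxed_recall:.4f}")
print(f"F1 (strict):      {strict_f1:.4f}")
print(f"F1 (relaxed):     {relaxed_f1:.4f}")
```

Under these assumptions the four computed values agree with the reported recall and F1 figures to four decimal places.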


Subject(s)
Electronic Health Records, Radiation Oncology, Humans, Australia, Natural Language Processing