Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 1 de 1
Filter
Add more filters










Database
Language
Publication year range
1.
Stud Health Technol Inform ; 178: 250-6, 2012.
Article in English | MEDLINE | ID: mdl-22797049

ABSTRACT

OBJECTIVE: To evaluate the effects of Optical Character Recognition (OCR) on the automatic cancer classification of pathology reports. METHOD: Scanned images of pathology reports were converted to electronic free-text using a commercial OCR system. A state-of-the-art cancer classification system, the Medical Text Extraction (MEDTEX) system, was used to automatically classify the OCR reports. Classifications produced by MEDTEX on the OCR versions of the reports were compared with the classification from a human amended version of the OCR reports. RESULTS: The employed OCR system was found to recognise scanned pathology reports with up to 99.12% character accuracy and up to 98.95% word accuracy. Errors in the OCR processing were found to minimally impact on the automatic classification of scanned pathology reports into notifiable groups. However, the impact of OCR errors is not negligible when considering the extraction of cancer notification items, such as primary site, histological type, etc. CONCLUSIONS: The automatic cancer classification system used in this work, MEDTEX, has proven to be robust to errors produced by the acquisition of freetext pathology reports from scanned images through OCR software. However, issues emerge when considering the extraction of cancer notification items.


Subject(s)
Copying Processes/standards , Medical Records , Neoplasms/pathology , Pathology, Clinical , Pathology/classification , Automation , Humans , Natural Language Processing
SELECTION OF CITATIONS
SEARCH DETAIL
...