Your browser doesn't support javascript.
loading
Automated classification of ICD-O-3 morphology code from pathology reports using text-mining and support vector machine / 预防医学
Journal of Preventive Medicine ; (12): 255-258, 2021.
Article in Chinese | WPRIM | ID: wpr-876539
ABSTRACT
Objective@#To evaluate the accuracy of automated classification of ICD-O-3 morphology code from pathology reports by text-mining and support vector machine ( SVM ) , in order to provide basis for automated tumor coding in Chinese. @*Methods@#The tumor report cards of Zhejiang residents from 2017 to 2019 were collected from Chronic Disease Surveillance Information Management System of Zhejiang Province. According to ICD-O-3, the keywords of the pathology reports were extracted, and SVM was used for automatic classification. The classification results were compared with those of 16 professionals with more than two years of experience in tumor coding, and the accuracy rate, recall rate and F-score were calculated for effect evaluation. @*Results@#Totally 83 082 cases from 2017 to 2019 were included and were categorized into 17 morphological classifications, with 52 877 ( 63.65% ) cases of adenocarcinoma, squamous carcinoma and transitional cell carcinoma. A total of 1 090 keywords were enrolled into main corpus. The total F-score, accuracy rate and recall rate are 85.69, 77.20% and 96.27%, respectively. @*Conclusion@#Text-mining combined with SVM can improve the efficiency of ICD-O-3 morphology coding; however, the accuracy needs to be further improved.

Full text: Available Index: WPRIM (Western Pacific) Language: Chinese Journal: Journal of Preventive Medicine Year: 2021 Type: Article

Similar

MEDLINE

...
LILACS

LIS

Full text: Available Index: WPRIM (Western Pacific) Language: Chinese Journal: Journal of Preventive Medicine Year: 2021 Type: Article