Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 1 de 1
Filtrar
Mais filtros










Base de dados
Intervalo de ano de publicação
1.
Health Informatics J ; 22(3): 523-35, 2016 09.
Artigo em Inglês | MEDLINE | ID: mdl-25759063

RESUMO

This article examines methods for automated question classification applied to cancer-related questions that people have asked on the web. This work is part of a broader effort to provide automated question answering for health education. We created a new corpus of consumer-health questions related to cancer and a new taxonomy for those questions. We then compared the effectiveness of different statistical methods for developing classifiers, including weighted classification and resampling. Basic methods for building classifiers were limited by the high variability in the natural distribution of questions and typical refinement approaches of feature selection and merging categories achieved only small improvements to classifier accuracy. Best performance was achieved using weighted classification and resampling methods, the latter yielding an accuracy of F1 = 0.963. Thus, it would appear that statistical classifiers can be trained on natural data, but only if natural distributions of classes are smoothed. Such classifiers would be useful for automated question answering, for enriching web-based content, or assisting clinical professionals to answer questions.


Assuntos
Algoritmos , Armazenamento e Recuperação da Informação/classificação , Neoplasias , Bases de Dados Factuais , Educação em Saúde , Humanos , Disseminação de Informação/métodos , Internet
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...