Comput Math Methods Med ; 2014: 781807, 2014.
Article in English | MEDLINE | ID: mdl-24587817

ABSTRACT

Multilabel classification is often hindered by incompletely labeled training datasets: for some items in such a dataset (or even for all of them), some labels may be omitted. In this case, we cannot know whether any given item is labeled fully and correctly. A classifier trained directly on an incompletely labeled dataset performs poorly. To overcome this problem, we added an extra step, training set modification, before training the classifier. In this paper, we try two algorithms for training set modification: weighted k-nearest neighbor (WkNN) and soft supervised learning (SoftSL). Both approaches are based on similarity measurements between data vectors. We performed the experiments on AgingPortfolio (a text dataset) and then rechecked them on Yeast (nontext genetic data). We tried support vector machine (SVM) and random forest (RF) classifiers on the original datasets and then on the modified ones. For each dataset, our experiments demonstrated that both classification algorithms performed considerably better when preceded by the training set modification step.
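
The training set modification step can be illustrated with a minimal sketch of similarity-weighted label completion. This is not the authors' WkNN implementation: the abstract does not state the similarity measure, the value of k, or the decision rule, so cosine similarity, k = 2, a 0.5 vote threshold, and the function names `cosine_similarity` and `wknn_complete_labels` are all assumptions chosen for illustration.

```python
import math


def cosine_similarity(u, v):
    """Cosine similarity between two equal-length numeric vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def wknn_complete_labels(X, Y, k=2, threshold=0.5):
    """Return a copy of Y with labels added by a similarity-weighted vote.

    X: list of feature vectors.
    Y: list of 0/1 label lists, where 0 may mean "omitted".
    Assumption: existing positive labels are kept; labels are only added.
    """
    n = len(X)
    Y_new = [row[:] for row in Y]
    for i in range(n):
        # The k most similar other items, highest similarity first.
        neighbors = sorted(
            ((cosine_similarity(X[i], X[j]), j) for j in range(n) if j != i),
            reverse=True,
        )[:k]
        total = sum(max(s, 0.0) for s, _ in neighbors)
        if total == 0.0:
            continue  # no usable neighbors, leave the item unchanged
        for label in range(len(Y[i])):
            # Weighted fraction of neighbors that carry this label.
            score = sum(max(s, 0.0) * Y[j][label] for s, j in neighbors) / total
            if score >= threshold:
                Y_new[i][label] = 1
    return Y_new


# Toy data: item 1 sits between items 0 and 2 but is missing the second label.
X = [[1.0, 0.0], [0.9, 0.1], [1.0, 0.1], [0.0, 1.0], [0.1, 1.0], [0.1, 0.9]]
Y = [[1, 1], [1, 0], [1, 1], [0, 1], [0, 1], [0, 1]]
Y_modified = wknn_complete_labels(X, Y, k=2)
print(Y_modified[1])  # [1, 1]
```

In this sketch the modified label matrix would then replace the original one when training the downstream classifier. Only adding labels, never removing them, is a design choice that matches the omitted-label setting described in the abstract, but it is an assumption of this example.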


Subjects
Computational Biology/methods, Genes, Fungal, Pattern Recognition, Automated/methods, Aging, Algorithms, Area Under Curve, Artificial Intelligence, Cluster Analysis, Databases, Factual, Europe, Fungal Proteins/metabolism, Humans, Reproducibility of Results, Software, Support Vector Machine, United States