Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 1 de 1
Filtrar
Añadir filtros








Intervalo de año
1.
Braz. arch. biol. technol ; 64: e21210163, 2021. tab, graf
Artículo en Inglés | LILACS-Express | LILACS | ID: biblio-1355796

RESUMEN

Abstract The Internet is chosen to be one among the primary source of biomedical information. To retrieve necessary biomedical information, the search engine needs an efficient, focused crawler mechanism. But the area of research concerned with the focused crawler for biomedical topics is notably scanty. However, the quantity, momentum, diversity, and quality of the available online biomedical information, challenges and calls for enhanced aid to crawl. This paper surmounts the challenges and proposes a new learning approach for focused web crawling adopting Attention Enhanced Siamese Long Short Term Memory (AE-SLSTM) Networks with peephole connections which predicts topical relevance of the web page. The proposed AE-SLSTM model accurately computes the semantic similarity between the topic and the web pages. The performance of the newly designed crawler is assessed using two well known metrics namely harvest rate ( h r a t e ) and irrelevance ratio ( p r a t e ). The presented crawler surpass the existing focused crawlers with an average h r a t e of 0.39 and an average p r a t e of 0.61 after crawling 5,000 web pages relating to biomedical topics. The results clearly depicts that the proposed methodology aids to download more relevant biomedical web pages related to the particular topic from the internet.

SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA