Your browser doesn't support javascript.
Deep Mining Covid-19 Literature
5th International Conference on Applied Informatics, ICAI 2022 ; 1643 CCIS:121-133, 2022.
Article in English | Scopus | ID: covidwho-2148607
ABSTRACT
In this paper we investigate how scientific and medical papers about Covid-19 can be effectively mined. For this purpose we use the CORD19 dataset which is a huge collection of all papers published about and around the SARS-CoV2 virus and the pandemic it caused. We discuss how classical text mining algorithms like Latent Semantic Analysis (LSA) or its modern version Latent Drichlet Allocation (LDA) can be used for this purpose and also touch more modern variant of these algorithms like word2vec which came with deep learning wave and show their advantages and disadvantages each. We finish the paper with showing some topic examples from the corpus and answer questions such as which topics are the most prominent for the corpus or how many percentage of the corpus is dedicated to them. We also give a discussion of how topics around RNA research in connection with Covid-19 can be examined. © 2022, The Author(s), under exclusive license to Springer Nature Switzerland AG.
Keywords

Full text: Available Collection: Databases of international organizations Database: Scopus Language: English Journal: 5th International Conference on Applied Informatics, ICAI 2022 Year: 2022 Document Type: Article

Similar

MEDLINE

...
LILACS

LIS


Full text: Available Collection: Databases of international organizations Database: Scopus Language: English Journal: 5th International Conference on Applied Informatics, ICAI 2022 Year: 2022 Document Type: Article