Your browser doesn't support javascript.
New Frontiers of Scientific Text Mining: Tasks, Data, and Tools
28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, KDD 2022 ; : 4832-4833, 2022.
Article in English | Scopus | ID: covidwho-2020400
ABSTRACT
Exploring the vast amount of rapidly growing scientific text data is highly beneficial for real-world scientific discovery. However, scientific text mining is particularly challenging due to the lack of specialized domain knowledge in natural language context, complex sentence structures in scientific writing, and multi-modal representations of scientific knowledge. This tutorial presents a comprehensive overview of recent research and development on scientific text mining, focusing on the biomedical and chemistry domains. First, we introduce the motivation and unique challenges of scientific text mining. Then we discuss a set of methods that perform effective scientific information extraction, such as named entity recognition, relation extraction, and event extraction. We also introduce real-world applications such as textual evidence retrieval, scientific topic contrasting for drug discovery, and molecule representation learning for reaction prediction. Finally, we conclude our tutorial by demonstrating, on real-world datasets (COVID-19 and organic chemistry literature), how the information can be extracted and retrieved, and how they can assist further scientific discovery. We also discuss the emerging research problems and future directions for scientific text mining. © 2022 Owner/Author.
Keywords

Full text: Available Collection: Databases of international organizations Database: Scopus Language: English Journal: 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, KDD 2022 Year: 2022 Document Type: Article

Similar

MEDLINE

...
LILACS

LIS


Full text: Available Collection: Databases of international organizations Database: Scopus Language: English Journal: 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, KDD 2022 Year: 2022 Document Type: Article