Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 3 de 3
Filter
Add more filters










Database
Language
Publication year range
1.
ACS Synth Biol ; 11(6): 2043-2054, 2022 06 17.
Article in English | MEDLINE | ID: mdl-35671034

ABSTRACT

Scientific articles contain a wealth of information about experimental methods and results describing biological designs. Due to its unstructured nature and multiple sources of ambiguity and variability, extracting this information from text is a difficult task. In this paper, we describe the development of the synthetic biology knowledge system (SBKS) text processing pipeline. The pipeline uses natural language processing techniques to extract and correlate information from the literature for synthetic biology researchers. Specifically, we apply named entity recognition, relation extraction, concept grounding, and topic modeling to extract information from published literature to link articles to elements within our knowledge system. Our results show the efficacy of each of the components on synthetic biology literature and provide future directions for further advancement of the pipeline.


Subject(s)
Data Mining , Synthetic Biology , Data Mining/methods , Natural Language Processing
2.
ACS Synth Biol ; 10(9): 2276-2285, 2021 09 17.
Article in English | MEDLINE | ID: mdl-34387462

ABSTRACT

The Synthetic Biology Knowledge System (SBKS) is an instance of the SynBioHub repository that includes text and data information that has been mined from papers published in ACS Synthetic Biology. This paper describes the SBKS curation framework that is being developed to construct the knowledge stored in this repository. The text mining pipeline performs automatic annotation of the articles using natural language processing techniques to identify salient content such as key terms, relationships between terms, and main topics. The data mining pipeline performs automatic annotation of the sequences extracted from the supplemental documents with the genetic parts used in them. Together these two pipelines link genetic parts to papers describing the context in which they are used. Ultimately, SBKS will reduce the time necessary for synthetic biologists to find the information necessary to complete their designs.


Subject(s)
Synthetic Biology , User-Computer Interface , Animals , Cell Line , Data Mining , Humans
3.
Philos Trans A Math Phys Eng Sci ; 369(1949): 3300-17, 2011 Aug 28.
Article in English | MEDLINE | ID: mdl-21768141

ABSTRACT

The growing quantity of digital recorded music available in large-scale resources such as the Internet archive provides an important new resource for musical analysis. An e-Research approach has been adopted in order to create a very substantive web-accessible corpus of musical analyses in a common framework for use by music scholars, students and beyond, and to establish a methodology and tooling that will enable others to add to the resource in the future. The enabling infrastructure brings together scientific workflow and Semantic Web technologies with a set of algorithms and tools for extracting features from recorded music. It has been used to deliver a prototype system, described here, that demonstrates the utility of LINKED DATA for enhancing the curation of collections of music signal data for analysis and publishing results that can be simply and readily correlated to these and other sources. This paper describes the motivation, infrastructure design and the proof-of-concept case study and reflects on emerging e-Research practice as researchers embrace the scale of the Web.

SELECTION OF CITATIONS
SEARCH DETAIL
...