Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 2 de 2
Filter
Add more filters










Database
Language
Publication year range
1.
J Chem Inf Model ; 50(2): 251-61, 2010 Feb 22.
Article in English | MEDLINE | ID: mdl-20088574

ABSTRACT

The SPECTRa-T project has developed text-mining tools to extract named chemical entities (NCEs), such as chemical names and terms, and chemical objects (COs), e.g., experimental spectral assignments and physical chemistry properties, from electronic theses (e-theses). Although NCEs were readily identified within the two major document formats studied, only the use of structured documents enabled identification of chemical objects and their association with the relevant chemical entity (e.g., systematic chemical name). A corpus of theses was analyzed and it is shown that a high degree of semantic information can be extracted from structured documents. This integrated information has been deposited in a persistent Resource Description Framework (RDF) triple-store that allows users to conduct semantic searches. The strength and weaknesses of several document formats are reviewed.


Subject(s)
Academic Dissertations as Topic , Chemistry/education , Data Mining/methods , Software , Databases, Factual , Electronic Data Processing , False Positive Reactions
2.
J Chem Inf Model ; 48(8): 1571-81, 2008 Aug.
Article in English | MEDLINE | ID: mdl-18661966

ABSTRACT

The SPECTRa (Submission, Preservation and Exposure of Chemistry Teaching and Research Data) project has investigated the practices of chemists in archiving and disseminating primary chemical data from academic research laboratories. To redress the loss of the large amount of data never archived or disseminated, we have developed software for data publication into departmental and institutional Open Access digital repositories (DSpace). Data adhering to standard formats in selected disciplines (crystallography, NMR, computational chemistry) is transformed to XML (CML, Chemical Markup Language) which provides added validation. Context-specific chemical metadata and persistent Handle identifiers are added to enable long-term data reuse. It was found essential to provide an embargo mechanism, and policies for operating this and other processes are presented.


Subject(s)
Combinatorial Chemistry Techniques , Crystallography, X-Ray , Models, Molecular , Software
SELECTION OF CITATIONS
SEARCH DETAIL
...