Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 22
Filtrar
Mais filtros










Base de dados
Intervalo de ano de publicação
1.
SLAS Technol ; 29(3): 100135, 2024 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-38703999

RESUMO

Laboratory management automation is essential for achieving interoperability in the domain of experimental research and accelerating scientific discovery. The integration of resources and the sharing of knowledge across organisations enable scientific discoveries to be accelerated by increasing the productivity of laboratories, optimising funding efficiency, and addressing emerging global challenges. This paper presents a novel framework for digitalising and automating the administration of research laboratories through The World Avatar, an all-encompassing dynamic knowledge graph. This Digital Laboratory Framework serves as a flexible tool, enabling users to efficiently leverage data from diverse systems and formats without being confined to a specific software or protocol. Establishing dedicated ontologies and agents and combining them with technologies such as QR codes, RFID tags, and mobile apps, enabled us to develop modular applications that tackle some key challenges related to lab management. Here, we showcase an automated tracking and intervention system for explosive chemicals as well as an easy-to-use mobile application for asset management and information retrieval. Implementing these, we have achieved semantic linking of BIM and BMS data with laboratory inventory and chemical knowledge. Our approach can capture the crucial data points and reduce inventory processing time. All data provenance is recorded following the FAIR principles, ensuring its accessibility and interoperability.


Assuntos
Automação Laboratorial , Automação Laboratorial/métodos , Laboratórios , Armazenamento e Recuperação da Informação/métodos
2.
ACS Omega ; 9(12): 13883-13896, 2024 Mar 26.
Artigo em Inglês | MEDLINE | ID: mdl-38559914

RESUMO

In this study, we present a question answering (QA) system for chemistry, named Marie, with the use of a text-to-text pretrained language model to attain accurate data retrieval. The underlying data store is "The World Avatar" (TWA), a general world model consisting of a knowledge graph that evolves over time. TWA includes information about chemical species such as their chemical and physical properties, applications, and chemical classifications. Building upon our previous work on KGQA for chemistry, this advanced version of Marie leverages a fine-tuned Flan-T5 model to seamlessly translate natural language questions into SPARQL queries with no separate components for entity and relation linking. The developed QA system demonstrates competence in providing accurate results for complex queries that involve many relation hops as well as showcasing the ability to balance correctness and speed for real-world usage. This new approach offers significant advantages over the prior implementation that relied on knowledge graph embedding. Specifically, the updated system boasts high accuracy and great flexibility in accommodating changes and evolution of the data stored in the knowledge graph without necessitating retraining. Our evaluation results underscore the efficacy of the improved system, highlighting its superior accuracy and the ability in answering complex questions compared to its predecessor.

3.
Nat Commun ; 15(1): 462, 2024 Jan 23.
Artigo em Inglês | MEDLINE | ID: mdl-38263405

RESUMO

The ability to integrate resources and share knowledge across organisations empowers scientists to expedite the scientific discovery process. This is especially crucial in addressing emerging global challenges that require global solutions. In this work, we develop an architecture for distributed self-driving laboratories within The World Avatar project, which seeks to create an all-encompassing digital twin based on a dynamic knowledge graph. We employ ontologies to capture data and material flows in design-make-test-analyse cycles, utilising autonomous agents as executable knowledge components to carry out the experimentation workflow. Data provenance is recorded to ensure its findability, accessibility, interoperability, and reusability. We demonstrate the practical application of our framework by linking two robots in Cambridge and Singapore for a collaborative closed-loop optimisation for a pharmaceutically-relevant aldol condensation reaction in real-time. The knowledge graph autonomously evolves toward the scientist's research goals, with the two robots effectively generating a Pareto front for cost-yield optimisation in three days.

4.
J Chem Inf Model ; 63(21): 6569-6586, 2023 11 13.
Artigo em Inglês | MEDLINE | ID: mdl-37883649

RESUMO

Web ontologies are important tools in modern scientific research because they provide a standardized way to represent and manage web-scale amounts of complex data. In chemistry, a semantic database for chemical species is indispensable for its ability to interrelate and infer relationships, enabling a more precise analysis and prediction of chemical behavior. This paper presents OntoSpecies, a web ontology designed to represent chemical species and their properties. The ontology serves as a core component of The World Avatar knowledge graph chemistry domain and includes a wide range of identifiers, chemical and physical properties, chemical classifications and applications, and spectral information associated with each species. The ontology includes provenance and attribution metadata, ensuring the reliability and traceability of data. Most of the information about chemical species are sourced from PubChem and ChEBI data on the respective compound Web pages using a software agent, making OntoSpecies a comprehensive semantic database of chemical species able to solve novel types of problems in the field. Access to this reliable source of chemical data is provided through a SPARQL end point. The paper presents example use cases to demonstrate the contribution of OntoSpecies in solving complex tasks that require integrated semantically searchable chemical data. The approach presented in this paper represents a significant advancement in the field of chemical data management, offering a powerful tool for representing, navigating, and analyzing chemical information to support scientific research.


Assuntos
Descoberta do Conhecimento , Software , Reprodutibilidade dos Testes , Bases de Dados Factuais , Semântica
5.
ACS Omega ; 8(36): 33039-33057, 2023 Sep 12.
Artigo em Inglês | MEDLINE | ID: mdl-37720754

RESUMO

This paper presents a novel knowledge graph question answering (KGQA) system for chemistry, which is implemented on hybrid knowledge graph embeddings, aiming to provide fact-oriented information retrieval for chemistry-related research and industrial applications. Unlike other existing designs, the system operates on multiple embedding spaces, which use various embedding methods and queries the embedding spaces in parallel. With the answers returned from multiple embedding spaces, the system leverages a score alignment model to adjust the answer scores and rerank the answers. Further, the system implements an algorithm to derive implicit multihop relations to handle the complexities of deep ontologies and improve multihop question answering. The system also implements a BERT-based bidirectional entity-linking model to enhance the robustness and accuracy of the entity-linking module. The system uses a joint numerical embedding model to efficiently handle numerical filtering questions. Further, it can invoke semantic agents to perform dynamic calculations autonomously. Finally, the KGQA system handles numerous chemical reaction mechanisms using semantic parsing supported by a Linked Data Fragment server. This paper evaluates the accuracy of each module within the KGQA system with a chemistry question data set.

6.
ACS Omega ; 8(2): 2462-2475, 2023 Jan 17.
Artigo em Inglês | MEDLINE | ID: mdl-36687109

RESUMO

In this work, a new OntoPESScan ontology is developed for the semantic representation of one-dimensional potential energy surface (PES) scans, a central concept in computational chemistry. This ontology is developed in line with knowledge graph principles and The World Avatar (TWA) project. OntoPESScan is linked to other ontologies for chemistry in TWA, including OntoSpecies, which helps uniquely identify species along the PES and access their properties, and OntoCompChem, which allows the association of potential energy surfaces with quantum chemical calculations and the concepts used to derive them. A force-field fitting agent is also developed that makes use of the information in the OntoPESScan ontology to fit force fields to reactive surfaces of interest on the fly by making use of the empirical valence bond methodology. This agent is demonstrated to successfully parametrize two cases, namely, a PES scan on ethanol and a PES scan on a localized π-radical PAH hypothesized to play a role in soot formation during combustion. OntoPESScan is an extension to the capabilities of TWA and, in conjunction with potential further ontological support for molecular dynamics and reactions, will further progress toward an open, continuous, and self-growing knowledge graph for chemistry.

7.
Acc Chem Res ; 56(2): 128-139, 2023 01 17.
Artigo em Inglês | MEDLINE | ID: mdl-36516456

RESUMO

Passing knowledge from human to human is a natural process that has continued since the beginning of humankind. Over the past few decades, we have witnessed that knowledge is no longer passed only between humans but also from humans to machines. The latter form of knowledge transfer represents a cornerstone in artificial intelligence (AI) and lays the foundation for knowledge engineering (KE). In order to pass knowledge to machines, humans need to structure, formalize, and make knowledge machine-readable. Subsequently, humans also need to develop software that emulates their decision-making process. In order to engineer chemical knowledge, chemists are often required to challenge their understanding of chemistry and thinking processes, which may help improve the structure of chemical knowledge.Knowledge engineering in chemistry dates from the development of expert systems that emulated the thinking process of analytical and organic chemists. Since then, many different expert systems employing rather limited knowledge bases have been developed, solving problems in retrosynthesis, analytical chemistry, chemical risk assessment, etc. However, toward the end of the 20th century, the AI winters slowed down the development of expert systems for chemistry. At the same time, the increasing complexity of chemical research, alongside the limitations of the available computing tools, made it difficult for many chemistry expert systems to keep pace.In the past two decades, the semantic web, the popularization of object-oriented programming, and the increase in computational power have revitalized knowledge engineering. Knowledge formalization through ontologies has become commonplace, triggering the subsequent development of knowledge graphs and cognitive software agents. These tools enable the possibility of interoperability, enabling the representation of more complex systems, inference capabilities, and the synthesis of new knowledge.This Account introduces the history, the core principles of KE, and its applications within the broad realm of chemical research and engineering. In this regard, we first discuss how chemical knowledge is formalized and how a chemist's cognition can be emulated with the help of reasoning algorithms. Following this, we discuss various applications of knowledge graph and agent technology used to solve problems in chemistry related to molecular engineering, chemical mechanisms, multiscale modeling, automation of calculations and experiments, and chemist-machine interactions. These developments are discussed in the context of a universal and dynamic knowledge ecosystem, referred to as The World Avatar (TWA).


Assuntos
Inteligência Artificial , Sistemas Inteligentes , Humanos , Ecossistema , Algoritmos
8.
J Am Chem Soc ; 144(26): 11713-11728, 2022 Jul 06.
Artigo em Inglês | MEDLINE | ID: mdl-35731954

RESUMO

Metal-organic polyhedra (MOPs) are hybrid organic-inorganic nanomolecules, whose rational design depends on harmonious consideration of chemical complementarity and spatial compatibility between two or more types of chemical building units (CBUs). In this work, we apply knowledge engineering technology to automate the derivation of MOP formulations based on existing knowledge. For this purpose we have (i) curated relevant MOP and CBU data; (ii) developed an assembly model concept that embeds rules in the MOP construction; (iii) developed an OntoMOPs ontology that defines MOPs and their key properties; (iv) input agents that populate The World Avatar (TWA) knowledge graph; and (v) input agents that, using information from TWA, derive a list of new constructible MOPs. Our result provides rapid and automated instantiation of MOPs in TWA and unveils the immediate chemical space of known MOPs, thus shedding light on new MOP targets for future investigations.

9.
JACS Au ; 2(2): 292-309, 2022 Feb 28.
Artigo em Inglês | MEDLINE | ID: mdl-35252980

RESUMO

High-fidelity computer-aided experimentation is becoming more accessible with the development of computing power and artificial intelligence tools. The advancement of experimental hardware also empowers researchers to reach a level of accuracy that was not possible in the past. Marching toward the next generation of self-driving laboratories, the orchestration of both resources lies at the focal point of autonomous discovery in chemical science. To achieve such a goal, algorithmically accessible data representations and standardized communication protocols are indispensable. In this perspective, we recategorize the recently introduced approach based on Materials Acceleration Platforms into five functional components and discuss recent case studies that focus on the data representation and exchange scheme between different components. Emerging technologies for interoperable data representation and multi-agent systems are also discussed with their recent applications in chemical automation. We hypothesize that knowledge graph technology, orchestrating semantic web technologies and multi-agent systems, will be the driving force to bring data to knowledge, evolving our way of automating the laboratory.

10.
ACS Omega ; 6(37): 23764-23775, 2021 Sep 21.
Artigo em Inglês | MEDLINE | ID: mdl-34568656

RESUMO

In this paper, the ability of three selected machine learning neural and baseline models in predicting the power conversion efficiency (PCE) of organic photovoltaics (OPVs) using molecular structure information as an input is assessed. The bidirectional long short-term memory (gFSI/BiLSTM), attentive fingerprints (attentive FP), and simple graph neural networks (simple GNN) as well as baseline support vector regression (SVR), random forests (RF), and high-dimensional model representation (HDMR) methods are trained to both the large and computational Harvard clean energy project database (CEPDB) and the much smaller experimental Harvard organic photovoltaic 15 dataset (HOPV15). It was found that the neural-based models generally performed better on the computational dataset with the attentive FP model reaching a state-of-the-art performance with the test set mean squared error of 0.071. The experimental dataset proved much harder to fit, with all of the models exhibiting a rather poor performance. Contrary to the computational dataset, the baseline models were found to perform better than the neural models. To improve the ability of machine learning models to predict PCEs for OPVs, either better computational results that correlate well with experiments or more experimental data at well-controlled conditions are likely required.

11.
J Chem Inf Model ; 61(8): 3868-3880, 2021 08 23.
Artigo em Inglês | MEDLINE | ID: mdl-34338504

RESUMO

This paper describes the implementation and evaluation of a proof-of-concept Question Answering (QA) system for accessing chemical data from knowledge graphs (KGs) which offer data from chemical kinetics to the chemical and physical properties of species. We trained the question classification and named the entity recognition models that specialize in interpreting chemistry questions. The system has a novel design which applies a topic model to identify the question-to-ontology affiliation to handle ontologies with different structures. The topic model also helps the system to provide answers with a higher quality. Moreover, a new method that automatically generates training questions from ontologies is also implemented. The question set generated for training contains 432,989 questions under 11 types. Such a training set has been proven to be effective for training both the question classification model and the named entity recognition model. We evaluated the system using other KGQA systems as baselines. The system outperforms the chosen KGQA system answering chemistry-related questions. The QA system is also compared to the Google search engine and the WolframAlpha engine. It shows that the QA system can answer certain types of questions better than the search engines.


Assuntos
Ferramenta de Busca
12.
J Am Chem Soc ; 143(31): 12212-12219, 2021 08 11.
Artigo em Inglês | MEDLINE | ID: mdl-34338507

RESUMO

Soot emitted from incomplete combustion of hydrocarbon fuels contributes to global warming and causes human disease. The mechanism by which soot nanoparticles form within hydrocarbon flames is still an unsolved problem in combustion science. Mechanisms proposed to date involving purely chemical growth are limited by slow reaction rates, whereas mechanisms relying on solely physical interactions between molecules are limited by weak intermolecular interactions that are unstable at flame temperatures. Here, we show evidence for a reactive π-diradical aromatic soot precursor imaged using non-contact atomic force microscopy. Localization of π-electrons on non-hexagonal rings was found to allow for Kekulé aromatic soot precursors to possess a triplet diradical ground state. Barrierless chain reactions are shown between these reactive sites, which provide thermally stable aromatic rim-linked hydrocarbons under flame conditions. Quantum molecular dynamics simulations demonstrate physical condensation of aromatics that survive for tens of picoseconds. Bound internal rotors then enable the reactive sites to find each other and become chemically cross-linked before dissociation. These species provide a rapid, thermally stable chain reaction toward soot nanoparticle formation and could provide molecular targets for limiting the emission of these toxic combustion products.

13.
J Chem Inf Model ; 61(4): 1701-1717, 2021 04 26.
Artigo em Inglês | MEDLINE | ID: mdl-33825473

RESUMO

In this paper, we develop a knowledge graph-based framework for the automated calibration of combustion reaction mechanisms and demonstrate its effectiveness on a case study of poly(oxymethylene)dimethyl ether (PODEn, where n = 3) oxidation. We develop an ontological representation for combustion experiments, OntoChemExp, that allows for the semantic enrichment of experiments within the J-Park simulator (JPS, theworldavatar.com), an existing cross-domain knowledge graph. OntoChemExp is fully capable of supporting experimental results in the Process Informatics Model (PrIMe) database. Following this, a set of software agents are developed to perform experimental result retrieval, sensitivity analysis, and calibration tasks. The sensitivity analysis agent is used for both generic sensitivity analyses and reaction selection for subsequent calibration. The calibration process is performed as a sampling task, followed by an optimization task. The agents are designed for use with generic models but are demonstrated with ignition delay time and laminar flame speed simulations. We find that calibration times are reduced, while accuracy is increased compared to manual calibration, achieving a 79% decrease in the objective function value, as defined in this study. Further, we demonstrate how this workflow is implemented as an extension of the JPS.


Assuntos
Reconhecimento Automatizado de Padrão , Software , Calibragem , Éteres Metílicos , Tecnologia
14.
J Phys Chem A ; 124(48): 10040-10052, 2020 Dec 03.
Artigo em Inglês | MEDLINE | ID: mdl-33202128

RESUMO

The thermodynamics and kinetics of cross-linking reactions between PAHs of various reactive edge types that are observed in soot precursors are explored using density functional theory. The forward rate constants confirm that reactions involving aryl σ-radicals are faster than others, but rate constants for reactions between aryl σ-radicals and localized π-radicals can be as large or even larger than for two aryl σ-radicals. However, rates for all cross-linking reactions between small PAHs are likely too slow to explain soot formation. The equilibrium constants show that reactions involving σ and π-radical PAHs are the most favorable at flame temperatures. Equilibrium constants for larger PAHs show that the ability to form bonded-and-stacked structures results in enhanced equilibrium constants for the reaction of two large localized π-radicals compared to those for other edge types. This suggests that combined physical and chemical interactions between larger π-radical PAHs could be important in flame environments.

15.
J Chem Inf Model ; 60(12): 6155-6166, 2020 12 28.
Artigo em Inglês | MEDLINE | ID: mdl-33242243

RESUMO

In this paper, we develop a set of software agents which improve a knowledge-graph containing thermodynamic data of chemical species by means of quantum chemical calculations and error-canceling balanced reactions. The knowledge-graph represents species-associated information by making use of the principles of linked data, as employed in the Semantic Web, where concepts correspond to vertices and relationships between the concepts correspond to edges of the graph. We implement this representation by means of ontologies, which formalize the definition of concepts and their relationships, as a critical step to achieve interoperability between heterogeneous data formats and software. The agents, which conduct quantum chemical calculations and derive the estimates of standard enthalpies of formation, update the knowledge-graph with newly obtained results, improving data values, and adding nodes and connections between them. A key distinguishing feature of our approach is that it extends an existing, general-purpose knowledge-graph, called J-Park Simulator (http://theworldavatar.com), and its ecosystem of autonomous agents, thus enabling seamless cross-domain applications in wider contexts. To this end, we demonstrate how quantum calculations can directly affect the atmospheric dispersion of pollutants in an industrial emission use-case.


Assuntos
Ecossistema , Software , Termodinâmica
16.
ACS Omega ; 5(29): 18342-18348, 2020 Jul 28.
Artigo em Inglês | MEDLINE | ID: mdl-32743209

RESUMO

In this paper, we demonstrate through examples how the concept of a Semantic Web based knowledge graph can be used to integrate combustion modeling into cross-disciplinary applications and in particular how inconsistency issues in chemical mechanisms can be addressed. We discuss the advantages of linked data that form the essence of a knowledge graph and how we implement this in a number of interconnected ontologies, specifically in the context of combustion chemistry. Central to this is OntoKin, an ontology we have developed for capturing both the content and the semantics of chemical kinetic reaction mechanisms. OntoKin is used to represent the example mechanisms from the literature in a knowledge graph, which itself is part of the existing, more general knowledge graph and ecosystem of autonomous software agents that are acting on it. We describe a web interface, which allows users to interact with the system, upload and compare the existing mechanisms, and query species and reactions across the knowledge graph. The utility of the knowledge-graph approach is demonstrated for two use-cases: querying across multiple mechanisms from the literature and modeling the atmospheric dispersion of pollutants emitted by ships. As part of the query use-case, our ontological tools are applied to identify variations in the rate of a hydrogen abstraction reaction from methane as represented by 10 different mechanisms.

17.
J Chem Inf Model ; 60(1): 108-120, 2020 01 27.
Artigo em Inglês | MEDLINE | ID: mdl-31846323

RESUMO

An ontology for capturing both data and the semantics of chemical kinetic reaction mechanisms has been developed. Such mechanisms can be applied to simulate and understand the behavior of chemical processes, for example, the emission of pollutants from internal combustion engines. An ontology development methodology was used to produce the semantic model of the mechanisms, and a tool was developed to automate the assertion process. As part of the development methodology, the ontology is formally represented using a web ontology language (OWL), assessed by domain experts, and validated by applying a reasoning tool. The resulting ontology, termed OntoKin, has been used to represent example mechanisms from the literature. OntoKin and its instantiations are integrated to create a knowledge base (KB), which is deployed using the RDF4J triple store. The use of the OntoKin ontology and the KB is demonstrated for three use cases-querying across mechanisms, modeling atmospheric pollution dispersion, and as a mechanism browser tool. As part of the query use case, the OntoKin tools have been applied by a chemist to identify variations in the rate of a prompt NOx formation reaction in the combustion of ammonia as represented by four mechanisms in the literature.


Assuntos
Modelos Químicos , Semântica , Cinética , Software
18.
Phys Chem Chem Phys ; 21(29): 16240-16251, 2019 Aug 07.
Artigo em Inglês | MEDLINE | ID: mdl-31298672

RESUMO

In this work, the optical band gaps of polycyclic aromatic hydrocarbons (PAHs) crosslinked via an aliphatic bond, curved via pentagon integration and with radical character were computed using density functional theory. A variety of different functionals were benchmarked against optical band gaps (OBGs) measured by ultraviolet-visible spectroscopy with HSE06 being most accurate with a percentage error of 6% for a moderate basis set. Pericondensed aromatics with different symmetries were calculated with this improved functional providing new scaling relationships for the OBG versus size. Further calculations showed crosslinks cause a small decrease in the OBG of the monomers which saturates after 3-4 crosslinks. Curvature in PAHs was shown to increase the optical band gap due to the resulting change in hybridisation of the system, but this increase saturated at larger sizes. The increase in OBG between a flat PAH and a strained curved one was shown to be equivalent to a difference of several rings in size for pericondensed aromatic systems. The effect of σ-radicals on the optical band gap was also shown to be negligible, however, π-radicals were found to decrease the band gap by ∼0.5 eV. These findings have applications in understanding the molecular species involved in soot formation.

19.
J Chem Inf Model ; 59(7): 3154-3165, 2019 07 22.
Artigo em Inglês | MEDLINE | ID: mdl-31150242

RESUMO

The purpose of this article is to present an ontology, termed OntoCompChem, for quantum chemistry calculations as performed by the Gaussian quantum chemistry software, as well as a semantic web service named MolHub. The OntoCompChem ontology has been developed based on the semantics of concepts specified in the CompChem convention of Chemical Markup Language (CML) and by extending the Gainesville Core (GNVC) ontology. MolHub is developed in order to establish semantic interoperability between different tools used in quantum chemistry and thermochemistry calculations, and as such is integrated into the J-Park Simulator (JPS)-a multidomain interactive simulation platform and expert system. It uses the OntoCompChem ontology and implements a formal language based on propositional logic as a part of its query engine, which verifies satisfiability through reasoning. This paper also presents a NASA polynomial use-case scenario to demonstrate semantic interoperability between Gaussian and a tool for thermodynamic data calculations within MolHub.


Assuntos
Fenômenos Químicos , Software , Terminologia como Assunto , Internet , Modelos Moleculares , Estrutura Molecular , Termodinâmica
20.
Appl Opt ; 58(10): 2662-2670, 2019 Apr 01.
Artigo em Inglês | MEDLINE | ID: mdl-31045067

RESUMO

A new method is presented for performing the Abel inversion by fitting the line-of-sight projection of a predefined intensity distribution (FLiPPID) to the recorded 2D projections. The aim is to develop a methodology that is less prone to experimental noise when analyzing the projection of axisymmetric objects-in this case, co-flow diffusion flame images for color ratio pyrometry. A regression model is chosen for the light emission intensity distribution of the flame cross section as a function of radial distance from the flame center line. The forward Abel transform of this model function is fitted to the projected light intensity recorded by a color camera. For each of the three color channels, the model function requires three fitting parameters to match the radial intensity profile at each height above the burner. This results in a very smooth Abel inversion with no artifacts such as oscillations or negative values of the light source intensity, as is commonly observed for alternative Abel inversion techniques, such as the basis-set expansion or onion peeling. The advantages of the new FLiPPID method are illustrated by calculating the soot temperature and volume fraction profiles inside a co-flow diffusion flame, both being significantly smoother than those produced by the alternative inversion methods. The developed FLiPPID methodology can be applied to numerous other optical techniques for which smooth inverse Abel transforms are required.

SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...