Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 15 de 15
Filtrar
Mais filtros










Base de dados
Intervalo de ano de publicação
1.
BMC Med Inform Decis Mak ; 20(Suppl 10): 305, 2020 12 15.
Artigo em Inglês | MEDLINE | ID: mdl-33319709

RESUMO

BACKGROUND: Ontologies house various kinds of domain knowledge in formal structures, primarily in the form of concepts and the associative relationships between them. Ontologies have become integral components of many health information processing environments. Hence, quality assurance of the conceptual content of any ontology is critical. Relationships are foundational to the definition of concepts. Missing relationship errors (i.e., unintended omissions of important definitional relationships) can have a deleterious effect on the quality of an ontology. An abstraction network is a structure that overlays an ontology and provides an alternate, summarization view of its contents. One kind of abstraction network is called an area taxonomy, and a variation of it is called a subtaxonomy. A methodology based on these taxonomies for more readily finding missing relationship errors is explored. METHODS: The area taxonomy and the subtaxonomy are deployed to help reveal concepts that have a high likelihood of exhibiting missing relationship errors. A specific top-level grouping unit found within the area taxonomy and subtaxonomy, when deemed to be anomalous, is used as an indicator that missing relationship errors are likely to be found among certain concepts. Two hypotheses pertaining to the effectiveness of our Quality Assurance approach are studied. RESULTS: Our Quality Assurance methodology was applied to the Biological Process hierarchy of the National Cancer Institute thesaurus (NCIt) and SNOMED CT's Eye/vision finding subhierarchy within its Clinical finding hierarchy. Many missing relationship errors were discovered and confirmed in our analysis. For both test-bed hierarchies, our Quality Assurance methodology yielded a statistically significantly higher number of concepts with missing relationship errors in comparison to a control sample of concepts. Two hypotheses are confirmed by these findings. CONCLUSIONS: Quality assurance is a critical part of an ontology's lifecycle, and automated or semi-automated tools for supporting this process are invaluable. We introduced a Quality Assurance methodology targeted at missing relationship errors. Its successful application to the NCIt's Biological Process hierarchy and SNOMED CT's Eye/vision finding subhierarchy indicates that it can be a useful addition to the arsenal of tools available to ontology maintenance personnel.


Assuntos
Systematized Nomenclature of Medicine , Vocabulário Controlado , Processamento Eletrônico de Dados , Humanos , Probabilidade
2.
Methods Inf Med ; 56(3): 200-208, 2017 May 18.
Artigo em Inglês | MEDLINE | ID: mdl-28244549

RESUMO

OBJECTIVES: Ontologies are knowledge structures that lend support to many health-information systems. A study is carried out to assess the quality of ontological concepts based on a measure of their complexity. The results show a relation between complexity of concepts and error rates of concepts. METHODS: A measure of lateral complexity defined as the number of exhibited role types is used to distinguish between more complex and simpler concepts. Using a framework called an area taxonomy, a kind of abstraction network that summarizes the structural organization of an ontology, concepts are divided into two groups along these lines. Various concepts from each group are then subjected to a two-phase QA analysis to uncover and verify errors and inconsistencies in their modeling. A hierarchy of the National Cancer Institute thesaurus (NCIt) is used as our test-bed. A hypothesis pertaining to the expected error rates of the complex and simple concepts is tested. RESULTS: Our study was done on the NCIt's Biological Process hierarchy. Various errors, including missing roles, incorrect role targets, and incorrectly assigned roles, were discovered and verified in the two phases of our QA analysis. The overall findings confirmed our hypothesis by showing a statistically significant difference between the amounts of errors exhibited by more laterally complex concepts vis-à-vis simpler concepts. CONCLUSIONS: QA is an essential part of any ontology's maintenance regimen. In this paper, we reported on the results of a QA study targeting two groups of ontology concepts distinguished by their level of complexity, defined in terms of the number of exhibited role types. The study was carried out on a major component of an important ontology, the NCIt. The findings suggest that more complex concepts tend to have a higher error rate than simpler concepts. These findings can be utilized to guide ongoing efforts in ontology QA.


Assuntos
Ontologias Biológicas , Compreensão , Uso Significativo/normas , Modelos Estatísticos , National Cancer Institute (U.S.)/normas , Neoplasias/classificação , Simulação por Computador , Humanos , Processamento de Linguagem Natural , Garantia da Qualidade dos Cuidados de Saúde/normas , Estados Unidos , Vocabulário Controlado
3.
AMIA Annu Symp Proc ; 2016: 618-627, 2016.
Artigo em Inglês | MEDLINE | ID: mdl-28269858

RESUMO

The National Cancer Institute Thesaurus (NCIt) is a reference terminology used to support clinical, translational and basic research as well as administrative activities. As medical knowledge evolves, concepts that might be missing from a particular needed subdomain are regularly added to the NCIt. However, terminology development is known to be labor-intensive and error-prone. Therefore, cost-effective semi-automated methods for identifying potentially missing concepts would be useful to terminology curators. Previously, we have developed a structural method leveraging the native term mappings of the Unified Medical Language System to identify potential concepts in several of its source vocabularies to enrich the SNOMED CT. In this paper, we tested an analogous method for NCIt. Concepts from eight UMLS source terminologies were identified as possibilities to enrich NCIt's conceptual content.


Assuntos
National Cancer Institute (U.S.) , Descritores , Unified Medical Language System , Vocabulário Controlado , Humanos , Neoplasias , Systematized Nomenclature of Medicine , Estados Unidos
4.
Database (Oxford) ; 2012: bar066, 2012.
Artigo em Inglês | MEDLINE | ID: mdl-22434834

RESUMO

The overall objective of the Mouse-Human Anatomy Project (MHAP) was to facilitate the mapping and harmonization of anatomical terms used for mouse and human models by Mouse Genome Informatics (MGI) and the National Cancer Institute (NCI). The anatomy resources designated for this study were the Adult Mouse Anatomy (MA) ontology and the set of anatomy concepts contained in the NCI Thesaurus (NCIt). Several methods and software tools were identified and evaluated, then used to conduct an in-depth comparative analysis of the anatomy ontologies. Matches between mouse and human anatomy terms were determined and validated, resulting in a highly curated set of mappings between the two ontologies that has been used by other resources. These mappings will enable linking of data from mouse and human. As the anatomy ontologies have been expanded and refined, the mappings have been updated accordingly. Insights are presented into the overall process of comparing and mapping between ontologies, which may prove useful for further comparative analyses and ontology mapping efforts, especially those involving anatomy ontologies. Finally, issues concerning further development of the ontologies, updates to the mapping files, and possible additional applications and significance were considered. DATABASE URL: http://obofoundry.org/cgi-bin/detail.cgi?id=ma2ncit.


Assuntos
Anatomia/métodos , Bases de Dados Factuais , Vocabulário Controlado , Animais , Genômica , Humanos , Camundongos , Reprodutibilidade dos Testes
5.
J Biomed Inform ; 42(3): 530-9, 2009 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-19475726

RESUMO

The National Cancer Institute Enterprise Vocabulary Services (NCI EVS) uses a wide range of quality assurance (QA) techniques to maintain and extend NCI Thesaurus (NCIt). NCIt is a reference terminology and biomedical ontology used in a growing number of NCI and other systems that extend from translational and basic research through clinical care to public information and administrative activities. Both automated and manual QA techniques are employed throughout the editing and publication cycle, which includes inserting and editing NCIt in NCI Metathesaurus. NCI EVS conducts its own additional periodic and ongoing content QA. External reviews, and extensive evaluation by and interaction with EVS partners and other users, have also played an important part in the QA process. There have always been tensions and compromises between meeting the needs of dependent systems and providing consistent and well-structured content; external QA and feedback have been important in identifying and addressing such issues. Currently, NCI EVS is exploring new approaches to broaden external participation in the terminology development and QA process.


Assuntos
Terminologia como Assunto , Vocabulário Controlado , National Institutes of Health (U.S.) , Estados Unidos
6.
AMIA Annu Symp Proc ; : 520-4, 2008 Nov 06.
Artigo em Inglês | MEDLINE | ID: mdl-18998901

RESUMO

The development of ontologies that define entities and relationships among them has become essential for modern work in biomedicine. Ontologies are becoming so large in their coverage that no single centralized group of people can develop them effectively and ontology development becomes a community-based enterprise. In this paper we present Collaborative Protégé-a prototype tool that supports many aspects of community-based development, such as discussions integrated with ontology-editing process, chats, and annotation of changes. We have evaluated Collaborative Protégé in the context of the NCI Thesaurus development. Users have found the tool effective for carrying out discussions and recording design rationale.


Assuntos
Comportamento Cooperativo , Armazenamento e Recuperação da Informação/métodos , Processamento de Linguagem Natural , Neoplasias/classificação , Reconhecimento Automatizado de Padrão/métodos , Software , Descritores , Vocabulário Controlado , Algoritmos , Inteligência Artificial , Humanos , Estados Unidos
7.
Appl Ontol ; 3(3): 173-190, 2008 Jan 01.
Artigo em Inglês | MEDLINE | ID: mdl-19789731

RESUMO

The National Cancer Institute's (NCI) Thesaurus is a biomedical reference ontology. The NCI Thesaurus is represented using Description Logic, more specifically Ontylog, a Description logic implemented by Apelon, Inc. We are exploring the use of the DL species of the Web Ontology Language (OWL DL)-a W3C recommended standard for ontology representation-instead of Ontylog for representing the NCI Thesaurus. We have studied the requirements for knowledge representation of the NCI Thesaurus, and considered how OWL DL (and its implementation in Protégé-OWL) satisfies these requirements. In this paper, we discuss the areas where OWL DL was sufficient for representing required components, where tool support that would hide some of the complexity and extra levels of indirection would be required, and where language expressiveness is not sufficient given the representation requirements. Because many of the knowledge-representation issues that we encountered are very similar to the issues in representing other biomedical terminologies and ontologies in general, we believe that the lessons that we learned and the approaches that we developed will prove useful and informative for other researchers.

8.
J Biomed Inform ; 40(1): 30-43, 2007 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-16697710

RESUMO

Over the last 8 years, the National Cancer Institute (NCI) has launched a major effort to integrate molecular and clinical cancer-related information within a unified biomedical informatics framework, with controlled terminology as its foundational layer. The NCI Thesaurus is the reference terminology underpinning these efforts. It is designed to meet the growing need for accurate, comprehensive, and shared terminology, covering topics including: cancers, findings, drugs, therapies, anatomy, genes, pathways, cellular and subcellular processes, proteins, and experimental organisms. The NCI Thesaurus provides a partial model of how these things relate to each other, responding to actual user needs and implemented in a deductive logic framework that can help maintain the integrity and extend the informational power of what is provided. This paper presents the semantic model for cancer diseases and its uses in integrating clinical and molecular knowledge, more briefly examines the models and uses for drug, biochemical pathway, and mouse terminology, and discusses limits of the current approach and directions for future work.


Assuntos
Pesquisa Biomédica/métodos , Sistemas de Gerenciamento de Base de Dados , Bases de Dados Factuais , Armazenamento e Recuperação da Informação/métodos , Neoplasias/classificação , Neoplasias/fisiopatologia , Vocabulário Controlado , Biologia Computacional/métodos , Humanos , National Institutes of Health (U.S.) , Proteínas de Neoplasias/metabolismo , Semântica , Integração de Sistemas , Estados Unidos , Interface Usuário-Computador
9.
AMIA Annu Symp Proc ; : 165-70, 2007 Oct 11.
Artigo em Inglês | MEDLINE | ID: mdl-18693819

RESUMO

NCI Thesaurus entries reference corresponding nodes in the UMLS Semantic Network (SN). Adapting a process previously used to refine relationship definitions in the UMLS Metathesaurus, we used these Thesaurus-to-Network references to analyze alignment of the Thesaurus with the OBO Relations Ontology and at the same time validate and improve Thesaurus structure. Given this experience, we offer suggestions for enhancement of the UMLS SN so that it can be even more useful in the future.


Assuntos
National Cancer Institute (U.S.) , Neoplasias/classificação , Unified Medical Language System , Vocabulário Controlado , Biologia Computacional , Humanos , Semântica , Estados Unidos
10.
J Biomed Inform ; 38(2): 114-29, 2005 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-15797001

RESUMO

The National Cancer Institute has developed the NCI Thesaurus, a biomedical vocabulary for cancer research, covering terminology across a wide range of cancer research domains. A major design goal of the NCI Thesaurus is to facilitate translational research. We describe: the features of Ontylog, a description logic used to build NCI Thesaurus; our methodology for enhancing the terminology through collaboration between ontologists and domain experts, and for addressing certain real world challenges arising in modeling the Thesaurus; and finally, we describe the conversion of NCI Thesaurus from Ontylog into Web Ontology Language Lite. Ontylog has proven well suited for constructing big biomedical vocabularies. We have capitalized on the Ontylog constructs Kind and Role in the collaboration process described in this paper to facilitate communication between ontologists and domain experts. The artifacts and processes developed by NCI for collaboration may be useful in other biomedical terminology development efforts.


Assuntos
Bases de Dados Factuais , Dicionários como Assunto , Armazenamento e Recuperação da Informação/métodos , Oncologia/métodos , Neoplasias/classificação , Projetos de Pesquisa , Terminologia como Assunto , Vocabulário Controlado , Animais , Sistemas de Gerenciamento de Base de Dados , Humanos , Modelos Teóricos , Processamento de Linguagem Natural
11.
AMIA Annu Symp Proc ; : 61-5, 2005.
Artigo em Inglês | MEDLINE | ID: mdl-16779002

RESUMO

This paper reports on the alignment between mouse and human anatomies, a critical resource for comparative science as diseases in mice are used as mod-els of human disease. The two ontologies under investigation are the NCI Thesaurus (human anatomy) and the Adult Mouse Anatomical Dictionary, each comprising about 2500 anatomical concepts. This study compares two approaches to aligning ontologies. One is fully automatic, based on a combination of lexical and structural similarity; the other is manual. The resulting mappings were evaluated by an expert. 715 and 781 mappings were identified by each method respectively, of which 639 are common to both and all valid. The applications of the map-ping are discussed from the perspective of biology and from that of ontology.


Assuntos
Anatomia/classificação , Vocabulário Controlado , Animais , Dicionários como Assunto , Humanos , Camundongos , Semântica
12.
Stud Health Technol Inform ; 107(Pt 1): 33-7, 2004.
Artigo em Inglês | MEDLINE | ID: mdl-15360769

RESUMO

Cancer researchers need to be able to organize and report their results in a way that others can find, build upon, and relate to the specific clinical conditions of individual patients. NCI Thesaurus is a description logic terminology based on current science that helps individuals and software applications connect and organize the results of cancer research, e.g., by disease and underlying biology. Currently containing some 34,000 concepts--covering chemicals, drugs and other therapies, diseases, genes and gene products, anatomy, organisms, animal models, techniques, biologic processes, and administrative categories--NCI Thesaurus serves applications and the Web from a terminology server. As a scalable, formal terminology, the deployed Thesaurus, and associated applications and interfaces, are a model for some of the standards required for the NHII (National Health Information Infrastructure) and the Semantic Web.


Assuntos
Neoplasias , Terminologia como Assunto , Sistemas Computacionais , Humanos , Oncologia , National Institutes of Health (U.S.) , Estados Unidos , Vocabulário Controlado
13.
Comp Funct Genomics ; 5(8): 648-54, 2004.
Artigo em Inglês | MEDLINE | ID: mdl-18629178

RESUMO

The NCI Thesaurus is a reference terminology covering areas of basic and clinical science, built with the goal of facilitating translational research in cancer. It contains nearly 110 000 terms in approximately 36000 concepts, partitioned in 20 subdomains, which include diseases, drugs, anatomy, genes, gene products, techniques, and biological processes, among others, all with a cancer-centric focus in content, and originally designed to support coding activities across the National Cancer Institute. Each concept represents a unit of meaning and contains a number of annotations, such as synonyms and preferred name, as well as annotations such as textual definitions and optional references to external authorities. In addition, concepts are modelled with description logic (DL) and defined by their relationships to other concepts; there are currently approximately 90 types of named relations declared in the terminology. The NCI Thesaurus is produced by the Enterprise Vocabulary Services project, a collaborative effort between the NCI Center for Bioinformatics and the NCI Office of Communications, and is part of the caCORE infrastructure stack (http://ncicb.nci.nih.gov/NCICB/core). It can be accessed programmatically through the open caBIO API and browsed via the web (http://nciterms.nci.nih.gov). A history of editing changes is also accessible through the API. In addition, the Thesaurus is available for download in various file formats, including OWL, the web ontology language, to facilitate its utilization by others.

14.
Bioinformatics ; 19(18): 2404-12, 2003 Dec 12.
Artigo em Inglês | MEDLINE | ID: mdl-14668224

RESUMO

MOTIVATION: Sites with substantive bioinformatics operations are challenged to build data processing and delivery infrastructure that provides reliable access and enables data integration. Locally generated data must be processed and stored such that relationships to external data sources can be presented. Consistency and comparability across data sets requires annotation with controlled vocabularies and, further, metadata standards for data representation. Programmatic access to the processed data should be supported to ensure the maximum possible value is extracted. Confronted with these challenges at the National Cancer Institute Center for Bioinformatics, we decided to develop a robust infrastructure for data management and integration that supports advanced biomedical applications. RESULTS: We have developed an interconnected set of software and services called caCORE. Enterprise Vocabulary Services (EVS) provide controlled vocabulary, dictionary and thesaurus services. The Cancer Data Standards Repository (caDSR) provides a metadata registry for common data elements. Cancer Bioinformatics Infrastructure Objects (caBIO) implements an object-oriented model of the biomedical domain and provides Java, Simple Object Access Protocol and HTTP-XML application programming interfaces. caCORE has been used to develop scientific applications that bring together data from distinct genomic and clinical science sources. AVAILABILITY: caCORE downloads and web interfaces can be accessed from links on the caCORE web site (http://ncicb.nci.nih.gov/core). caBIO software is distributed under an open source license that permits unrestricted academic and commercial use. Vocabulary and metadata content in the EVS and caDSR, respectively, is similarly unrestricted, and is available through web applications and FTP downloads. SUPPLEMENTARY INFORMATION: http://ncicb.nci.nih.gov/core/publications contains links to the caBIO 1.0 class diagram and the caCORE 1.0 Technical Guide, which provide detailed information on the present caCORE architecture, data sources and APIs. Updated information appears on a regular basis on the caCORE web site (http://ncicb.nci.nih.gov/core).


Assuntos
Bases de Dados Factuais/normas , Armazenamento e Recuperação da Informação/métodos , Armazenamento e Recuperação da Informação/normas , Processamento de Linguagem Natural , Neoplasias/classificação , Interface Usuário-Computador , Animais , Biologia Computacional/métodos , Biologia Computacional/normas , Dicionários Médicos como Assunto , Humanos , Internet , National Institutes of Health (U.S.) , Estados Unidos , Vocabulário Controlado
15.
Blood ; 100(1): 238-45, 2002 Jul 01.
Artigo em Inglês | MEDLINE | ID: mdl-12070033

RESUMO

The hematopathology subcommittee of the Mouse Models of Human Cancers Consortium recognized the need for a classification of murine hematopoietic neoplasms that would allow investigators to diagnose lesions as well-defined entities according to accepted criteria. Pathologists and investigators worked cooperatively to develop proposals for the classification of lymphoid and nonlymphoid hematopoietic neoplasms. It is proposed here that nonlymphoid hematopoietic neoplasms of mice be classified in 4 broad categories: nonlymphoid leukemias, nonlymphoid hematopoietic sarcomas, myeloid dysplasias, and myeloid proliferations (nonreactive). Criteria for diagnosis and subclassification of these lesions include peripheral blood findings, cytologic features of hematopoietic tissues, histopathology, immunophenotyping, genetic features, and clinical course. Differences between murine and human lesions are reflected in the terminology and methods used for classification. This classification will be of particular value to investigators seeking to develop, use, and communicate about mouse models of human hematopoietic neoplasms.


Assuntos
Neoplasias Hematológicas/classificação , Camundongos , Animais , Neoplasias Hematológicas/patologia , Humanos , Leucemia/classificação , Leucemia/patologia , Transtornos Mieloproliferativos/classificação , Transtornos Mieloproliferativos/patologia , National Institutes of Health (U.S.) , Defeitos do Tubo Neural/classificação , Defeitos do Tubo Neural/patologia , Sarcoma/classificação , Sarcoma/patologia , Estados Unidos
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...