Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 8 de 8
Filtrar
1.
JCO Clin Cancer Inform ; 3: 1-11, 2019 11.
Artigo em Inglês | MEDLINE | ID: mdl-31834820

RESUMO

PURPOSE: Data sharing creates potential cost savings, supports data aggregation, and facilitates reproducibility to ensure quality research; however, data from heterogeneous systems require retrospective harmonization. This is a major hurdle for researchers who seek to leverage existing data. Efforts focused on strategies for data interoperability largely center around the use of standards but ignore the problems of competing standards and the value of existing data. Interoperability remains reliant on retrospective harmonization. Approaches to reduce this burden are needed. METHODS: The Cancer Imaging Archive (TCIA) is an example of an imaging repository that accepts data from a diversity of sources. It contains medical images from investigators worldwide and substantial nonimage data. Digital Imaging and Communications in Medicine (DICOM) standards enable querying across images, but TCIA does not enforce other standards for describing nonimage supporting data, such as treatment details and patient outcomes. In this study, we used 9 TCIA lung and brain nonimage files containing 659 fields to explore retrospective harmonization for cross-study query and aggregation. It took 329.5 hours, or 2.3 months, extended over 6 months to identify 41 overlapping fields in 3 or more files and transform 31 of them. We used the Genomic Data Commons (GDC) data elements as the target standards for harmonization. RESULTS: We characterized the issues and have developed recommendations for reducing the burden of retrospective harmonization. Once we harmonized the data, we also developed a Web tool to easily explore harmonized collections. CONCLUSION: While prospective use of standards can support interoperability, there are issues that complicate this goal. Our work recognizes and reveals retrospective harmonization issues when trying to reuse existing data and recommends national infrastructure to address these issues.


Assuntos
Neoplasias Encefálicas/diagnóstico por imagem , Curadoria de Dados/normas , Interoperabilidade da Informação em Saúde/normas , Neoplasias Pulmonares/diagnóstico por imagem , Neoplasias Encefálicas/diagnóstico , Curadoria de Dados/métodos , Bases de Dados Factuais , Guias como Assunto , Humanos , Neoplasias Pulmonares/diagnóstico , Reprodutibilidade dos Testes , Estudos Retrospectivos
2.
AMIA Annu Symp Proc ; 2019: 681-690, 2019.
Artigo em Inglês | MEDLINE | ID: mdl-32308863

RESUMO

Developing promising treatments in biomedicine often requires aggregation and analysis of data from disparate sources across the healthcare and research spectrum. To facilitate these approaches, there is a growing focus on supporting interoperation of datasets by standardizing data-capture and reporting requirements. Common Data Elements (CDEs)-precise specifications of questions and the set of allowable answers to each question-are increasingly being adopted to help meet these standardization goals. While CDEs can provide a strong conceptual foundation for interoperation, there are no widely recognized serialization or interchange formats to describe and exchange their definitions. As a result, CDEs defined in one system cannot be easily be reused by other systems. An additional problem is that current CDE-based systems tend to be rather heavyweight and cannot be easily adopted and used by third-parties. To address these problems, we developed extensions to a metadata management system called the CEDAR Workbench to provide a platform to simplify the creation, exchange, and use of CDEs. We show how the resulting system allows users to quickly define and share CDEs and to immediately use these CDEs to build and deploy Web-based forms to acquire conforming metadata. We also show how we incorporated a large CDE library from the National Cancer Institute's caDSR system and made these CDEs publicly available for general use.


Assuntos
Pesquisa Biomédica , Elementos de Dados Comuns , Coleta de Dados/normas , Gerenciamento de Dados/métodos , Elementos de Dados Comuns/normas , Gerenciamento de Dados/normas , Humanos , Internet , Metadados , National Institutes of Health (U.S.) , Sistema de Registros , Estados Unidos , Interface Usuário-Computador
4.
J Biomed Inform ; 41(1): 106-23, 2008 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-17512259

RESUMO

One of the requirements for a federated information system is interoperability, the ability of one computer system to access and use the resources of another system. This feature is particularly important in biomedical research systems, which need to coordinate a variety of disparate types of data. In order to meet this need, the National Cancer Institute Center for Bioinformatics (NCICB) has created the cancer Common Ontologic Representation Environment (caCORE), an interoperability infrastructure based on Model Driven Architecture. The caCORE infrastructure provides a mechanism to create interoperable biomedical information systems. Systems built using the caCORE paradigm address both aspects of interoperability: the ability to access data (syntactic interoperability) and understand the data once retrieved (semantic interoperability). This infrastructure consists of an integrated set of three major components: a controlled terminology service (Enterprise Vocabulary Services), a standards-based metadata repository (the cancer Data Standards Repository) and an information system with an Application Programming Interface (API) based on Domain Model Driven Architecture. This infrastructure is being leveraged to create a Semantic Service-Oriented Architecture (SSOA) for cancer research by the National Cancer Institute's cancer Biomedical Informatics Grid (caBIG).


Assuntos
Biologia Computacional/métodos , Sistemas de Gerenciamento de Base de Dados , Bases de Dados Factuais , Armazenamento e Recuperação da Informação/métodos , Metanálise como Assunto , Modelos Teóricos , Vocabulário Controlado , Internet , National Cancer Institute (U.S.) , Semântica , Estados Unidos
5.
BMC Med Inform Decis Mak ; 6: 25, 2006 Jun 20.
Artigo em Inglês | MEDLINE | ID: mdl-16787533

RESUMO

BACKGROUND: The Cancer Biomedical Informatics Grid (caBIG) is a network of individuals and institutions, creating a world wide web of cancer research. An important aspect of this informatics effort is the development of consistent practices for data standards development, using a multi-tier approach that facilitates semantic interoperability of systems. The semantic tiers include (1) information models, (2) common data elements, and (3) controlled terminologies and ontologies. The College of American Pathologists (CAP) cancer protocols and checklists are an important reporting standard in pathology, for which no complete electronic data standard is currently available. METHODS: In this manuscript, we provide a case study of Cancer Common Ontologic Representation Environment (caCORE) data standard implementation of the CAP cancer protocols and checklists model--an existing and complex paper based standard. We illustrate the basic principles, goals and methodology for developing caBIG models. RESULTS: Using this example, we describe the process required to develop the model, the technologies and data standards on which the process and models are based, and the results of the modeling effort. We address difficulties we encountered and modifications to caCORE that will address these problems. In addition, we describe four ongoing development projects that will use the emerging CAP data standards to achieve integration of tissue banking and laboratory information systems. CONCLUSION: The CAP cancer checklists can be used as the basis for an electronic data standard in pathology using the caBIG semantic modeling methodology.


Assuntos
Sistemas de Gerenciamento de Base de Dados , Internet , Informática Médica , Oncologia/normas , Neoplasias/patologia , Patologia Clínica/normas , Protocolos Clínicos , Humanos , National Institutes of Health (U.S.) , Processamento de Linguagem Natural , Neoplasias/classificação , Semântica , Integração de Sistemas , Estados Unidos , Interface Usuário-Computador , Vocabulário Controlado
6.
BMC Med Inform Decis Mak ; 6: 2, 2006 Jan 06.
Artigo em Inglês | MEDLINE | ID: mdl-16398930

RESUMO

BACKGROUND: Robust, programmatically accessible biomedical information services that syntactically and semantically interoperate with other resources are challenging to construct. Such systems require the adoption of common information models, data representations and terminology standards as well as documented application programming interfaces (APIs). The National Cancer Institute (NCI) developed the cancer common ontologic representation environment (caCORE) to provide the infrastructure necessary to achieve interoperability across the systems it develops or sponsors. The caCORE Software Development Kit (SDK) was designed to provide developers both within and outside the NCI with the tools needed to construct such interoperable software systems. RESULTS: The caCORE SDK requires a Unified Modeling Language (UML) tool to begin the development workflow with the construction of a domain information model in the form of a UML Class Diagram. Models are annotated with concepts and definitions from a description logic terminology source using the Semantic Connector component. The annotated model is registered in the Cancer Data Standards Repository (caDSR) using the UML Loader component. System software is automatically generated using the Codegen component, which produces middleware that runs on an application server. The caCORE SDK was initially tested and validated using a seven-class UML model, and has been used to generate the caCORE production system, which includes models with dozens of classes. The deployed system supports access through object-oriented APIs with consistent syntax for retrieval of any type of data object across all classes in the original UML model. The caCORE SDK is currently being used by several development teams, including by participants in the cancer biomedical informatics grid (caBIG) program, to create compatible data services. caBIG compatibility standards are based upon caCORE resources, and thus the caCORE SDK has emerged as a key enabling technology for caBIG. CONCLUSION: The caCORE SDK substantially lowers the barrier to implementing systems that are syntactically and semantically interoperable by providing workflow and automation tools that standardize and expedite modeling, development, and deployment. It has gained acceptance among developers in the caBIG program, and is expected to provide a common mechanism for creating data service nodes on the data grid that is under development.


Assuntos
Armazenamento e Recuperação da Informação , Informática Médica/métodos , Neoplasias , Integração de Sistemas , Interface Usuário-Computador , Humanos , Internet , National Institutes of Health (U.S.) , Processamento de Linguagem Natural , Linguagens de Programação , Semântica , Design de Software , Unified Medical Language System , Estados Unidos
7.
Int J Med Inform ; 70(1): 41-8, 2003 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-12706181

RESUMO

There have been an increasing number of large research consortia in recent years funded by the National Cancer Institute (NCI) to facilitate multi-disciplinary, multi-institutional cancer research. Some of these consortia have central data collection plans similar to a multi-center clinical trial whereas others plan to store data locally and pool or share the data at a later date. Regardless of the goal of the consortium, there is a need to standardize the way certain data are collected and stored, transferred, or reported across the institutions involved. This communication is a report of the process and current status of the development of common data elements (CDEs) by the Early Detection Research Network (EDRN). The development of the CDEs involved several stages with each stage requiring input from multi-disciplinary experts in oncology, epidemiology, biostatistics, pathology, informatics, and study coordination. An effort was made to be consistent with other consortia developing similar CDEs and to follow data standards when available. Initial focus was on identifying the minimum data that would be necessary to collect on all EDRN study participants and EDRN specimens. There are currently CDEs in the development or pilot phase for eight different organ sites and 13 different types of specimen procurements and plans to develop CDEs for 12 or more additional types of specimens.


Assuntos
Coleta de Dados/normas , Informática Médica/normas , Neoplasias/diagnóstico , Coleta de Dados/métodos , Humanos , Oncologia , National Institutes of Health (U.S.) , Projetos Piloto , Padrões de Referência , Estados Unidos
8.
AMIA Annu Symp Proc ; 2003(2003): 1048, 2003.
Artigo em Inglês | MEDLINE | ID: mdl-14728551

RESUMO

The NCI provides the cancer Data Standards Repository (caDSR) to support development and deployment of CDEs in cancer research. The caDSR, part of the NCI caCORE infrastructure, supports data management workflow requirements and adherence to ISO/IEC 11179 metadata standards. CDEs are developed using standard terminology from caCORE vocabulary services, and are then deployed to multi-site clinical trials data management systems. Here we describe the caDSR and how CDEs are managed and deployed in clinical research.


Assuntos
Ensaios Clínicos como Assunto/normas , Pesquisa Biomédica/normas , Ensaios Clínicos como Assunto/estatística & dados numéricos , Humanos , Terminologia como Assunto , Vocabulário Controlado
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...