Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 6 de 6
Filter
1.
Cancer Res ; 77(21): e115-e118, 2017 11 01.
Article in English | MEDLINE | ID: mdl-29092954

ABSTRACT

Precise phenotype information is needed to understand the effects of genetic and epigenetic changes on tumor behavior and responsiveness. Extraction and representation of cancer phenotypes is currently mostly performed manually, making it difficult to correlate phenotypic data to genomic data. In addition, genomic data are being produced at an increasingly faster pace, exacerbating the problem. The DeepPhe software enables automated extraction of detailed phenotype information from electronic medical records of cancer patients. The system implements advanced Natural Language Processing and knowledge engineering methods within a flexible modular architecture, and was evaluated using a manually annotated dataset of the University of Pittsburgh Medical Center breast cancer patients. The resulting platform provides critical and missing computational methods for computational phenotyping. Working in tandem with advanced analysis of high-throughput sequencing, these approaches will further accelerate the transition to precision cancer treatment. Cancer Res; 77(21); e115-8. ©2017 AACR.


Subject(s)
Computer Systems , Electronic Health Records , Natural Language Processing , Neoplasms/therapy , Data Mining/methods , Humans , Medical Informatics/methods , Neoplasms/diagnosis , Neoplasms/genetics , Phenotype , Precision Medicine/methods , Reproducibility of Results
2.
BMC Bioinformatics ; 17: 32, 2016 Jan 14.
Article in English | MEDLINE | ID: mdl-26763894

ABSTRACT

BACKGROUND: Natural language processing (NLP) applications are increasingly important in biomedical data analysis, knowledge engineering, and decision support. Concept recognition is an important component task for NLP pipelines, and can be either general-purpose or domain-specific. We describe a novel, flexible, and general-purpose concept recognition component for NLP pipelines, and compare its speed and accuracy against five commonly used alternatives on both a biological and clinical corpus. NOBLE Coder implements a general algorithm for matching terms to concepts from an arbitrary vocabulary set. The system's matching options can be configured individually or in combination to yield specific system behavior for a variety of NLP tasks. The software is open source, freely available, and easily integrated into UIMA or GATE. We benchmarked speed and accuracy of the system against the CRAFT and ShARe corpora as reference standards and compared it to MMTx, MGrep, Concept Mapper, cTAKES Dictionary Lookup Annotator, and cTAKES Fast Dictionary Lookup Annotator. RESULTS: We describe key advantages of the NOBLE Coder system and associated tools, including its greedy algorithm, configurable matching strategies, and multiple terminology input formats. These features provide unique functionality when compared with existing alternatives, including state-of-the-art systems. On two benchmarking tasks, NOBLE's performance exceeded commonly used alternatives, performing almost as well as the most advanced systems. Error analysis revealed differences in error profiles among systems. CONCLUSION: NOBLE Coder is comparable to other widely used concept recognition systems in terms of accuracy and speed. Advantages of NOBLE Coder include its interactive terminology builder tool, ease of configuration, and adaptability to various domains and tasks. NOBLE provides a term-to-concept matching system suitable for general concept recognition in biomedical NLP pipelines.


Subject(s)
Natural Language Processing , Software , Algorithms , Humans
3.
Cancer Res ; 75(24): 5194-201, 2015 Dec 15.
Article in English | MEDLINE | ID: mdl-26670560

ABSTRACT

Advances in cancer research and personalized medicine will require significant new bridging infrastructures, including more robust biorepositories that link human tissue to clinical phenotypes and outcomes. In order to meet that challenge, four cancer centers formed the Text Information Extraction System (TIES) Cancer Research Network, a federated network that facilitates data and biospecimen sharing among member institutions. Member sites can access pathology data that are de-identified and processed with the TIES natural language processing system, which creates a repository of rich phenotype data linked to clinical biospecimens. TIES incorporates multiple security and privacy best practices that, combined with legal agreements, network policies, and procedures, enable regulatory compliance. The TIES Cancer Research Network now provides integrated access to investigators at all member institutions, where multiple investigator-driven pilot projects are underway. Examples of federated search across the network illustrate the potential impact on translational research, particularly for studies involving rare cancers, rare phenotypes, and specific biologic behaviors. The network satisfies several key desiderata including local control of data and credentialing, inclusion of rich phenotype information, and applicability to diverse research objectives. The TIES Cancer Research Network presents a model for a national data and biospecimen network.


Subject(s)
Biological Specimen Banks/organization & administration , Biomedical Research , Neoplasms , Registries/standards , Translational Research, Biomedical , Humans , United States
4.
J Family Med Prim Care ; 4(3): 439-43, 2015.
Article in English | MEDLINE | ID: mdl-26288789

ABSTRACT

INTRODUCTION: Diabetes is an important public health problem of India. Studies have shown that increase in patients' knowledge regarding the disease results in better compliance to treatment and decrease in complications. This study was planned to assess the knowledge about diabetes and its correlation with pharmacological and non-pharmacological compliance, among the diabetic patients attending rural health center from Sangli District, Maharashtra (India). MATERIALS AND METHODS: The study was conducted during September to November 2014. The study subjects were all willing adult patients with type II diabetes mellitus attending a selected rural hospital. The study tool was pretested and self-administered questionnaire. Analysis was done using Microsoft Excel and SPSS-22. RESULTS: Total study participants were 307 in number, with the mean age of 55.6 years. The mean morbidity with diabetes was 10.7 years. Only 23.8% had good knowledge regarding diabetes, while 19.2% participants had poor knowledge. Knowledge was significantly associated with the compliance to the pharmacological and non-pharmacological management. CONCLUSION: Although most of the patients were suffering with diabetes for many years there is lack of knowledge regarding the disease and self care. The compliance to the management of diabetes was better in patients with good knowledge. Seminars, counseling sessions and workshop should be arranged periodically for diabetic patients to increase their awareness.

5.
J Am Med Inform Assoc ; 17(3): 253-64, 2010.
Article in English | MEDLINE | ID: mdl-20442142

ABSTRACT

The authors report on the development of the Cancer Tissue Information Extraction System (caTIES)--an application that supports collaborative tissue banking and text mining by leveraging existing natural language processing methods and algorithms, grid communication and security frameworks, and query visualization methods. The system fills an important need for text-derived clinical data in translational research such as tissue-banking and clinical trials. The design of caTIES addresses three critical issues for informatics support of translational research: (1) federation of research data sources derived from clinical systems; (2) expressive graphical interfaces for concept-based text mining; and (3) regulatory and security model for supporting multi-center collaborative research. Implementation of the system at several Cancer Centers across the country is creating a potential network of caTIES repositories that could provide millions of de-identified clinical reports to users. The system provides an end-to-end application of medical natural language processing to support multi-institutional translational research programs.


Subject(s)
Biological Specimen Banks , Data Mining , Information Dissemination , Natural Language Processing , Neoplasms/pathology , Translational Research, Biomedical , Computer Graphics , Computer Security , Humans , Interinstitutional Relations , Multicenter Studies as Topic , Neoplasms/surgery , United States , User-Computer Interface
6.
AMIA Annu Symp Proc ; : 654-8, 2005.
Article in English | MEDLINE | ID: mdl-16779121

ABSTRACT

Think-aloud usability analysis provides extremely useful data but is very time-consuming and expensive to perform because of the extensive manual video analysis that is required. We describe a simple method for automated detection of usability problems from client user interface events for a developing medical intelligent tutoring system. The method incorporates (1) an agent-based method for communication that funnels all interface events and system responses to a centralized database, (2) a simple schema for representing interface events and higher order subgoals, and (3) an algorithm that reproduces the criteria used for manual coding of usability problems. A correction factor was empirically determining to account for the slower task performance of users when thinking aloud. We tested the validity of the method by simultaneously identifying usability problems using TAU and manually computing them from stored interface event data using the proposed algorithm. All usability problems that did not rely on verbal utterances were detectable with the proposed method.


Subject(s)
Algorithms , Computer-Assisted Instruction , Electronic Data Processing , User-Computer Interface , Databases as Topic , Dermatology/education , Evaluation Studies as Topic , Humans , Pathology/education
SELECTION OF CITATIONS
SEARCH DETAIL
...