Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 14 de 14
Filtrar
1.
Stud Health Technol Inform ; 310: 94-98, 2024 Jan 25.
Artigo em Inglês | MEDLINE | ID: mdl-38269772

RESUMO

Drug development in rare diseases is challenging due to the limited availability of subjects with the diseases and recruiting from a small patient population. The high cost and low success rate of clinical trials motivate deliberate analysis of existing clinical trials to understand status of clinical development of orphan drugs and discover new insight for new trial. In this project, we aim to develop a user centered Rare disease based Clinical Trial Knowledge Graph (RCTKG) to integrate publicly available clinical trial data with rare diseases from the Genetic and Rare Disease (GARD) program in a semantic and standardized form for public use. To better serve and represent the interests of rare disease users, user stories were defined for three types of users, patients, healthcare providers and informaticians, to guide the RCTKG design in supporting the GARD program at NCATS/NIH and the broad clinical/research community in rare diseases.


Assuntos
Reconhecimento Automatizado de Padrão , Doenças Raras , Humanos , Doenças Raras/tratamento farmacológico , Doenças Raras/genética , Pessoal de Saúde , Conhecimento
3.
J Transl Med ; 21(1): 157, 2023 02 28.
Artigo em Inglês | MEDLINE | ID: mdl-36855134

RESUMO

BACKGROUND: The United Nations recently made a call to address the challenges of an estimated 300 million persons worldwide living with a rare disease through the collection, analysis, and dissemination of disaggregated data. Epidemiologic Information (EI) regarding prevalence and incidence data of rare diseases is sparse and current paradigms of identifying, extracting, and curating EI rely upon time-intensive, error-prone manual processes. With these limitations, a clear understanding of the variation in epidemiology and outcomes for rare disease patients is hampered. This challenges the public health of rare diseases patients through a lack of information necessary to prioritize research, policy decisions, therapeutic development, and health system allocations. METHODS: In this study, we developed a newly curated epidemiology corpus for Named Entity Recognition (NER), a deep learning framework, and a novel rare disease epidemiologic information pipeline named EpiPipeline4RD consisting of a web interface and Restful API. For the corpus creation, we programmatically gathered a representative sample of rare disease epidemiologic abstracts, utilized weakly-supervised machine learning techniques to label the dataset, and manually validated the labeled dataset. For the deep learning framework development, we fine-tuned our dataset and adapted the BioBERT model for NER. We measured the performance of our BioBERT model for epidemiology entity recognition quantitatively with precision, recall, and F1 and qualitatively through a comparison with Orphanet. We demonstrated the ability for our pipeline to gather, identify, and extract epidemiology information from rare disease abstracts through three case studies. RESULTS: We developed a deep learning model to extract EI with overall F1 scores of 0.817 and 0.878, evaluated at the entity-level and token-level respectively, and which achieved comparable qualitative results to Orphanet's collection paradigm. Additionally, case studies of the rare diseases Classic homocystinuria, GRACILE syndrome, Phenylketonuria demonstrated the adequate recall of abstracts with epidemiology information, high precision of epidemiology information extraction through our deep learning model, and the increased efficiency of EpiPipeline4RD compared to a manual curation paradigm. CONCLUSIONS: EpiPipeline4RD demonstrated high performance of EI extraction from rare disease literature to augment manual curation processes. This automated information curation paradigm will not only effectively empower development of the NIH Genetic and Rare Diseases Information Center (GARD), but also support the public health of the rare disease community.


Assuntos
Acidose Láctica , Colestase , Humanos , Doenças Raras/diagnóstico , Doenças Raras/epidemiologia , Saúde Pública , Armazenamento e Recuperação da Informação
4.
J Clin Transl Sci ; 7(1): e33, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-36845315

RESUMO

The National Center for Advancing Translational Science (NCATS) seeks to improve upon the translational process to advance research and treatment across all diseases and conditions and bring these interventions to all who need them. Addressing the racial/ethnic health disparities and health inequities that persist in screening, diagnosis, treatment, and health outcomes (e.g., morbidity, mortality) is central to NCATS' mission to deliver more interventions to all people more quickly. Working toward this goal will require enhancing diversity, equity, inclusion, and accessibility (DEIA) in the translational workforce and in research conducted across the translational continuum, to support health equity. This paper discusses how aspects of DEIA are integral to the mission of translational science (TS). It describes recent NIH and NCATS efforts to advance DEIA in the TS workforce and in the research we support. Additionally, NCATS is developing approaches to apply a lens of DEIA in its activities and research - with relevance to the activities of the TS community - and will elucidate these approaches through related examples of NCATS-led, partnered, and supported activities, working toward the Center's goal of bringing more treatments to all people more quickly.

5.
Front Artif Intell ; 5: 932665, 2022.
Artigo em Inglês | MEDLINE | ID: mdl-36034595

RESUMO

Rare diseases (RDs) are naturally associated with a low prevalence rate, which raises a big challenge due to there being less data available for supporting preclinical and clinical studies. There has been a vast improvement in our understanding of RD, largely owing to advanced big data analytic approaches in genetics/genomics. Consequently, a large volume of RD-related publications has been accumulated in recent years, which offers opportunities to utilize these publications for accessing the full spectrum of the scientific research and supporting further investigation in RD. In this study, we systematically analyzed, semantically annotated, and scientifically categorized RD-related PubMed articles, and integrated those semantic annotations in a knowledge graph (KG), which is hosted in Neo4j based on a predefined data model. With the successful demonstration of scientific contribution in RD via the case studies performed by exploring this KG, we propose to extend the current effort by expanding more RD-related publications and more other types of resources as a next step.

6.
Orphanet J Rare Dis ; 16(1): 483, 2021 11 18.
Artigo em Inglês | MEDLINE | ID: mdl-34794473

RESUMO

BACKGROUND: Limited knowledge and unclear underlying biology of many rare diseases pose significant challenges to patients, clinicians, and scientists. To address these challenges, there is an urgent need to inspire and encourage scientists to propose and pursue innovative research studies that aim to uncover the genetic and molecular causes of more rare diseases and ultimately to identify effective therapeutic solutions. A clear understanding of current research efforts, knowledge/research gaps, and funding patterns as scientific evidence is crucial to systematically accelerate the pace of research discovery in rare diseases, which is an overarching goal of this study. METHODS: To semantically represent NIH funding data for rare diseases and advance its use of effectively promoting rare disease research, we identified NIH funded projects for rare diseases by mapping GARD diseases to the project based on project titles; subsequently we presented and managed those identified projects in a knowledge graph using Neo4j software, hosted at NCATS, based on a pre-defined data model that captures semantics among the data. With this developed knowledge graph, we were able to perform several case studies to demonstrate scientific evidence generation for supporting rare disease research discovery. RESULTS: Of 5001 rare diseases belonging to 32 distinct disease categories, we identified 1294 diseases that are mapped to 45,647 distinct, NIH-funded projects obtained from the NIH ExPORTER by implementing semantic annotation of project titles. To capture semantic relationships presenting amongst mapped research funding data, we defined a data model comprised of seven primary classes and corresponding object and data properties. A Neo4j knowledge graph based on this predefined data model has been developed, and we performed multiple case studies over this knowledge graph to demonstrate its use in directing and promoting rare disease research. CONCLUSION: We developed an integrative knowledge graph with rare disease funding data and demonstrated its use as a source from where we can effectively identify and generate scientific evidence to support rare disease research. With the success of this preliminary study, we plan to implement advanced computational approaches for analyzing more funding related data, e.g., project abstracts and PubMed article abstracts, and linking to other types of biomedical data to perform more sophisticated research gap analysis and identify opportunities for future research in rare diseases.


Assuntos
Pesquisa Biomédica , Doenças Raras , Humanos , Reconhecimento Automatizado de Padrão
7.
AMIA Jt Summits Transl Sci Proc ; 2021: 325-334, 2021.
Artigo em Inglês | MEDLINE | ID: mdl-34457147

RESUMO

Rare diseases affect between 25 and 30 million people in the United States, and understanding their epidemiology is critical to focusing research efforts. However, little is known about the prevalence of many rare diseases. Given a lack of automated tools, current methods to identify and collect epidemiological data are managed through manual curation. To accelerate this process systematically, we developed a novel predictive model to programmatically identify epidemiologic studies on rare diseases from PubMed. A long short-term memory recurrent neural network was developed to predict whether a PubMed abstract represents an epidemiologic study. Our model performed well on our validation set (precision = 0.846, recall = 0.937, AUC = 0.967), and obtained satisfying results on the test set. This model thus shows promise to accelerate the pace of epidemiologic data curation in rare diseases and could be extended for use in other types of studies and in other disease domains.


Assuntos
Redes Neurais de Computação , Doenças Raras , Curadoria de Dados , Estudos Epidemiológicos , Humanos , PubMed , Doenças Raras/diagnóstico , Doenças Raras/epidemiologia , Estados Unidos
8.
J Biomed Semantics ; 11(1): 13, 2020 11 12.
Artigo em Inglês | MEDLINE | ID: mdl-33183351

RESUMO

BACKGROUND: The Genetic and Rare Diseases (GARD) Information Center was established by the National Institutes of Health (NIH) to provide freely accessible consumer health information on over 6500 genetic and rare diseases. As the cumulative scientific understanding and underlying evidence for these diseases have expanded over time, existing practices to generate knowledge from these publications and resources have not been able to keep pace. Through determining the applicability of computational approaches to enhance or replace manual curation tasks, we aim to both improve the sustainability and relevance of consumer health information, but also to develop a foundational database, from which translational science researchers may start to unravel disease characteristics that are vital to the research process. RESULTS: We developed a meta-ontology based integrative knowledge graph for rare diseases in Neo4j. This integrative knowledge graph includes a total of 3,819,623 nodes and 84,223,681 relations from 34 different biomedical data resources, including curated drug and rare disease associations. Semi-automatic mappings were generated for 2154 unique FDA orphan designations to 776 unique GARD diseases, and 3322 unique FDA designated drugs to UNII, as well as 180,363 associations between drug and indication from Inxight Drugs, which were integrated into the knowledge graph. We conducted four case studies to demonstrate the capabilities of this integrative knowledge graph in accelerating the curation of scientific understanding on rare diseases through the generation of disease mappings/profiles and pathogenesis associations. CONCLUSIONS: By integrating well-established database resources, we developed an integrative knowledge graph containing a large volume of biomedical and research data. Demonstration of several immediate use cases and limitations of this process reveal both the potential feasibility and barriers of utilizing graph-based resources and approaches to support their use by providers of consumer health information, such as GARD, that may struggle with the needs of maintaining knowledge reliant on an evolving and growing evidence-base. Finally, the successful integration of these datasets into a freely accessible knowledge graph highlights an opportunity to take a translational science view on the field of rare diseases by enabling researchers to identify disease characteristics, which may play a role in the translation of discover across different research domains.


Assuntos
Ontologias Biológicas , Gráficos por Computador , Bases de Dados Factuais , Doenças Raras/genética , Humanos , Pesquisa Translacional Biomédica
9.
Methods Inf Med ; 59(4-05): 131-139, 2020 08.
Artigo em Inglês | MEDLINE | ID: mdl-33147635

RESUMO

OBJECTIVE: In this study, we aimed to evaluate the capability of the Unified Medical Language System (UMLS) as one data standard to support data normalization and harmonization of datasets that have been developed for rare diseases. Through analysis of data mappings between multiple rare disease resources and the UMLS, we propose suggested extensions of the UMLS that will enable its adoption as a global standard in rare disease. METHODS: We analyzed data mappings between the UMLS and existing datasets on over 7,000 rare diseases that were retrieved from four publicly accessible resources: Genetic And Rare Diseases Information Center (GARD), Orphanet, Online Mendelian Inheritance in Men (OMIM), and the Monarch Disease Ontology (MONDO). Two types of disease mappings were assessed, (1) curated mappings extracted from those four resources; and (2) established mappings generated by querying the rare disease-based integrative knowledge graph developed in the previous study. RESULTS: We found that 100% of OMIM concepts, and over 50% of concepts from GARD, MONDO, and Orphanet were normalized by the UMLS and accurately categorized into the appropriate UMLS semantic groups. We analyzed 58,636 UMLS mappings, which resulted in 3,876 UMLS concepts across these resources. Manual evaluation of a random set of 500 UMLS mappings demonstrated a high level of accuracy (99%) of developing those mappings, which consisted of 414 mappings of synonyms (82.8%), 76 are subtypes (15.2%), and five are siblings (1%). CONCLUSION: The mapping results illustrated in this study that the UMLS was able to accurately represent rare disease concepts, and their associated information, such as genes and phenotypes, and can effectively be used to support data harmonization across existing resources developed on collecting rare disease data. We recommend the adoption of the UMLS as a data standard for rare disease to enable the existing rare disease datasets to support future applications in a clinical and community settings.


Assuntos
Doenças Raras , Unified Medical Language System , Humanos , Bases de Conhecimento , Doenças Raras/epidemiologia , Doenças Raras/genética , Semântica
10.
JMIR Med Inform ; 8(10): e18395, 2020 Oct 02.
Artigo em Inglês | MEDLINE | ID: mdl-33006565

RESUMO

BACKGROUND: Although many efforts have been made to develop comprehensive disease resources that capture rare disease information for the purpose of clinical decision making and education, there is no standardized protocol for defining and harmonizing rare diseases across multiple resources. This introduces data redundancy and inconsistency that may ultimately increase confusion and difficulty for the wide use of these resources. To overcome such encumbrances, we report our preliminary study to identify phenotypical similarity among genetic and rare diseases (GARD) that are presenting similar clinical manifestations, and support further data harmonization. OBJECTIVE: To support rare disease data harmonization, we aim to systematically identify phenotypically similar GARD diseases from a disease-oriented integrative knowledge graph and determine their similarity types. METHODS: We identified phenotypically similar GARD diseases programmatically with 2 methods: (1) We measured disease similarity by comparing disease mappings between GARD and other rare disease resources, incorporating manual assessment; 2) we derived clinical manifestations presenting among sibling diseases from disease classifications and prioritized the identified similar diseases based on their phenotypes and genotypes. RESULTS: For disease similarity comparison, approximately 87% (341/392) identified, phenotypically similar disease pairs were validated; 80% (271/392) of these disease pairs were accurately identified as phenotypically similar based on similarity score. The evaluation result shows a high precision (94%) and a satisfactory quality (86% F measure). By deriving phenotypical similarity from Monarch Disease Ontology (MONDO) and Orphanet disease classification trees, we identified a total of 360 disease pairs with at least 1 shared clinical phenotype and gene, which were applied for prioritizing clinical relevance. A total of 662 phenotypically similar disease pairs were identified and will be applied for GARD data harmonization. CONCLUSIONS: We successfully identified phenotypically similar rare diseases among the GARD diseases via 2 approaches, disease mapping comparison and phenotypical similarity derivation from disease classification systems. The results will not only direct GARD data harmonization in expanding translational science research but will also accelerate data transparency and consistency across different disease resources and terminologies, helping to build a robust and up-to-date knowledge resource on rare diseases.

11.
J Biomed Inform ; 100: 103325, 2019 12.
Artigo em Inglês | MEDLINE | ID: mdl-31676459

RESUMO

This special communication describes activities, products, and lessons learned from a recent hackathon that was funded by the National Center for Advancing Translational Sciences via the Biomedical Data Translator program ('Translator'). Specifically, Translator team members self-organized and worked together to conceptualize and execute, over a five-day period, a multi-institutional clinical research study that aimed to examine, using open clinical data sources, relationships between sex, obesity, diabetes, and exposure to airborne fine particulate matter among patients with severe asthma. The goal was to develop a proof of concept that this new model of collaboration and data sharing could effectively produce meaningful scientific results and generate new scientific hypotheses. Three Translator Clinical Knowledge Sources, each of which provides open access (via Application Programming Interfaces) to data derived from the electronic health record systems of major academic institutions, served as the source of study data. Jupyter Python notebooks, shared in GitHub repositories, were used to call the knowledge sources and analyze and integrate the results. The results replicated established or suspected relationships between sex, obesity, diabetes, exposure to airborne fine particulate matter, and severe asthma. In addition, the results demonstrated specific differences across the three Translator Clinical Knowledge Sources, suggesting cohort- and/or environment-specific factors related to the services themselves or the catchment area from which each service derives patient data. Collectively, this special communication demonstrates the power and utility of intense, team-oriented hackathons and offers general technical, organizational, and scientific lessons learned.


Assuntos
Asma/fisiopatologia , Diabetes Mellitus/fisiopatologia , Exposição Ambiental , Armazenamento e Recuperação da Informação , Obesidade/fisiopatologia , Material Particulado/toxicidade , Fatores Sexuais , Asma/complicações , Feminino , Humanos , Masculino , Obesidade/complicações , Índice de Gravidade de Doença
12.
BMC Med Ethics ; 20(1): 55, 2019 08 01.
Artigo em Inglês | MEDLINE | ID: mdl-31370847

RESUMO

BACKGROUND: Rare Disease research has seen tremendous advancements over the last decades, with the development of new technologies, various global collaborative efforts and improved data sharing. To maximize the impact of and to further build on these developments, there is a need for model consent clauses for rare diseases research, in order to improve data interoperability, to meet the informational needs of participants, and to ensure proper ethical and legal use of data sources and participants' overall protection. METHODS: A global Task Force was set up to develop model consent clauses specific to rare diseases research, that are comprehensive, harmonized, readily accessible, and internationally applicable, facilitating the recruitment and consent of rare disease research participants around the world. Existing consent forms and notices of consent were analyzed and classified under different consent themes, which were used as background to develop the model consent clauses. RESULTS: The IRDiRC-GA4GH MCC Task Force met in September 2018, to discuss and design model consent clauses. Based on analyzed consent forms, they listed generic core elements and designed the following rare disease research specific core elements; Rare Disease Research Introductory Clause, Familial Participation, Audio/Visual Imaging, Collecting, storing, sharing of rare disease data, Recontact for matching, Data Linkage, Return of Results to Family Members, Incapacity/Death, and Benefits. CONCLUSION: The model consent clauses presented in this article have been drafted to highlight consent elements that bear in mind the trends in rare disease research, while providing a tool to help foster harmonization and collaborative efforts.


Assuntos
Pesquisa Biomédica/ética , Termos de Consentimento/normas , Consentimento Livre e Esclarecido/normas , Doenças Raras/terapia , Pesquisa Biomédica/métodos , Pesquisa Biomédica/normas , Termos de Consentimento/ética , Humanos , Consentimento Livre e Esclarecido/ética
13.
Acad Med ; 91(7): 1002-6, 2016 07.
Artigo em Inglês | MEDLINE | ID: mdl-26839941

RESUMO

PURPOSE: The unequal representation of women and people of color compared with men and whites in medical school textbooks has been well documented, as have health care inequities, and biases-both overt and implicit-by health care providers and in access to care. The authors investigated whether this bias exists in PowerPoint slides used in didactic material for preclinical students at one medical school. METHOD: The authors analyzed 747 "decks" of slides from 33 preclinical courses in the medical school curriculum at the University of Washington School of Medicine in the years spanning 2009 to 2011. The authors coded the human images into various sex- and race-specific classifications and evaluated the distribution of images into these categories. RESULTS: Of the 4,033 images that could be coded by sex, 39.6% (1,595) were female and 60.5% (2,438) were male. Of the 5,230 images that could be coded by race/ethnicity, 78.4% (4,100) were white and 21.6% (1,130) were persons of color. Thus, images of whites and males predominated. CONCLUSIONS: The proportion of images used in didactic courses at one school of medicine is not representative of the U.S. population in terms of race or sex. The authors discuss the potential sources and impact of this bias, make a case for sex and race diversity in didactic imagery, and propose possible avenues for further research and curricular reform in an era when the population is becoming increasingly racially and ethnically diverse.


Assuntos
Recursos Audiovisuais/estatística & dados numéricos , Diversidade Cultural , Currículo/estatística & dados numéricos , Educação de Graduação em Medicina/métodos , Racismo/estatística & dados numéricos , Faculdades de Medicina/estatística & dados numéricos , Sexismo/estatística & dados numéricos , Educação de Graduação em Medicina/estatística & dados numéricos , Feminino , Humanos , Masculino , Variações Dependentes do Observador , Washington
14.
J Virol ; 79(24): 15258-64, 2005 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-16306597

RESUMO

Per os infectivity factors PIF1 (Ac119) and PIF2 (Ac022), like P74, are essential for oral infection of lepidopteran larval hosts of Autographa californica M nucleopolyhedrovirus (AcMNPV). Here we show that Ac115 also is a PIF (PIF3) and that, unlike PIF1 and PIF2, it does not mediate specific binding of AcMNPV occlusion-derived virus (ODV) to midgut target cells. We used an improved in vivo fluorescence dequenching assay to compare binding, fusion, and competition among control AcMNPV ODV and the ODVs of AcMNPV PIF1, PIF2, and PIF3 deletion mutants. Our results showed that binding and fusion of PIF1 and PIF2 mutants, but not the PIF3 mutant, were both qualitatively and quantitatively different from those of control ODV. Unlike control and PIF3-deficient ODV, an excess of PIF1- or PIF2-deficient ODV failed to compete effectively with control ODV's binding to specific receptors on midgut epithelial cells. Moreover, the levels of PIF1- and PIF2-deficient ODV binding were depressed threefold compared to control levels. Binding, fusion, and competition by PIF3-deficient ODV, however, were all indistinguishable from those of control ODV. These results implicated PIF1 and PIF2 as ODV envelope attachment proteins that mediate specific binding to primary target cells within the midgut. In contrast, PIF3 mediates another unidentified, but critical, early event during primary infection.


Assuntos
Proteínas de Insetos/fisiologia , Larva/citologia , Nucleopoliedrovírus/metabolismo , Proteínas do Envelope Viral/metabolismo , Proteínas Virais de Fusão/fisiologia , Animais , Ligação Competitiva , Sistema Digestório/virologia , Proteínas de Insetos/genética , Lepidópteros , Proteínas do Envelope Viral/genética
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...