Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 47
Filter
1.
Theor Appl Genet ; 137(7): 156, 2024 Jun 11.
Article in English | MEDLINE | ID: mdl-38858297

ABSTRACT

KEY MESSAGE: Phenomic prediction implemented on a large diversity set can efficiently predict seed germination, capture low-effect favorable alleles that are not revealed by GWAS and identify promising genetic resources. Oilseed rape faces many challenges, especially at the beginning of its developmental cycle. Achieving rapid and uniform seed germination could help to ensure a successful establishment and therefore enabling the crop to compete with weeds and tolerate stresses during the earliest developmental stages. The polygenic nature of seed germination was highlighted in several studies, and more knowledge is needed about low- to moderate-effect underlying loci in order to enhance seed germination effectively by improving the genetic background and incorporating favorable alleles. A total of 17 QTL were detected for seed germination-related traits, for which the favorable alleles often corresponded to the most frequent alleles in the panel. Genomic and phenomic predictions methods provided moderate-to-high predictive abilities, demonstrating the ability to capture small additive and non-additive effects for seed germination. This study also showed that phenomic prediction estimated phenotypic values closer to phenotypic values than GEBV. Finally, as the predictive ability of phenomic prediction was less influenced by the genetic structure of the panel, it is worth using this prediction method to characterize genetic resources, particularly with a view to design prebreeding populations.


Subject(s)
Alleles , Brassica napus , Germination , Phenotype , Quantitative Trait Loci , Seeds , Germination/genetics , Seeds/growth & development , Seeds/genetics , Brassica napus/genetics , Brassica napus/growth & development , Phenomics/methods , Genomics/methods , Genotype , Plant Breeding/methods
2.
Sci Adv ; 10(19): eadj1424, 2024 May 10.
Article in English | MEDLINE | ID: mdl-38718126

ABSTRACT

The ongoing expansion of human genomic datasets propels therapeutic target identification; however, extracting gene-disease associations from gene annotations remains challenging. Here, we introduce Mantis-ML 2.0, a framework integrating AstraZeneca's Biological Insights Knowledge Graph and numerous tabular datasets, to assess gene-disease probabilities throughout the phenome. We use graph neural networks, capturing the graph's holistic structure, and train them on hundreds of balanced datasets via a robust semi-supervised learning framework to provide gene-disease probabilities across the human exome. Mantis-ML 2.0 incorporates natural language processing to automate disease-relevant feature selection for thousands of diseases. The enhanced models demonstrate a 6.9% average classification power boost, achieving a median receiver operating characteristic (ROC) area under curve (AUC) score of 0.90 across 5220 diseases from Human Phenotype Ontology, OpenTargets, and Genomics England. Notably, Mantis-ML 2.0 prioritizes associations from an independent UK Biobank phenome-wide association study (PheWAS), providing a stronger form of triaging and mitigating against underpowered PheWAS associations. Results are exposed through an interactive web resource.


Subject(s)
Biological Specimen Banks , Neural Networks, Computer , Humans , Genome-Wide Association Study/methods , Phenotype , United Kingdom , Phenomics/methods , Genetic Predisposition to Disease , Genomics/methods , Databases, Genetic , Algorithms , Computational Biology/methods , UK Biobank
3.
EBioMedicine ; 103: 105116, 2024 May.
Article in English | MEDLINE | ID: mdl-38636199

ABSTRACT

BACKGROUND: Deep learning facilitates large-scale automated imaging evaluation of body composition. However, associations of body composition biomarkers with medical phenotypes have been underexplored. Phenome-wide association study (PheWAS) techniques search for medical phenotypes associated with biomarkers. A PheWAS integrating large-scale analysis of imaging biomarkers and electronic health record (EHR) data could discover previously unreported associations and validate expected associations. Here we use PheWAS methodology to determine the association of abdominal CT-based skeletal muscle metrics with medical phenotypes in a large North American cohort. METHODS: An automated deep learning pipeline was used to measure skeletal muscle index (SMI; biomarker of myopenia) and skeletal muscle density (SMD; biomarker of myosteatosis) from abdominal CT scans of adults between 2012 and 2018. A PheWAS was performed with logistic regression using patient sex and age as covariates to assess for associations between CT-derived muscle metrics and 611 common EHR-derived medical phenotypes. PheWAS P values were considered significant at a Bonferroni corrected threshold (α = 0.05/1222). FINDINGS: 17,646 adults (mean age, 56 years ± 19 [SD]; 57.5% women) were included. CT-derived SMI was significantly associated with 268 medical phenotypes; SMD with 340 medical phenotypes. Previously unreported associations with the highest magnitude of significance included higher SMI with decreased cardiac dysrhythmias (OR [95% CI], 0.59 [0.55-0.64]; P < 0.0001), decreased epilepsy (OR, 0.59 [0.50-0.70]; P < 0.0001), and increased elevated prostate-specific antigen (OR, 1.84 [1.47-2.31]; P < 0.0001), and higher SMD with decreased decubitus ulcers (OR, 0.36 [0.31-0.42]; P < 0.0001), sleep disorders (OR, 0.39 [0.32-0.47]; P < 0.0001), and osteomyelitis (OR, 0.43 [0.36-0.52]; P < 0.0001). INTERPRETATION: PheWAS methodology reveals previously unreported associations between CT-derived biomarkers of myopenia and myosteatosis and EHR medical phenotypes. The high-throughput PheWAS technique applied on a population scale can generate research hypotheses related to myopenia and myosteatosis and can be adapted to research possible associations of other imaging biomarkers with hundreds of EHR medical phenotypes. FUNDING: National Institutes of Health, Stanford AIMI-HAI pilot grant, Stanford Precision Health and Integrated Diagnostics, Stanford Cardiovascular Institute, Stanford Center for Digital Health, and Stanford Knight-Hennessy Scholars.


Subject(s)
Phenotype , Tomography, X-Ray Computed , Humans , Male , Female , Middle Aged , Tomography, X-Ray Computed/methods , Adult , Aged , Body Composition , Biomarkers , Phenomics/methods , Genome-Wide Association Study , Muscle, Skeletal/diagnostic imaging , Muscle, Skeletal/metabolism , Electronic Health Records , Deep Learning
6.
Eur J Heart Fail ; 26(4): 841-850, 2024 Apr.
Article in English | MEDLINE | ID: mdl-38311963

ABSTRACT

AIM: Pathophysiological differences between patients with heart failure with preserved (HFpEF) and reduced (HFrEF) ejection fraction (EF) remain unclear. Therefore we used a phenomics approach, integrating selected proteomics data with patient characteristics and cardiac structural and functional parameters, to get insight into differential pathophysiological mechanisms and identify potential treatment targets. METHODS AND RESULTS: We report data from a representative subcohort of the prospective Singapore Heart Failure Outcomes and Phenotypes (SHOP), including patients with HFrEF (EF <40%, n = 217), HFpEF (EF ≥50%, n = 213), and age- and sex-matched controls without HF (n = 216). We measured 92 biomarkers using a proximity extension assay and assessed cardiac structure and function in all participants using echocardiography. We used multi-block projection to latent structure analysis to integrate clinical, echocardiographic, and biomarker variables. Candidate biomarker targets were cross-referenced with small-molecule and drug databases. The total cohort had a median age of 65 years (interquartile range 60-71), and 50% were women. Protein profiles strongly discriminated patients with HFrEF (area under the curve [AUC] = 0.89) and HFpEF (AUC = 0.94) from controls. Phenomics analyses identified unique druggable inflammatory markers in HFpEF from the tumour necrosis factor receptor superfamily (TNFRSF), which were positively associated with hypertension, diabetes, and increased posterior and relative wall thickness. In HFrEF, interleukin (IL)-8 and IL-6 were possible targets related to lower EF and worsening renal function. CONCLUSION: We identified pathophysiological mechanisms related to increased cardiac wall thickness parameters and potentially druggable inflammatory markers from the TNFRSF in HFpEF.


Subject(s)
Biomarkers , Echocardiography , Heart Failure , Stroke Volume , Humans , Heart Failure/physiopathology , Heart Failure/diagnosis , Stroke Volume/physiology , Female , Male , Aged , Middle Aged , Biomarkers/blood , Echocardiography/methods , Phenomics/methods , Prospective Studies , Singapore/epidemiology , Proteomics/methods
7.
Gene ; 822: 146340, 2022 May 15.
Article in English | MEDLINE | ID: mdl-35183688

ABSTRACT

BACKGROUND: Epoxyeicosatrienoic acids (EETs) are protective factors against cardiovascular diseases (CVDs) because of their vasodilatory, cholesterol-lowering, and anti-inflammatory effects. Soluble epoxide hydrolase (sEH), encoded by the EPHX2 gene, degrades EETs into less biologically active metabolites. EPHX2 is highly polymorphic, and genetic polymorphisms in EPHX2 have been linked to various types of CVDs, such as coronary heart disease, essential hypertension, and atrial fibrillation recurrence. METHODS: Based on a priori hypothesis that EPHX2 genetic polymorphisms play an important role in the pathogenesis of CVDs, we comprehensively investigated the associations between 210 genetic polymorphisms in the EPHX2 gene and an array of 118 diseases in the circulatory system using a large sample from the UK Biobank (N = 307,516). The diseases in electronic health records were mapped to the phecode system, which was more representative of independent phenotypes. Survival analyses were employed to examine the effects of EPHX2 variants on CVD incidence, and a phenome-wide association study was conducted to study the impact of EPHX2 polymorphisms on 62 traits, including blood pressure, blood lipid levels, and inflammatory indicators. RESULTS: A novel association between the intronic variant rs116932590 and the phenotype "aneurysm and dissection of heart" was identified. In addition, the rs149467044 and rs200286838 variants showed nominal evidence of association with arterial aneurysm and cerebrovascular disease, respectively. Furthermore, the variant rs751141, which was linked with a lower hydrolase activity of sEH, was significantly associated with metabolic traits, including blood levels of triglycerides, creatinine, and urate. CONCLUSIONS: Multiple novel associations observed in the present study highlight the important role of EPHX2 genetic variation in the pathogenesis of CVDs.


Subject(s)
Cardiovascular Diseases/epidemiology , Epoxide Hydrolases/genetics , Phenomics/methods , Polymorphism, Single Nucleotide , Adult , Aged , Biological Specimen Banks , Cardiovascular Diseases/genetics , Electronic Health Records , Female , Genetic Association Studies , Genetic Predisposition to Disease , Humans , Incidence , Male , Middle Aged , United Kingdom/epidemiology
8.
Br J Cancer ; 126(5): 822-830, 2022 03.
Article in English | MEDLINE | ID: mdl-34912076

ABSTRACT

BACKGROUND: Associations between colorectal cancer (CRC) and other health outcomes have been reported, but these may be subject to biases, or due to limitations of observational studies. METHODS: We set out to determine whether genetic predisposition to CRC is also associated with the risk of other phenotypes. Under the phenome-wide association study (PheWAS) and tree-structured phenotypic model (TreeWAS), we studied 334,385 unrelated White British individuals (excluding CRC patients) from the UK Biobank cohort. We generated a polygenic risk score (PRS) from CRC genome-wide association studies as a measure of CRC risk. We performed sensitivity analyses to test the robustness of the results and searched the Danish Disease Trajectory Browser (DTB) to replicate the observed associations. RESULTS: Eight PheWAS phenotypes and 21 TreeWAS nodes were associated with CRC genetic predisposition by PheWAS and TreeWAS, respectively. The PheWAS detected associations were from neoplasms and digestive system disease group (e.g. benign neoplasm of colon, anal and rectal polyp and diverticular disease). The results from the TreeWAS corroborated the results from the PheWAS. These results were replicated in the observational data within the DTB. CONCLUSIONS: We show that benign colorectal neoplasms share genetic aetiology with CRC using PheWAS and TreeWAS methods. Additionally, CRC genetic predisposition is associated with diverticular disease.


Subject(s)
Colorectal Neoplasms/pathology , Genome-Wide Association Study/methods , Phenomics/methods , Polymorphism, Single Nucleotide , Adult , Aged , Biological Specimen Banks , Colorectal Neoplasms/genetics , Female , Humans , Male , Middle Aged , Phenotype , United Kingdom
9.
PLoS One ; 16(10): e0246874, 2021.
Article in English | MEDLINE | ID: mdl-34624043

ABSTRACT

The aim of this study is to optimize the simulation result of the WOFOST model and explore the possibility of assimilating unmanned aerial vehicle (UAV) imagery into this model. Field images of wheat during its key growth stages are acquired with a UAV, and the corresponding leaf area index (LAI), biomass, and final yield are experimentally measured. LAI data is retrieved from the UAV imagery and assimilated into a localized WOFOST model using least squares optimization. Sensitive parameters, i.e., specific leaf area (SLATB0, SLATB0.5, SLATB2) and maximum CO2 assimilation rate (AMAXTB1, AMAXTB1.3) are adjusted to minimize the discrepancy between the LAI obtained from the model simulation and inversion of the UAV data. The results show that the assimilated model provides a better estimation of the growth and development of winter wheat in the study area. The R2, RMSE, and NRMSE of winter wheat LAI simulated with the assimilated WOFOST model are 0.8812, 0.49, and 23.5% respectively. The R2, RMSE, and NRMSE of the simulated yield are 0.9489, 327.06 kg·hm-2, and 6.5%. The accuracy in model simulation of winter wheat growth is improved, which demonstrates the feasibility of integrating UAV data into crop models.


Subject(s)
Triticum/growth & development , Agriculture/methods , Biomass , Computer Simulation , Least-Squares Analysis , Phenomics/methods , Plant Leaves/growth & development , Remote Sensing Technology/methods , Seasons
10.
Open Heart ; 8(2)2021 09.
Article in English | MEDLINE | ID: mdl-34521746

ABSTRACT

OBJECTIVE: Red cell distribution width (RDW) is an enigmatic biomarker associated with the presence and severity of multiple cardiovascular diseases (CVDs). It is unclear whether elevated RDW contributes to, results from, or is pleiotropically related to CVDs. We used contemporary genetic techniques to probe for evidence of aetiological associations between RDW, CVDs, and CVD risk factors. METHODS: Using an electronic health record (EHR)-based cohort, we built and deployed a genetic risk score (GRS) for RDW to test for shared genetic architecture between RDW and the cardiovascular phenome. We also created GRSs for common CVDs (coronary artery disease, heart failure, atrial fibrillation, peripheral arterial disease, venous thromboembolism) and CVD risk factors (body mass index (BMI), low-density lipoprotein, high-density lipoprotein, systolic blood pressure, diastolic blood pressure, serum triglycerides, estimated glomerular filtration rate, diabetes mellitus) to test each for association with RDW. Significant GRS associations were further interrogated by two-sample Mendelian randomisation (MR). In a separate EHR-based cohort, RDW values from 1-year pre-gastric bypass surgery and 1-2 years post-gastric bypass surgery were compared. RESULTS: In a cohort of 17 937 subjects, there were no significant associations between the RDW GRS and CVDs. Of the CVDs and CVD risk factors, only genetically predicted BMI was associated with RDW. In subsequent analyses, BMI was associated with RDW by multiple MR methods. In subjects undergoing bariatric surgery, RDW decreased postsurgery and followed a linear relationship with BMI change. CONCLUSIONS: RDW is unlikely to be aetiologically upstream or downstream of CVDs or CVD risk factors except for BMI. Genetic and clinical association analyses support an aetiological relationship between BMI and RDW.


Subject(s)
Body Mass Index , Cardiovascular Diseases/genetics , Ethnicity , Genetic Markers/genetics , Mendelian Randomization Analysis/methods , Phenomics/methods , Risk Assessment/methods , Biomarkers/blood , Cardiovascular Diseases/blood , Cardiovascular Diseases/ethnology , Erythrocyte Indices , Female , Humans , Incidence , Male , Middle Aged , United States/epidemiology
11.
Int J Mol Sci ; 22(15)2021 Jul 31.
Article in English | MEDLINE | ID: mdl-34361030

ABSTRACT

Drought is a main factor limiting crop yields. Modern agricultural technologies such as irrigation systems, ground mulching, and rainwater storage can prevent drought, but these are only temporary solutions. Understanding the physiological, biochemical, and molecular reactions of plants to drought stress is therefore urgent. The recent rapid development of genomics tools has led to an increasing interest in phenomics, i.e., the study of phenotypic plant traits. Among phenomic strategies, high-throughput phenotyping (HTP) is attracting increasing attention as a way to address the bottlenecks of genomic and phenomic studies. HTP provides researchers a non-destructive and non-invasive method yet accurate in analyzing large-scale phenotypic data. This review describes plant responses to drought stress and introduces HTP methods that can detect changes in plant phenotypes in response to drought.


Subject(s)
Crops, Agricultural/genetics , Droughts , Phenomics/methods , Plant Breeding/methods , Crops, Agricultural/physiology , High-Throughput Screening Assays/methods , Stress, Physiological
12.
BMC Biol ; 19(1): 156, 2021 08 02.
Article in English | MEDLINE | ID: mdl-34334126

ABSTRACT

BACKGROUND: The emergence and continued global spread of the current COVID-19 pandemic has highlighted the need for methods to identify novel or repurposed therapeutic drugs in a fast and effective way. Despite the availability of methods for the discovery of antiviral drugs, the majority tend to focus on the effects of such drugs on a given virus, its constituent proteins, or enzymatic activity, often neglecting the consequences on host cells. This may lead to partial assessment of the efficacy of the tested anti-viral compounds, as potential toxicity impacting the overall physiology of host cells may mask the effects of both viral infection and drug candidates. Here we present a method able to assess the general health of host cells based on morphological profiling, for untargeted phenotypic drug screening against viral infections. RESULTS: We combine Cell Painting with antibody-based detection of viral infection in a single assay. We designed an image analysis pipeline for segmentation and classification of virus-infected and non-infected cells, followed by extraction of morphological properties. We show that this methodology can successfully capture virus-induced phenotypic signatures of MRC-5 human lung fibroblasts infected with human coronavirus 229E (CoV-229E). Moreover, we demonstrate that our method can be used in phenotypic drug screening using a panel of nine host- and virus-targeting antivirals. Treatment with effective antiviral compounds reversed the morphological profile of the host cells towards a non-infected state. CONCLUSIONS: The phenomics approach presented here, which makes use of a modified Cell Painting protocol by incorporating an anti-virus antibody stain, can be used for the unbiased morphological profiling of virus infection on host cells. The method can identify antiviral reference compounds, as well as novel antivirals, demonstrating its suitability to be implemented as a strategy for antiviral drug repurposing and drug discovery.


Subject(s)
Antiviral Agents/pharmacology , Drug Discovery/methods , Phenomics/methods , SARS-CoV-2/drug effects , Cell Line , Dose-Response Relationship, Drug , Drug Evaluation, Preclinical/methods , Humans , SARS-CoV-2/physiology
13.
PLoS One ; 16(7): e0254908, 2021.
Article in English | MEDLINE | ID: mdl-34297757

ABSTRACT

Drought is one of the most severe and unpredictable abiotic stresses, occurring at any growth stage and affecting crop yields worldwide. Therefore, it is essential to develop drought tolerant varieties to ensure sustainable crop production in an ever-changing climate. High-throughput digital phenotyping technologies in tandem with robust screening methods enable precise and faster selection of genotypes for breeding. To investigate the use of digital imaging to reliably phenotype for drought tolerance, a genetically diverse safflower population was screened under different drought stresses at Agriculture Victoria's high-throughput, automated phenotyping platform, Plant Phenomics Victoria, Horsham. In the first experiment, four treatments, control (90% field capacity; FC), 40% FC at initial branching, 40% FC at flowering and 50% FC at initial branching and flowering, were applied to assess the performance of four safflower genotypes. Based on these results, drought stress using 50% FC at initial branching and flowering stages was chosen to further screen 200 diverse safflower genotypes. Measured plant traits and dry biomass showed high correlations with derived digital traits including estimated shoot biomass, convex hull area, caliper length and minimum area rectangle, indicating the viability of using digital traits as proxy measures for plant growth. Estimated shoot biomass showed close association having moderately high correlation with drought indices yield index, stress tolerance index, geometric mean productivity, and mean productivity. Diverse genotypes were classified into four clusters of drought tolerance based on their performance (seed yield and digitally estimated shoot biomass) under stress. Overall, results show that rapid and precise image-based, high-throughput phenotyping in controlled environments can be used to effectively differentiate response to drought stress in a large numbers of safflower genotypes.


Subject(s)
Carthamus tinctorius/genetics , Droughts , Genotype , Phenomics/methods , Plant Breeding/methods , Stress, Physiological , Automation, Laboratory/methods , Biomass , Carthamus tinctorius/physiology , Phenotype
14.
Hum Mol Genet ; 30(R2): R274-R284, 2021 10 01.
Article in English | MEDLINE | ID: mdl-34089057

ABSTRACT

The mouse is the pre-eminent model organism for studies of mammalian gene function and has provided an extraordinarily rich range of insights into basic genetic mechanisms and biological systems. Over several decades, the characterization of mouse mutants has illuminated the relationship between gene and phenotype, providing transformational insights into the genetic bases of disease. However, if we are to deliver the promise of genomic and precision medicine, we must develop a comprehensive catalogue of mammalian gene function that uncovers the dark genome and elucidates pleiotropy. Advances in large-scale mouse mutagenesis programmes allied to high-throughput mouse phenomics are now addressing this challenge and systematically revealing novel gene function and multi-morbidities. Alongside the development of these pan-genomic mutational resources, mouse genetics is employing a range of diversity resources to delineate gene-gene and gene-environment interactions and to explore genetic context. Critically, mouse genetics is a powerful tool for assessing the functional impact of human genetic variation and determining the causal relationship between variant and disease. Together these approaches provide unique opportunities to dissect in vivo mechanisms and systems to understand pathophysiology and disease. Moreover, the provision and utility of mouse models of disease has flourished and engages cumulatively at numerous points across the translational spectrum from basic mechanistic studies to pre-clinical studies, target discovery and therapeutic development.


Subject(s)
Genetic Association Studies , Genetic Predisposition to Disease , Genome , Genomics , Alleles , Animals , Disease Models, Animal , Drug Discovery , Gene Expression Regulation , Genetic Association Studies/methods , Genetic Engineering , Genome-Wide Association Study , Genomics/methods , High-Throughput Screening Assays , Humans , Mice , Mutagenesis , Mutation , Phenomics/methods , Phenotype , Precision Medicine , Signal Transduction , Translational Research, Biomedical
15.
ScientificWorldJournal ; 2021: 6653677, 2021.
Article in English | MEDLINE | ID: mdl-33986637

ABSTRACT

Obesity and endometriosis are two very common entities, yet there is uncertainty on their exact relationship. Observational studies have repeatedly shown an inverse correlation between endometriosis and a low body mass index (BMI). However, obesity does not protect against endometriosis and on the contrary an increased BMI may lead to more severe forms of the disease. Besides, BMI is not accurate in all cases of obesity. Consequently, other anthropometric and phenomic traits have been studied, including body adiposity content, as well as the effect of BMI early in life on the manifestation of endometriosis in adulthood. Some studies have shown that the phenotypic inverse correlation between the two entities has a genetic background; however, others have indicated that certain polymorphisms are linked with endometriosis in females with increased BMI. The advent of metabolic bariatric surgery and pertinent research have led to the emergence of biomolecules that may be pivotal in understanding the pathophysiological interaction of the two entities, especially in the context of angiogenesis and inflammation. Future research should focus on three objectives: detection and interpretation of obesity-related biomarkers in experimental models with endometriosis; integration of endometriosis-related queries into bariatric registries; and multidisciplinary approach and collaboration among specialists.


Subject(s)
Endometriosis/genetics , Gene Expression Regulation , Gene-Environment Interaction , Obesity/genetics , Phenomics/methods , Adiponectin/genetics , Adiponectin/metabolism , Adiposity/genetics , Anthropometry , Bariatric Surgery/methods , Biomarkers/metabolism , Body Mass Index , Case-Control Studies , Chemokines/genetics , Chemokines/metabolism , Endometriosis/diagnosis , Endometriosis/pathology , Endometriosis/surgery , Female , Ghrelin/genetics , Ghrelin/metabolism , Humans , Leptin/genetics , Leptin/metabolism , Obesity/diagnosis , Obesity/pathology , Obesity/surgery , Phenotype
16.
Am J Epidemiol ; 190(10): 1977-1992, 2021 10 01.
Article in English | MEDLINE | ID: mdl-33861317

ABSTRACT

Genotype-phenotype association studies often combine phenotype data from multiple studies to increase statistical power. Harmonization of the data usually requires substantial effort due to heterogeneity in phenotype definitions, study design, data collection procedures, and data-set organization. Here we describe a centralized system for phenotype harmonization that includes input from phenotype domain and study experts, quality control, documentation, reproducible results, and data-sharing mechanisms. This system was developed for the National Heart, Lung, and Blood Institute's Trans-Omics for Precision Medicine (TOPMed) program, which is generating genomic and other -omics data for more than 80 studies with extensive phenotype data. To date, 63 phenotypes have been harmonized across thousands of participants (recruited in 1948-2012) from up to 17 studies per phenotype. Here we discuss challenges in this undertaking and how they were addressed. The harmonized phenotype data and associated documentation have been submitted to National Institutes of Health data repositories for controlled access by the scientific community. We also provide materials to facilitate future harmonization efforts by the community, which include 1) the software code used to generate the 63 harmonized phenotypes, enabling others to reproduce, modify, or extend these harmonizations to additional studies, and 2) the results of labeling thousands of phenotype variables with controlled vocabulary terms.


Subject(s)
Genetic Association Studies/methods , Phenomics/methods , Precision Medicine/methods , Data Aggregation , Humans , Information Dissemination , National Heart, Lung, and Blood Institute (U.S.) , Phenotype , Program Evaluation , United States
17.
Genes (Basel) ; 12(2)2021 02 20.
Article in English | MEDLINE | ID: mdl-33672535

ABSTRACT

There is a gap in the conceptual framework linking genes to phenotypes (G2P) for non-model organisms, as most non-model organisms do not yet have genomic resources readily available. To address this, researchers often perform literature reviews to understand G2P linkages by curating a list of likely gene candidates, hinging upon other studies already conducted in closely related systems. Sifting through hundreds to thousands of articles is a cumbersome task that slows down the scientific process and may introduce bias into a study. To fill this gap, we created G2PMineR, a free and open source literature mining tool developed specifically for G2P research. This R package uses automation to make the G2P review process efficient and unbiased, while also generating hypothesized associations between genes and phenotypes within a taxonomical framework. We applied the package to a literature review for drought-tolerance in plants. The analysis provides biologically meaningful results within the known framework of drought tolerance in plants. Overall, the package is useful for conducting literature reviews for genome to phenome projects, and also has broad appeal to scientists investigating a wide range of study systems as it can conduct analyses under the auspices of three different kingdoms (Plantae, Animalia, and Fungi).


Subject(s)
Computational Biology , Genome , Genomics/methods , Genotype , Phenomics/methods , Phenotype , Software , Animals , Computational Biology/methods , Data Mining/methods , Databases, Genetic , Humans , Plants/genetics , Web Browser
18.
Genet Sel Evol ; 53(1): 22, 2021 Mar 05.
Article in English | MEDLINE | ID: mdl-33673800

ABSTRACT

Improvements in genomic technologies have outpaced the most optimistic predictions, allowing industry-scale application of genomic selection. However, only marginal gains in genetic prediction accuracy can now be expected by increasing marker density up to sequence, unless causative mutations are identified. We argue that some of the most scientifically disrupting and industry-relevant challenges relate to 'phenomics' instead of 'genomics'. Thanks to developments in sensor technology and artificial intelligence, there is a wide range of analytical tools that are already available and many more will be developed. We can now address some of the pressing societal demands on the industry, such as animal welfare concerns or efficiency in the use of resources. From the statistical and computational point of view, phenomics raises two important issues that require further work: penalization and dimension reduction. This will be complicated by the inherent heterogeneity and 'missingness' of the data. Overall, we can expect that precision livestock technologies will make it possible to collect hundreds of traits on a continuous basis from large numbers of animals. Perhaps the main revolution will come from redesigning animal breeding schemes to explicitly allow for high-dimensional phenomics. In the meantime, phenomics data will definitely enlighten our knowledge on the biological basis of phenotypes.


Subject(s)
Livestock/genetics , Phenomics/methods , Selective Breeding , Animals , Livestock/physiology
19.
J Microbiol ; 59(3): 249-258, 2021 Mar.
Article in English | MEDLINE | ID: mdl-33496936

ABSTRACT

Raman spectroscopy is a promising tool for identifying microbial phenotypes based on single cell Raman spectra reflecting cellular biochemical biomolecules. Recent studies using Raman spectroscopy have mainly analyzed phenotypic changes caused by microbial interactions or stress responses (e.g., antibiotics) and evaluated the microbial activity or substrate specificity under a given experimental condition using stable isotopes. Lack of labelling and the nondestructive pretreatment and measurement process of Raman spectroscopy have also aided in the sorting of microbial cells with interesting phenotypes for subsequently conducting physiology experiments through cultivation or genome analysis. In this review, we provide an overview of the principles, advantages, and status of utilization of Raman spectroscopy for studies linking microbial phenotypes and functions. We expect Raman spectroscopy to become a next-generation phenotyping tool that will greatly contribute in enhancing our understanding of microbial functions in natural and engineered systems.


Subject(s)
Bacteria/chemistry , Phenomics/methods , Spectrum Analysis, Raman/methods , Bacteria/genetics , Bacterial Physiological Phenomena , Phenotype
20.
Drug Discov Today ; 26(4): 887-901, 2021 04.
Article in English | MEDLINE | ID: mdl-33484947

ABSTRACT

Research and development (R&D) productivity across the pharmaceutical industry has received close scrutiny over the past two decades, especially taking into consideration reports of attrition rates and the colossal cost for drug development. The respective merits of the two main drug discovery approaches, phenotypic and target based, have divided opinion across the research community, because each hold different advantages for identifying novel molecular entities with a successful path to the market. Nevertheless, both have low translatability in the clinic. Artificial intelligence (AI) and adoption of machine learning (ML) tools offer the promise of revolutionising drug development, and overcoming obstacles in the drug discovery pipeline. Here, we assess the potential of target-driven and phenotypic-based approaches and offer a holistic description of the current state of the field, from both a scientific and industry perspective. With the emerging partnerships between AI/ML and pharma still in their relative infancy, we investigate the potential and current limitations with a particular focus on phenotypic drug discovery. Finally, we emphasise the value of public-private partnerships (PPPs) and cross-disciplinary collaborations to foster innovation and facilitate efficient drug discovery programmes.


Subject(s)
Artificial Intelligence , Drug Development , Drug Discovery , Drug Development/methods , Drug Development/trends , Drug Discovery/methods , Drug Discovery/trends , Humans , Machine Learning , Molecular Targeted Therapy/methods , Phenomics/methods
SELECTION OF CITATIONS
SEARCH DETAIL
...