Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 41
Filtrar
1.
Nat Commun ; 15(1): 5640, 2024 Jul 05.
Artigo em Inglês | MEDLINE | ID: mdl-38965235

RESUMO

The Structural Genomics Consortium is an international open science research organization with a focus on accelerating early-stage drug discovery, namely hit discovery and optimization. We, as many others, believe that artificial intelligence (AI) is poised to be a main accelerator in the field. The question is then how to best benefit from recent advances in AI and how to generate, format and disseminate data to enable future breakthroughs in AI-guided drug discovery. We present here the recommendations of a working group composed of experts from both the public and private sectors. Robust data management requires precise ontologies and standardized vocabulary while a centralized database architecture across laboratories facilitates data integration into high-value datasets. Lab automation and opening electronic lab notebooks to data mining push the boundaries of data sharing and data modeling. Important considerations for building robust machine-learning models include transparent and reproducible data processing, choosing the most relevant data representation, defining the right training and test sets, and estimating prediction uncertainty. Beyond data-sharing, cloud-based computing can be harnessed to build and disseminate machine-learning models. Important vectors of acceleration for hit and chemical probe discovery will be (1) the real-time integration of experimental data generation and modeling workflows within design-make-test-analyze (DMTA) cycles openly, and at scale and (2) the adoption of a mindset where data scientists and experimentalists work as a unified team, and where data science is incorporated into the experimental design.


Assuntos
Ciência de Dados , Descoberta de Drogas , Aprendizado de Máquina , Descoberta de Drogas/métodos , Ciência de Dados/métodos , Humanos , Inteligência Artificial , Disseminação de Informação/métodos , Mineração de Dados/métodos , Computação em Nuvem , Bases de Dados Factuais
2.
J Med Chem ; 67(13): 11226-11241, 2024 Jul 11.
Artigo em Inglês | MEDLINE | ID: mdl-38949112

RESUMO

Published compounds from ChEMBL version 32 are used to seek evidence for the occurrence of "natural selection" in drug discovery. Three measures of natural product (NP) character were applied, to compare time- and target-matched compounds reaching the clinic (clinical compounds in phase 1-3 development and approved drugs) with background compounds (reference compounds). Pseudo-NPs (PNPs), containing NP fragments combined in ways inaccessible by nature, are increasing over time, reaching 67% of clinical compounds first disclosed since 2010. PNPs are 54% more likely to be found in post-2008 clinical versus reference compounds. The majority of target classes show increased clinical compound NP character versus their reference compounds. Only 176 NP fragments appear in >1000 clinical compounds published since 2008, yet these make up on average 63% of the clinical compound's core scaffolds. There is untapped potential awaiting exploitation, by applying nature's building blocks─"natural intelligence"─to drug design.


Assuntos
Produtos Biológicos , Descoberta de Drogas , Bibliotecas de Moléculas Pequenas , Produtos Biológicos/química , Produtos Biológicos/farmacologia , Humanos , Bibliotecas de Moléculas Pequenas/química , Desenho de Fármacos
3.
Drug Discov Today ; 29(3): 103882, 2024 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-38218214

RESUMO

The Knowledge Management Center (KMC) for the Illuminating the Druggable Genome (IDG) project aims to aggregate, update, and articulate protein-centric data knowledge for the entire human proteome, with emphasis on the understudied proteins from the three IDG protein families. KMC collates and analyzes data from over 70 resources to compile the Target Central Resource Database (TCRD), which is the web-based informatics platform (Pharos). These data include experimental, computational, and text-mined information on protein structures, compound interactions, and disease and phenotype associations. Based on this knowledge, proteins are classified into different Target Development Levels (TDLs) for identification of understudied targets. Additional work by the KMC focuses on enriching target knowledge and producing DrugCentral and other data visualization tools for expanding investigation of understudied targets.


Assuntos
Genoma , Gestão do Conhecimento , Humanos , Proteoma , Bases de Dados Factuais , Informática
4.
Nucleic Acids Res ; 52(D1): D10-D17, 2024 Jan 05.
Artigo em Inglês | MEDLINE | ID: mdl-38015445

RESUMO

The European Molecular Biology Laboratory's European Bioinformatics Institute (EMBL-EBI) is one of the world's leading sources of public biomolecular data. Based at the Wellcome Genome Campus in Hinxton, UK, EMBL-EBI is one of six sites of the European Molecular Biology Laboratory (EMBL), Europe's only intergovernmental life sciences organisation. This overview summarises the latest developments in the services provided by EMBL-EBI data resources to scientific communities globally. These developments aim to ensure EMBL-EBI resources meet the current and future needs of these scientific communities, accelerating the impact of open biological data for all.


Assuntos
Academias e Institutos , Biologia Computacional , Biologia Computacional/organização & administração , Biologia Computacional/tendências , Academias e Institutos/organização & administração , Academias e Institutos/tendências , Bases de Dados de Ácidos Nucleicos , Europa (Continente)
5.
Nucleic Acids Res ; 52(D1): D1180-D1192, 2024 Jan 05.
Artigo em Inglês | MEDLINE | ID: mdl-37933841

RESUMO

ChEMBL (https://www.ebi.ac.uk/chembl/) is a manually curated, high-quality, large-scale, open, FAIR and Global Core Biodata Resource of bioactive molecules with drug-like properties, previously described in the 2012, 2014, 2017 and 2019 Nucleic Acids Research Database Issues. Since its introduction in 2009, ChEMBL's content has changed dramatically in size and diversity of data types. Through incorporation of multiple new datasets from depositors since the 2019 update, ChEMBL now contains slightly more bioactivity data from deposited data vs data extracted from literature. In collaboration with the EUbOPEN consortium, chemical probe data is now regularly deposited into ChEMBL. Release 27 made curated data available for compounds screened for potential anti-SARS-CoV-2 activity from several large-scale drug repurposing screens. In addition, new patent bioactivity data have been added to the latest ChEMBL releases, and various new features have been incorporated, including a Natural Product likeness score, updated flags for Natural Products, a new flag for Chemical Probes, and the initial annotation of the action type for ∼270 000 bioactivity measurements.


Assuntos
Descoberta de Drogas , Bases de Dados Factuais , Fatores de Tempo
6.
ACS Med Chem Lett ; 14(12): 1733-1741, 2023 Dec 14.
Artigo em Inglês | MEDLINE | ID: mdl-38116432

RESUMO

Efforts to tackle malaria must continue for a disease that threatens half of the global population. Parasite resistance to current therapies requires new chemotypes that are able to demonstrate effectiveness and safety. Previously, we developed a machine-learning-based approach to predict compound antimalarial activity, which was trained on the compound collections of several organizations. The resulting prediction platform, MAIP, was made freely available to the scientific community and offers a solution to prioritize molecules of interest in virtual screening and hit-to-lead optimization. Here, we experimentally validate MAIP and demonstrate how the approach was used in combination with a robust compound selection workflow and a recently introduced innovative high-throughput screening (HTS) cascade to select and purchase compounds from a public library for subsequent experimental screening. We observed a 12-fold enrichment compared with a randomly selected set of molecules, and the eight hits we ultimately selected exhibit good potency and absorption, distribution, metabolism, and excretion (ADME) profiles.

8.
Sci Data ; 10(1): 655, 2023 09 25.
Artigo em Inglês | MEDLINE | ID: mdl-37749083

RESUMO

Advancing age is the greatest risk factor for developing multiple age-related diseases. Therapeutic approaches targeting the underlying pathways of ageing, rather than individual diseases, may be an effective way to treat and prevent age-related morbidity while reducing the burden of polypharmacy. We harness the Open Targets Genetics Portal to perform a systematic analysis of nearly 1,400 genome-wide association studies (GWAS) mapped to 34 age-related diseases and traits, identifying genetic signals that are shared between two or more of these traits. Using locus-to-gene (L2G) mapping, we identify 995 targets with shared genetic links to age-related diseases and traits, which are enriched in mechanisms of ageing and include known ageing and longevity-related genes. Of these 995 genes, 128 are the target of an approved or investigational drug, 526 have experimental evidence of binding pockets or are predicted to be tractable, and 341 have no existing tractability evidence, representing underexplored genes which may reveal novel biological insights and therapeutic opportunities. We present these candidate targets for exploration and prioritisation in a web application.


Assuntos
Envelhecimento , Estudo de Associação Genômica Ampla , Multimorbidade , Longevidade , Fenótipo , Envelhecimento/genética , Humanos
9.
Nat Commun ; 14(1): 3826, 2023 07 10.
Artigo em Inglês | MEDLINE | ID: mdl-37429843

RESUMO

We conduct a large-scale meta-analysis of heart failure genome-wide association studies (GWAS) consisting of over 90,000 heart failure cases and more than 1 million control individuals of European ancestry to uncover novel genetic determinants for heart failure. Using the GWAS results and blood protein quantitative loci, we perform Mendelian randomization and colocalization analyses on human proteins to provide putative causal evidence for the role of druggable proteins in the genesis of heart failure. We identify 39 genome-wide significant heart failure risk variants, of which 18 are previously unreported. Using a combination of Mendelian randomization proteomics and genetic cis-only colocalization analyses, we identify 10 additional putatively causal genes for heart failure. Findings from GWAS and Mendelian randomization-proteomics identify seven (CAMK2D, PRKD1, PRKD3, MAPK3, TNFSF12, APOC3 and NAE1) proteins as potential targets for interventions to be used in primary prevention of heart failure.


Assuntos
Estudo de Associação Genômica Ampla , Insuficiência Cardíaca , Humanos , Análise da Randomização Mendeliana , Proteômica , Insuficiência Cardíaca/tratamento farmacológico , Insuficiência Cardíaca/genética
11.
PeerJ ; 11: e15153, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-37151295

RESUMO

The patent literature is a potentially valuable source of bioactivity data. In this article we describe a process to prioritise 3.7 million life science relevant patents obtained from the SureChEMBL database (https://www.surechembl.org/), according to how likely they were to contain bioactivity data for potent small molecules on less-studied targets, based on the classification developed by the Illuminating the Druggable Genome (IDG) project. The overall goal was to select a smaller number of patents that could be manually curated and incorporated into the ChEMBL database. Using relatively simple annotation and filtering pipelines, we have been able to identify a substantial number of patents containing quantitative bioactivity data for understudied targets that had not previously been reported in the peer-reviewed medicinal chemistry literature. We quantify the added value of such methods in terms of the numbers of targets that are so identified, and provide some specific illustrative examples. Our work underlines the potential value in searching the patent corpus in addition to the more traditional peer-reviewed literature. The small molecules found in these patents, together with their measured activity against the targets, are now accessible via the ChEMBL database.


Assuntos
Química Farmacêutica , Descoberta de Drogas , Descoberta de Drogas/métodos , Bases de Dados Factuais
12.
Nat Rev Chem ; 6(4): 287-295, 2022 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-35783295

RESUMO

One aspirational goal of computational chemistry is to predict potent and drug-like binders for any protein, such that only those that bind are synthesized. In this Roadmap, we describe the launch of Critical Assessment of Computational Hit-finding Experiments (CACHE), a public benchmarking project to compare and improve small molecule hit-finding algorithms through cycles of prediction and experimental testing. Participants will predict small molecule binders for new and biologically relevant protein targets representing different prediction scenarios. Predicted compounds will be tested rigorously in an experimental hub, and all predicted binders as well as all experimental screening data, including the chemical structures of experimentally tested compounds, will be made publicly available, and not subject to any intellectual property restrictions. The ability of a range of computational approaches to find novel binders will be evaluated, compared, and openly published. CACHE will launch 3 new benchmarking exercises every year. The outcomes will be better prediction methods, new small molecule binders for target proteins of importance for fundamental biology or drug discovery, and a major technological step towards achieving the goal of Target 2035, a global initiative to identify pharmacological probes for all human proteins.

13.
RSC Med Chem ; 13(1): 13-21, 2022 Jan 27.
Artigo em Inglês | MEDLINE | ID: mdl-35211674

RESUMO

Twenty years after the publication of the first draft of the human genome, our knowledge of the human proteome is still fragmented. The challenge of translating the wealth of new knowledge from genomics into new medicines is that proteins, and not genes, are the primary executers of biological function. Therefore, much of how biology works in health and disease must be understood through the lens of protein function. Accordingly, a subset of human proteins has been at the heart of research interests of scientists over the centuries, and we have accumulated varying degrees of knowledge about approximately 65% of the human proteome. Nevertheless, a large proportion of proteins in the human proteome (∼35%) remains uncharacterized, and less than 5% of the human proteome has been successfully targeted for drug discovery. This highlights the profound disconnect between our abilities to obtain genetic information and subsequent development of effective medicines. Target 2035 is an international federation of biomedical scientists from the public and private sectors, which aims to address this gap by developing and applying new technologies to create by year 2035 chemogenomic libraries, chemical probes, and/or biological probes for the entire human proteome.

14.
Front Immunol ; 12: 669357, 2021.
Artigo em Inglês | MEDLINE | ID: mdl-34349756

RESUMO

Development of adaptive immunity after COVID-19 and after vaccination against SARS-CoV-2 is predicated on recognition of viral peptides, presented on HLA class II molecules, by CD4+ T-cells. We capitalised on extensive high-resolution HLA data on twenty five human race/ethnic populations to investigate the role of HLA polymorphism on SARS-CoV-2 immunogenicity at the population and individual level. Within populations, we identify wide inter-individual variability in predicted peptide presentation from structural, non-structural and accessory SARS-CoV-2 proteins, according to individual HLA genotype. However, we find similar potential for anti-SARS-CoV-2 cellular immunity at the population level suggesting that HLA polymorphism is unlikely to account for observed disparities in clinical outcomes after COVID-19 among different race/ethnic groups. Our findings provide important insight on the potential role of HLA polymorphism on development of protective immunity after SARS-CoV-2 infection and after vaccination and a firm basis for further experimental studies in this field.


Assuntos
COVID-19/imunologia , Antígenos de Histocompatibilidade Classe II/genética , Imunidade Celular , SARS-CoV-2/imunologia , Apresentação de Antígeno , Linfócitos T CD4-Positivos/imunologia , COVID-19/genética , Genótipo , Antígenos de Histocompatibilidade Classe II/imunologia , Humanos , Peptídeos/imunologia , Polimorfismo Genético , Proteoma/imunologia , Proteínas Virais/imunologia
15.
Nat Rev Drug Discov ; 20(10): 789-797, 2021 10.
Artigo em Inglês | MEDLINE | ID: mdl-34285415

RESUMO

Proteolysis-targeting chimeras (PROTACs) are an emerging drug modality that may offer new opportunities to circumvent some of the limitations associated with traditional small-molecule therapeutics. By analogy with the concept of the 'druggable genome', the question arises as to which potential drug targets might PROTAC-mediated protein degradation be most applicable. Here, we present a systematic approach to the assessment of the PROTAC tractability (PROTACtability) of protein targets using a series of criteria based on data and information from a diverse range of relevant publicly available resources. Our approach could support decision-making on whether or not a particular target may be amenable to modulation using a PROTAC. Using our approach, we identified 1,067 proteins of the human proteome that have not yet been described in the literature as PROTAC targets that offer potential opportunities for future PROTAC-based efforts.


Assuntos
Desenho de Fármacos , Genoma , Animais , Humanos , Projetos de Pesquisa , Bibliotecas de Moléculas Pequenas
16.
J Med Chem ; 64(11): 7210-7230, 2021 06 10.
Artigo em Inglês | MEDLINE | ID: mdl-33983732

RESUMO

Physicochemical descriptors commonly used to define "drug-likeness" and ligand efficiency measures are assessed for their ability to differentiate marketed drugs from compounds reported to bind to their efficacious target or targets. Using ChEMBL version 26, a data set of 643 drugs acting on 271 targets was assembled, comprising 1104 drug-target pairs having ≥100 published compounds per target. Taking into account changes in their physicochemical properties over time, drugs are analyzed according to their target class, therapy area, and route of administration. Recent drugs, approved in 2010-2020, display no overall differences in molecular weight, lipophilicity, hydrogen bonding, or polar surface area from their target comparator compounds. Drugs are differentiated from target comparators by higher potency, ligand efficiency (LE), lipophilic ligand efficiency (LLE), and lower carboaromaticity. Overall, 96% of drugs have LE or LLE values, or both, greater than the median values of their target comparator compounds.


Assuntos
Ligantes , Preparações Farmacêuticas/química , Bases de Dados de Compostos Químicos , Vias de Administração de Medicamentos , Ligação de Hidrogênio , Interações Hidrofóbicas e Hidrofílicas , Peso Molecular , Preparações Farmacêuticas/metabolismo
17.
J Cheminform ; 13(1): 13, 2021 Feb 22.
Artigo em Inglês | MEDLINE | ID: mdl-33618772

RESUMO

Malaria is a disease affecting hundreds of millions of people across the world, mainly in developing countries and especially in sub-Saharan Africa. It is the cause of hundreds of thousands of deaths each year and there is an ever-present need to identify and develop effective new therapies to tackle the disease and overcome increasing drug resistance. Here, we extend a previous study in which a number of partners collaborated to develop a consensus in silico model that can be used to identify novel molecules that may have antimalarial properties. The performance of machine learning methods generally improves with the number of data points available for training. One practical challenge in building large training sets is that the data are often proprietary and cannot be straightforwardly integrated. Here, this was addressed by sharing QSAR models, each built on a private data set. We describe the development of an open-source software platform for creating such models, a comprehensive evaluation of methods to create a single consensus model and a web platform called MAIP available at https://www.ebi.ac.uk/chembl/maip/ . MAIP is freely available for the wider community to make large-scale predictions of potential malaria inhibiting compounds. This project also highlights some of the practical challenges in reproducing published computational methods and the opportunities that open-source software can offer to the community.

18.
Chem Res Toxicol ; 34(2): 385-395, 2021 02 15.
Artigo em Inglês | MEDLINE | ID: mdl-33507738

RESUMO

The safety of marketed drugs is an ongoing concern, with some of the more frequently prescribed medicines resulting in serious or life-threatening adverse effects in some patients. Safety-related information for approved drugs has been curated to include the assignment of toxicity class(es) based on their withdrawn status and/or black box warning information described on medicinal product labels. The ChEMBL resource contains a wide range of bioactivity data types, from early "Discovery" stage preclinical data for individual compounds through to postclinical data on marketed drugs; the inclusion of the curated drug safety data set within this framework can support a wide range of safety-related drug discovery questions. The curated drug safety data set will be made freely available through ChEMBL and updated in future database releases.


Assuntos
Preparações Farmacêuticas/química , Curadoria de Dados , Aprovação de Drogas , Efeitos Colaterais e Reações Adversas Relacionados a Medicamentos , Humanos , Modelos Moleculares
19.
Mol Syst Biol ; 16(7): e9405, 2020 07.
Artigo em Inglês | MEDLINE | ID: mdl-32627965

RESUMO

Low success rates during drug development are due, in part, to the difficulty of defining drug mechanism-of-action and molecular markers of therapeutic activity. Here, we integrated 199,219 drug sensitivity measurements for 397 unique anti-cancer drugs with genome-wide CRISPR loss-of-function screens in 484 cell lines to systematically investigate cellular drug mechanism-of-action. We observed an enrichment for positive associations between the profile of drug sensitivity and knockout of a drug's nominal target, and by leveraging protein-protein networks, we identified pathways underpinning drug sensitivity. This revealed an unappreciated positive association between mitochondrial E3 ubiquitin-protein ligase MARCH5 dependency and sensitivity to MCL1 inhibitors in breast cancer cell lines. We also estimated drug on-target and off-target activity, informing on specificity, potency and toxicity. Linking drug and gene dependency together with genomic data sets uncovered contexts in which molecular networks when perturbed mediate cancer cell loss-of-fitness and thereby provide independent and orthogonal evidence of biomarkers for drug development. This study illustrates how integrating cell line drug sensitivity with CRISPR loss-of-function screens can elucidate mechanism-of-action to advance drug development.


Assuntos
Antineoplásicos/farmacologia , Sistemas CRISPR-Cas , Desenvolvimento de Medicamentos/métodos , Ensaios de Seleção de Medicamentos Antitumorais/métodos , Redes Reguladoras de Genes/efeitos dos fármacos , Aptidão Genética/efeitos dos fármacos , Mapas de Interação de Proteínas/efeitos dos fármacos , Antineoplásicos/toxicidade , Biomarcadores/metabolismo , Linhagem Celular Tumoral , Técnicas de Inativação de Genes , Redes Reguladoras de Genes/genética , Aptidão Genética/genética , Genômica , Humanos , Modelos Lineares , Proteínas de Membrana/genética , Proteínas de Membrana/metabolismo , Proteína de Sequência 1 de Leucemia de Células Mieloides/antagonistas & inibidores , Preparações Farmacêuticas/metabolismo , Software , Ubiquitina-Proteína Ligases/genética , Ubiquitina-Proteína Ligases/metabolismo
20.
J Chem Inf Model ; 60(4): 1911-1916, 2020 04 27.
Artigo em Inglês | MEDLINE | ID: mdl-32207937

RESUMO

Methods that survey protein surfaces for binding hotspots can help to evaluate target tractability and guide exploration of potential ligand binding regions. Fragment Hotspot Maps builds upon interaction data mined from the CSD (Cambridge Structural Database) and exploits the idea of identifying hotspots using small chemical fragments, which is now widely used to design new drug leads. Prior to this publication, Fragment Hotspot Maps was only publicly available through a web application. To increase the accessibility of this algorithm we present the Hotspots API (application programming interface), a toolkit that offers programmatic access to the core Fragment Hotspot Maps algorithm, thereby facilitating the interpretation and application of the analysis. To demonstrate the package's utility, we present a workflow which automatically derives protein hydrogen-bond constraints for molecular docking with GOLD. The Hotspots API is available from https://github.com/prcurran/hotspots under the MIT license and is dependent upon the commercial CSD Python API.


Assuntos
Desenho de Fármacos , Software , Bases de Dados Factuais , Simulação de Acoplamento Molecular , Proteínas
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...