Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 16 de 16
Filter
Add more filters










Publication year range
1.
bioRxiv ; 2024 Mar 18.
Article in English | MEDLINE | ID: mdl-38562682

ABSTRACT

Despite the central role that antibodies play in modern medicine, there is currently no way to rationally design novel antibodies to bind a specific epitope on a target. Instead, antibody discovery currently involves time-consuming immunization of an animal or library screening approaches. Here we demonstrate that a fine-tuned RFdiffusion network is capable of designing de novo antibody variable heavy chains (VHH's) that bind user-specified epitopes. We experimentally confirm binders to four disease-relevant epitopes, and the cryo-EM structure of a designed VHH bound to influenza hemagglutinin is nearly identical to the design model both in the configuration of the CDR loops and the overall binding pose.

2.
Science ; 384(6693): eadl2528, 2024 Apr 19.
Article in English | MEDLINE | ID: mdl-38452047

ABSTRACT

Deep-learning methods have revolutionized protein structure prediction and design but are presently limited to protein-only systems. We describe RoseTTAFold All-Atom (RFAA), which combines a residue-based representation of amino acids and DNA bases with an atomic representation of all other groups to model assemblies that contain proteins, nucleic acids, small molecules, metals, and covalent modifications, given their sequences and chemical structures. By fine-tuning on denoising tasks, we developed RFdiffusion All-Atom (RFdiffusionAA), which builds protein structures around small molecules. Starting from random distributions of amino acid residues surrounding target small molecules, we designed and experimentally validated, through crystallography and binding measurements, proteins that bind the cardiac disease therapeutic digoxigenin, the enzymatic cofactor heme, and the light-harvesting molecule bilin.


Subject(s)
Deep Learning , Protein Engineering , Proteins , Amino Acids/chemistry , Crystallography , DNA/chemistry , Models, Molecular , Proteins/chemistry , Protein Engineering/methods
3.
bioRxiv ; 2024 Jan 20.
Article in English | MEDLINE | ID: mdl-38293065

ABSTRACT

A catalog of transcription factor (TF) binding sites in the genome is critical for deciphering regulatory relationships. Here we present the culmination of the modERN (model organism Encyclopedia of Regulatory Networks) consortium that systematically assayed TF binding events in vivo in two major model organisms, Drosophila melanogaster (fly) and Caenorhabditis elegans (worm). We describe key features of these datasets, comprising 604 TFs identifying 3.6M sites in the fly and 350 TFs identifying 0.9 M sites in the worm. Applying a machine learning model to these data identifies sets of TFs with a prominent role in promoting target gene expression in specific cell types. TF binding data are available through the ENCODE Data Coordinating Center and at https://epic.gs.washington.edu/modERNresource, which provides access to processed and summary data, as well as widgets to probe cell type-specific TF-target relationships. These data are a rich resource that should fuel investigations into TF function during development.

4.
bioRxiv ; 2023 Nov 02.
Article in English | MEDLINE | ID: mdl-37961294

ABSTRACT

Despite transformative advances in protein design with deep learning, the design of small-molecule-binding proteins and sensors for arbitrary ligands remains a grand challenge. Here we combine deep learning and physics-based methods to generate a family of proteins with diverse and designable pocket geometries, which we employ to computationally design binders for six chemically and structurally distinct small-molecule targets. Biophysical characterization of the designed binders revealed nanomolar to low micromolar binding affinities and atomic-level design accuracy. The bound ligands are exposed at one edge of the binding pocket, enabling the de novo design of chemically induced dimerization (CID) systems; we take advantage of this to create a biosensor with nanomolar sensitivity for cortisol. Our approach provides a general method to design proteins that bind and sense small molecules for a wide range of analytical, environmental, and biomedical applications.

5.
bioRxiv ; 2023 Sep 21.
Article in English | MEDLINE | ID: mdl-37790440

ABSTRACT

Sequence-specific DNA-binding proteins (DBPs) play critical roles in biology and biotechnology, and there has been considerable interest in the engineering of DBPs with new or altered specificities for genome editing and other applications. While there has been some success in reprogramming naturally occurring DBPs using selection methods, the computational design of new DBPs that recognize arbitrary target sites remains an outstanding challenge. We describe a computational method for the design of small DBPs that recognize specific target sequences through interactions with bases in the major groove, and employ this method in conjunction with experimental screening to generate binders for 5 distinct DNA targets. These binders exhibit specificity closely matching the computational models for the target DNA sequences at as many as 6 base positions and affinities as low as 30-100 nM. The crystal structure of a designed DBP-target site complex is in close agreement with the design model, highlighting the accuracy of the design method. The designed DBPs function in both Escherichia coli and mammalian cells to repress and activate transcription of neighboring genes. Our method is a substantial step towards a general route to small and hence readily deliverable sequence-specific DBPs for gene regulation and editing.

6.
bioRxiv ; 2023 Sep 20.
Article in English | MEDLINE | ID: mdl-37781607

ABSTRACT

Endocytosis and lysosomal trafficking of cell surface receptors can be triggered by interaction with endogenous ligands. Therapeutic approaches such as LYTAC1,2 and KineTAC3, have taken advantage of this to target specific proteins for degradation by fusing modified native ligands to target binding proteins. While powerful, these approaches can be limited by possible competition with the endogenous ligand(s), the requirement in some cases for chemical modification that limits genetic encodability and can complicate manufacturing, and more generally, there may not be natural ligands which stimulate endocytosis through a given receptor. Here we describe general protein design approaches for designing endocytosis triggering binding proteins (EndoTags) that overcome these challenges. We present EndoTags for the IGF-2R, ASGPR, Sortillin, and Transferrin receptors, and show that fusing these tags to proteins which bind to soluble or transmembrane protein leads to lysosomal trafficking and target degradation; as these receptors have different tissue distributions, the different EndoTags could enable targeting of degradation to different tissues. The modularity and genetic encodability of EndoTags enables AND gate control for higher specificity targeted degradation, and the localized secretion of degraders from engineered cells. The tunability and modularity of our genetically encodable EndoTags should contribute to deciphering the relationship between receptor engagement and cellular trafficking, and they have considerable therapeutic potential as targeted degradation inducers, signaling activators for endocytosis-dependent pathways, and cellular uptake inducers for targeted antibody drug and RNA conjugates.

7.
Nat Commun ; 14(1): 2625, 2023 05 06.
Article in English | MEDLINE | ID: mdl-37149653

ABSTRACT

Recently it has become possible to de novo design high affinity protein binding proteins from target structural information alone. There is, however, considerable room for improvement as the overall design success rate is low. Here, we explore the augmentation of energy-based protein binder design using deep learning. We find that using AlphaFold2 or RoseTTAFold to assess the probability that a designed sequence adopts the designed monomer structure, and the probability that this structure binds the target as designed, increases design success rates nearly 10-fold. We find further that sequence design using ProteinMPNN rather than Rosetta considerably increases computational efficiency.


Subject(s)
Deep Learning , Protein Engineering , Proteins/metabolism , Protein Binding
8.
bioRxiv ; 2023 Dec 21.
Article in English | MEDLINE | ID: mdl-38187589

ABSTRACT

A general method for designing proteins to bind and sense any small molecule of interest would be widely useful. Due to the small number of atoms to interact with, binding to small molecules with high affinity requires highly shape complementary pockets, and transducing binding events into signals is challenging. Here we describe an integrated deep learning and energy based approach for designing high shape complementarity binders to small molecules that are poised for downstream sensing applications. We employ deep learning generated psuedocycles with repeating structural units surrounding central pockets; depending on the geometry of the structural unit and repeat number, these pockets span wide ranges of sizes and shapes. For a small molecule target of interest, we extensively sample high shape complementarity pseudocycles to generate large numbers of customized potential binding pockets; the ligand binding poses and the interacting interfaces are then optimized for high affinity binding. We computationally design binders to four diverse molecules, including for the first time polar flexible molecules such as methotrexate and thyroxine, which are expressed at high levels and have nanomolar affinities straight out of the computer. Co-crystal structures are nearly identical to the design models. Taking advantage of the modular repeating structure of pseudocycles and central location of the binding pockets, we constructed low noise nanopore sensors and chemically induced dimerization systems by splitting the binders into domains which assemble into the original pseudocycle pocket upon target molecule addition.

9.
Genetics ; 208(3): 937-949, 2018 03.
Article in English | MEDLINE | ID: mdl-29284660

ABSTRACT

To develop a catalog of regulatory sites in two major model organisms, Drosophila melanogaster and Caenorhabditis elegans, the modERN (model organism Encyclopedia of Regulatory Networks) consortium has systematically assayed the binding sites of transcription factors (TFs). Combined with data produced by our predecessor, modENCODE (Model Organism ENCyclopedia Of DNA Elements), we now have data for 262 TFs identifying 1.23 M sites in the fly genome and 217 TFs identifying 0.67 M sites in the worm genome. Because sites from different TFs are often overlapping and tightly clustered, they fall into 91,011 and 59,150 regions in the fly and worm, respectively, and these binding sites span as little as 8.7 and 5.8 Mb in the two organisms. Clusters with large numbers of sites (so-called high occupancy target, or HOT regions) predominantly associate with broadly expressed genes, whereas clusters containing sites from just a few factors are associated with genes expressed in tissue-specific patterns. All of the strains expressing GFP-tagged TFs are available at the stock centers, and the chromatin immunoprecipitation sequencing data are available through the ENCODE Data Coordinating Center and also through a simple interface (http://epic.gs.washington.edu/modERN/) that facilitates rapid accessibility of processed data sets. These data will facilitate a vast number of scientific inquiries into the function of individual TFs in key developmental, metabolic, and defense and homeostatic regulatory pathways, as well as provide a broader perspective on how individual TFs work together in local networks and globally across the life spans of these two key model organisms.


Subject(s)
Caenorhabditis elegans/genetics , Caenorhabditis elegans/metabolism , Databases, Genetic , Drosophila/genetics , Drosophila/metabolism , Genome-Wide Association Study , Transcription Factors/metabolism , Animals , Binding Sites , Chromatin Immunoprecipitation , Genome-Wide Association Study/methods , Models, Biological , Nucleotide Motifs , Protein Binding
11.
Nature ; 512(7515): 400-5, 2014 Aug 28.
Article in English | MEDLINE | ID: mdl-25164749

ABSTRACT

Discovering the structure and dynamics of transcriptional regulatory events in the genome with cellular and temporal resolution is crucial to understanding the regulatory underpinnings of development and disease. We determined the genomic distribution of binding sites for 92 transcription factors and regulatory proteins across multiple stages of Caenorhabditis elegans development by performing 241 ChIP-seq (chromatin immunoprecipitation followed by sequencing) experiments. Integration of regulatory binding and cellular-resolution expression data produced a spatiotemporally resolved metazoan transcription factor binding map. Using this map, we explore developmental regulatory circuits that encode combinatorial logic at the levels of co-binding and co-expression of transcription factors, characterizing the genomic coverage and clustering of regulatory binding, the binding preferences of, and biological processes regulated by, transcription factors, the global transcription factor co-associations and genomic subdomains that suggest shared patterns of regulation, and identifying key transcription factors and transcription factor co-associations for fate specification of individual lineages and cell types.


Subject(s)
Caenorhabditis elegans/growth & development , Caenorhabditis elegans/genetics , Gene Expression Regulation, Developmental/genetics , Genome, Helminth/genetics , Spatio-Temporal Analysis , Transcription Factors/metabolism , Animals , Binding Sites , Caenorhabditis elegans/cytology , Caenorhabditis elegans/embryology , Caenorhabditis elegans Proteins/metabolism , Cell Lineage , Chromatin Immunoprecipitation , Genomics , Larva/cytology , Larva/genetics , Larva/growth & development , Larva/metabolism , Protein Binding
12.
Nature ; 512(7515): 453-6, 2014 Aug 28.
Article in English | MEDLINE | ID: mdl-25164757

ABSTRACT

Despite the large evolutionary distances between metazoan species, they can show remarkable commonalities in their biology, and this has helped to establish fly and worm as model organisms for human biology. Although studies of individual elements and factors have explored similarities in gene regulation, a large-scale comparative analysis of basic principles of transcriptional regulatory features is lacking. Here we map the genome-wide binding locations of 165 human, 93 worm and 52 fly transcription regulatory factors, generating a total of 1,019 data sets from diverse cell types, developmental stages, or conditions in the three species, of which 498 (48.9%) are presented here for the first time. We find that structural properties of regulatory networks are remarkably conserved and that orthologous regulatory factor families recognize similar binding motifs in vivo and show some similar co-associations. Our results suggest that gene-regulatory properties previously observed for individual factors are general principles of metazoan regulation that are remarkably well-preserved despite extensive functional divergence of individual network connections. The comparative maps of regulatory circuitry provided here will drive an improved understanding of the regulatory underpinnings of model organism biology and how these relate to human biology, development and disease.


Subject(s)
Caenorhabditis elegans/genetics , Drosophila melanogaster/genetics , Evolution, Molecular , Gene Expression Regulation/genetics , Gene Regulatory Networks/genetics , Transcription Factors/metabolism , Animals , Binding Sites , Caenorhabditis elegans/growth & development , Chromatin Immunoprecipitation , Conserved Sequence/genetics , Drosophila melanogaster/growth & development , Gene Expression Regulation, Developmental/genetics , Genome/genetics , Humans , Molecular Sequence Annotation , Nucleotide Motifs/genetics , Organ Specificity/genetics , Transcription Factors/genetics
13.
Genome Res ; 22(7): 1282-94, 2012 Jul.
Article in English | MEDLINE | ID: mdl-22508763

ABSTRACT

How cells adopt different expression patterns is a fundamental question of developmental biology. We quantitatively measured reporter expression of 127 genes, primarily transcription factors, in every cell and with high temporal resolution in C. elegans embryos. Embryonic cells are highly distinct in their gene expression; expression of the 127 genes studied here can distinguish nearly all pairs of cells, even between cells of the same tissue type. We observed recurrent lineage-regulated expression patterns for many genes in diverse contexts. These patterns are regulated in part by the TCF-LEF transcription factor POP-1. Other genes' reporters exhibited patterns correlated with tissue, position, and left-right asymmetry. Sequential patterns both within tissues and series of sublineages suggest regulatory pathways. Expression patterns often differ between embryonic and larval stages for the same genes, emphasizing the importance of profiling expression in different stages. This work greatly expands the number of genes in each of these categories and provides the first large-scale, digitally based, cellular resolution compendium of gene expression dynamics in live animals. The resulting data sets will be a useful resource for future research.


Subject(s)
Caenorhabditis elegans/embryology , Caenorhabditis elegans/genetics , Gene Expression Regulation, Developmental , Genes, Reporter , Animals , Body Patterning , Caenorhabditis elegans/metabolism , Caenorhabditis elegans Proteins/genetics , Caenorhabditis elegans Proteins/metabolism , Cell Division , Cell Lineage , DNA-Binding Proteins/genetics , DNA-Binding Proteins/metabolism , Embryonic Development , Gene Expression Profiling , High Mobility Group Proteins/genetics , High Mobility Group Proteins/metabolism , Organ Specificity , Promoter Regions, Genetic , Regulatory Elements, Transcriptional , Transcription Factors/genetics , Transcription Factors/metabolism
14.
Cell ; 120(2): 249-59, 2005 Jan 28.
Article in English | MEDLINE | ID: mdl-15680330

ABSTRACT

Brassinosteroids (BRs) signal through a plasma membrane-localized receptor kinase to regulate plant growth and development. We showed previously that a novel protein, BES1, accumulates in the nucleus in response to BRs, where it plays a role in BR-regulated gene expression; however, the mechanism by which BES1 regulates gene expression is unknown. In this study, we dissect BES1 subdomains and establish that BES1 is a transcription factor that binds to and activates BR target gene promoters both in vitro and in vivo. BES1 interacts with a basic helix-loop-helix protein, BIM1, to synergistically bind to E box (CANNTG) sequences present in many BR-induced promoters. Loss-of-function and gain-of-function mutants of BIM1 and its close family members display BR response phenotypes. Thus, BES1 defines a new class of plant-specific transcription factors that cooperate with transcription factors such as BIM1 to regulate BR-induced genes.


Subject(s)
Arabidopsis Proteins/metabolism , Arabidopsis/metabolism , Gene Expression Regulation, Plant/physiology , Phytosterols/metabolism , Transcription Factors/metabolism , Amino Acid Sequence , Cell Nucleus/metabolism , DNA-Binding Proteins , Helix-Loop-Helix Motifs/physiology , Molecular Sequence Data , Nuclear Proteins/metabolism
15.
Development ; 131(21): 5341-51, 2004 Nov.
Article in English | MEDLINE | ID: mdl-15486337

ABSTRACT

Plant steroid hormones, brassinosteroids (BRs), are perceived by the plasma membrane-localized leucine-rich-repeat-receptor kinase BRI1. Based on sequence similarity, we have identified three members of the BRI1 family, named BRL1, BRL2 and BRL3. BRL1 and BRL3, but not BRL2, encode functional BR receptors that bind brassinolide, the most active BR, with high affinity. In agreement, only BRL1 and BRL3 can rescue bri1 mutants when expressed under the control of the BRI1 promoter. While BRI1 is ubiquitously expressed in growing cells, the expression of BRL1 and BRL3 is restricted to non-overlapping subsets of vascular cells. Loss-of-function of brl1 causes abnormal phloem:xylem differentiation ratios and enhances the vascular defects of a weak bri1 mutant. bri1 brl1 brl3 triple mutants enhance bri1 dwarfism and also exhibit abnormal vascular differentiation. Thus, Arabidopsis contains a small number of BR receptors that have specific functions in cell growth and vascular differentiation.


Subject(s)
Arabidopsis Proteins/metabolism , Arabidopsis/growth & development , Arabidopsis/metabolism , Cell Differentiation , Receptors, Cell Surface/metabolism , Amino Acid Sequence , Arabidopsis/chemistry , Arabidopsis/cytology , Arabidopsis Proteins/chemistry , Arabidopsis Proteins/genetics , Brassinosteroids , Cholestanols/metabolism , Gene Expression Regulation, Plant , Molecular Sequence Data , Mutation/genetics , Phenotype , Phylogeny , Receptors, Cell Surface/chemistry , Receptors, Cell Surface/genetics , Sequence Alignment , Sequence Homology , Steroids, Heterocyclic/metabolism
16.
Dev Cell ; 2(4): 505-13, 2002 Apr.
Article in English | MEDLINE | ID: mdl-11970900

ABSTRACT

Plant steroid hormones, brassinosteroids (BRs), are perceived by a cell surface receptor kinase, BRI1, but how BR binding leads to regulation of gene expression in the nucleus is unknown. Here we describe the identification of BZR1 as a nuclear component of the BR signal transduction pathway. A dominant mutation bzr1-1D suppresses BR-deficient and BR-insensitive (bri1) phenotypes and enhances feedback inhibition of BR biosynthesis. BZR1 protein accumulates in the nucleus of elongating cells of dark-grown hypocotyls and is stabilized by BR signaling and the bzr1-1D mutation. Our results demonstrate that BZR1 is a positive regulator of the BR signaling pathway that mediates both downstream BR responses and feedback regulation of BR biosynthesis.


Subject(s)
Arabidopsis Proteins/genetics , Arabidopsis Proteins/metabolism , Arabidopsis/genetics , Nuclear Proteins/genetics , Nuclear Proteins/metabolism , Steroids/biosynthesis , Arabidopsis/metabolism , Cell Nucleus/metabolism , DNA-Binding Proteins , Feedback, Physiological/physiology , Gene Expression Regulation, Developmental , Gene Expression Regulation, Plant , Hypocotyl/cytology , Hypocotyl/metabolism , Molecular Sequence Data , Mutation/physiology , Phenotype , Plant Cells , Plants/metabolism , Sequence Homology, Amino Acid
SELECTION OF CITATIONS
SEARCH DETAIL
...