Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 7 de 7
Filter
Add more filters










Database
Language
Publication year range
1.
bioRxiv ; 2023 Aug 09.
Article in English | MEDLINE | ID: mdl-37609287

ABSTRACT

Cis-regulatory elements (CREs) control gene expression, orchestrating tissue identity, developmental timing, and stimulus responses, which collectively define the thousands of unique cell types in the body. While there is great potential for strategically incorporating CREs in therapeutic or biotechnology applications that require tissue specificity, there is no guarantee that an optimal CRE for an intended purpose has arisen naturally through evolution. Here, we present a platform to engineer and validate synthetic CREs capable of driving gene expression with programmed cell type specificity. We leverage innovations in deep neural network modeling of CRE activity across three cell types, efficient in silico optimization, and massively parallel reporter assays (MPRAs) to design and empirically test thousands of CREs. Through in vitro and in vivo validation, we show that synthetic sequences outperform natural sequences from the human genome in driving cell type-specific expression. Synthetic sequences leverage unique sequence syntax to promote activity in the on-target cell type and simultaneously reduce activity in off-target cells. Together, we provide a generalizable framework to prospectively engineer CREs and demonstrate the required literacy to write regulatory code that is fit-for-purpose in vivo across vertebrates.

2.
Mol Psychiatry ; 15(8): 810-5, 2010 Aug.
Article in English | MEDLINE | ID: mdl-19255578

ABSTRACT

Genetic variants in the brain-derived neurotrophic factor (BDNF) gene, predominantly the functional Val66Met polymorphism, have been associated with risk of bipolar disorder and other psychiatric disorders. However, not all studies support these findings, and overall the evidence for the association of BDNF with disease risk is weak. As differences in population genetic structure between patient samples could cause discrepant or spurious association results, we investigated this possibility by carrying out population genetic analyses of the BDNF genomic region. Substantial variation was detected in BDNF coding region single-nucleotide polymorphism (SNP) allele and haplotype frequencies between 58 global populations, with the derived Met allele of Val66Met ranging in frequency from 0 to 72% across populations. F(ST) analyses to assess diversity in the HapMap populations determined that the Val66Met F(ST) value was at the 99.8th percentile among all SNPs in the genome. As the BDNF population genetic differences may be due to local selection, we performed the long-range haplotype test for selection using 68 SNPs spanning the BDNF genomic region in 12 European-derived pedigrees. Evidence for positive selection was found for a high-frequency Val-carrying haplotype, with a relative extended haplotype homozygosity value above the 99 th percentile compared with HapMap data (P=4.6 x 10(-4)). In conclusion, we observed considerable BDNF allele and haplotype diversity among global populations and evidence for positive selection at the BDNF locus. These phenomena can have a profound impact on the detection of disease susceptibility genes and must be considered in gene association studies of BDNF.


Subject(s)
Brain-Derived Neurotrophic Factor/genetics , Gene Frequency , Genetics, Population/methods , Polymorphism, Single Nucleotide/genetics , Genotype , Humans , Linkage Disequilibrium , Methionine/genetics , Valine/genetics
3.
Genes Immun ; 10(3): 267-72, 2009 Apr.
Article in English | MEDLINE | ID: mdl-19212328

ABSTRACT

Vibrio cholerae causes a dehydrating diarrheal illness that can be rapidly fatal in the absence of specific treatment. The organism is an historic scourge and, like similar infectious diseases, may have influenced the evolution of the human genome. We report here the results of the first candidate gene association study of cholera. In a family-based study of 76 pedigrees from Dhaka, Bangladesh, we evaluated the association between cholera and five candidate genes-the cystic fibrosis transmembrane receptor; lactoferrin; long palate, lung and nasal epithelium clone 1 (LPLUNC1); estrogen-related receptor alpha and calcium-activated chloride channel 1. We found a significant association with a marker in the promoter region of LPLUNC1 (rs11906665), a member of a family of evolutionarily conserved innate immunity proteins. An earlier microarray-based study of duodenal biopsies showed significantly increased expression of LPLUNC1 in cholera patients compared with healthy control subjects. Our results suggest that variation in host innate immune responses may influence the outcome of exposure to V. cholerae in an endemic setting.


Subject(s)
Cholera/genetics , Chromosomes, Human, Pair 20/genetics , Genetic Predisposition to Disease , Adolescent , Adult , Alleles , Bangladesh/epidemiology , Child , Child, Preschool , Cholera/epidemiology , Female , Gene Frequency/genetics , Genotype , Haplotypes/genetics , Humans , Immunity, Innate , Linkage Disequilibrium/genetics , Male , Pedigree , Promoter Regions, Genetic , Vibrio cholerae/immunology , Young Adult
4.
Science ; 312(5780): 1614-20, 2006 Jun 16.
Article in English | MEDLINE | ID: mdl-16778047

ABSTRACT

Positive natural selection is the force that drives the increase in prevalence of advantageous traits, and it has played a central role in our development as a species. Until recently, the study of natural selection in humans has largely been restricted to comparing individual candidate genes to theoretical expectations. The advent of genome-wide sequence and polymorphism data brings fundamental new tools to the study of natural selection. It is now possible to identify new candidates for selection and to reevaluate previous claims by comparison with empirical distributions of DNA sequence variation across the human genome and among populations. The flood of data and analytical methods, however, raises many new challenges. Here, we review approaches to detect positive natural selection, describe results from recent analyses of genome-wide data, and discuss the prospects and challenges ahead as we expand our understanding of the role of natural selection in shaping the human genome.


Subject(s)
Genome, Human , Selection, Genetic , Alleles , Biological Evolution , Gene Frequency , Genetic Variation , Genetics, Population , Haplotypes , Humans , Mutation , Polymorphism, Genetic , Sequence Analysis, DNA
5.
Genes Immun ; 3(5): 286-91, 2002 Aug.
Article in English | MEDLINE | ID: mdl-12140747

ABSTRACT

CD40 ligand (CD40L), a glycoprotein involved in B cell proliferation, antigen presenting cell activation, and Ig class switching, is important in the immune response to infection. Rare coding mutations in CD40L can lead to life-threatening immunodeficiency but the potential for common variants to alter disease susceptibility remains to be explored. To identify polymorphisms in CD40L, we sequenced 2.3 kb of the 5' flanking region and the first exon of the gene in DNA samples from 36 Gambian females and one chimpanzee. Diversity was lower than the average reported for other areas of the X chromosome, and only two polymorphisms were identified. The polymorphisms were genotyped in DNA samples from 957 Gambian individuals, cases and controls from a study of severe malaria. A significant reduction in risk for severe malaria (OR = 0.52, P = 0.002) was associated with males hemizygous for the CD40L-726C. Analysis by transmission disequilibrium test of 371 cases, for whom DNA from both parents was also available, confirmed the result was not due to stratification (P = 0.04). A similar but non-significant trend was found in females. This preliminary association of a common variant in CD40L with a malaria resistance phenotype encourages further genetic characterization of the role of CD40L in infectious disease.


Subject(s)
CD40 Ligand/genetics , Malaria, Falciparum/genetics , Malaria, Falciparum/immunology , Alleles , Animals , Base Sequence , Case-Control Studies , Chromosomes, Human, X/genetics , DNA/genetics , Female , Gambia , Gene Frequency , Genetic Variation , Haplotypes , Humans , Malaria, Falciparum/prevention & control , Male , Molecular Sequence Data , Pan troglodytes , Phenotype , Promoter Regions, Genetic
6.
Nature ; 411(6834): 199-204, 2001 May 10.
Article in English | MEDLINE | ID: mdl-11346797

ABSTRACT

With the availability of a dense genome-wide map of single nucleotide polymorphisms (SNPs), a central issue in human genetics is whether it is now possible to use linkage disequilibrium (LD) to map genes that cause disease. LD refers to correlations among neighbouring alleles, reflecting 'haplotypes' descended from single, ancestral chromosomes. The size of LD blocks has been the subject of considerable debate. Computer simulations and empirical data have suggested that LD extends only a few kilobases (kb) around common SNPs, whereas other data have suggested that it can extend much further, in some cases greater than 100 kb. It has been difficult to obtain a systematic picture of LD because past studies have been based on only a few (1-3) loci and different populations. Here, we report a large-scale experiment using a uniform protocol to examine 19 randomly selected genomic regions. LD in a United States population of north-European descent typically extends 60 kb from common alleles, implying that LD mapping is likely to be practical in this population. By contrast, LD in a Nigerian population extends markedly less far. The results illuminate human history, suggesting that LD in northern Europeans is shaped by a marked demographic event about 27,000-53,000 years ago.


Subject(s)
Chromosome Mapping/methods , Genome, Human , Linkage Disequilibrium/genetics , Polymorphism, Single Nucleotide/genetics , Alleles , Bias , Computer Simulation , Europe/ethnology , Founder Effect , Genetic Diseases, Inborn/genetics , Haplotypes/genetics , Heterozygote , Humans , Models, Genetic , Nigeria , Phylogeny , Racial Groups/genetics , Recombination, Genetic/genetics , Reproducibility of Results , Selection, Genetic , Time Factors , United States
7.
Chem Biol ; 4(10): 767-74, 1997 Oct.
Article in English | MEDLINE | ID: mdl-9375255

ABSTRACT

BACKGROUND: In the past few years numerous binding and catalytic motifs have been isolated from pools of random nucleic acid sequences. To extend the utility of this approach it is important to learn how to design random-sequence pools that provide maximal access to rare activities. In an effort to better define the relative merits of longer and shorter pools (i.e. pools with longer or shorter random-sequence segments), we have examined the inhibitory effect of excess arbitrary sequence on ribozyme activity and have evaluated whether this inhibition overshadows the calculated advantage of longer pools. RESULTS: The calculated advantage of longer sequences was highly dependent on the size and complexity of the desired motif. Small, simple motifs were not much more abundant in longer molecules. In contrast, larger motifs, particularly the most complex (highly modular) motifs, were much more likely to be present in longer molecules. The experimentally determined inhibition of activity by excess sequence was moderate, with bulk effects among four libraries ranging from no effect to 18-fold inhibition. The median effect among 60 clones was fivefold inhibition. CONCLUSIONS: For accessing simple motifs (e.g. motifs at least as small and simple as the hammerhead ribozyme motif), longer pools have little if any advantage. For more complex motifs, the inhibitory effect of excess sequence does not approach the calculated advantage of pools of longer molecules. Thus, when seeking to access rare activities, the length of typical random-sequence pools (< or = 70 random positions) is shorter than optimal. As this conclusion holds over a range of incubation conditions, it may also be relevant when considering the emergence of new functional motifs during early evolution.


Subject(s)
Base Sequence , RNA Ligase (ATP)/genetics , RNA, Catalytic/genetics , Base Composition , Gene Library , Molecular Sequence Data , Nucleic Acid Conformation , Probability , RNA Ligase (ATP)/classification , RNA, Catalytic/classification , Templates, Genetic
SELECTION OF CITATIONS
SEARCH DETAIL
...