Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 24
Filter
Add more filters










Publication year range
1.
Mol Biol Evol ; 40(4)2023 04 04.
Article in English | MEDLINE | ID: mdl-36947103

ABSTRACT

It has been recently suggested that a significant fraction of homomer protein-protein interfaces evolve neutrally, without contributing to function, due to a hydrophobic bias in missense mutations. However, the fraction of such gratuitous complexes is currently unknown. Here, we quantified the fraction of homodimers where multimerization is unlikely to contribute to their biochemical function. We show that: 1) ligand binding-site structure predicts whether a homomer is functional or not; the vast majority of homodimers with multichain binding-sites (MBS) are likely to be functional, while in homodimers with single-chain binding-sites (SBS) and small to medium interfaces, quaternary structure is unlikely to be functional in a significant fraction-35%, even up to 42%-of complexes; 2) the hydrophobicity of interfaces changes little with the strength of selection, and the amino acid composition of interfaces is shaped by the "hydrophobic ratchet" in both types, but they are not in a strict equilibrium with mutations; particularly cysteines are much more abundant in mutations than in interfaces or surfaces; 3) in MBS homomers, the interfaces are conserved, while in a high fraction of SBS homomers, the interface is not more conserved than the solvent-accessible surface; and 4) MBS homomer interfaces coevolve more strongly with ligand binding sites than the interfaces of SBS homomers, and MBS complexes have higher capacity to transfer information from ligands across the interfaces than SBS homomers, explaining the enrichment of allostery in the former.


Subject(s)
Amino Acids , Proteins , Ligands , Proteins/metabolism , Binding Sites/genetics , Protein Domains , Amino Acids/chemistry , Protein Binding/genetics , Protein Structure, Quaternary
2.
PLoS Comput Biol ; 18(2): e1009806, 2022 02.
Article in English | MEDLINE | ID: mdl-35139069

ABSTRACT

Allostery is a form of protein regulation, where ligands that bind sites located apart from the active site can modify the activity of the protein. The molecular mechanisms of allostery have been extensively studied, because allosteric sites are less conserved than active sites, and drugs targeting them are more specific than drugs binding the active sites. Here we quantify the importance of allostery in genetic disease. We show that 1) known allosteric proteins are central in disease networks, contribute to genetic disease and comorbidities much more than non-allosteric proteins, and there is an association between being allosteric and involvement in disease; 2) they are enriched in many major disease types like hematopoietic diseases, cardiovascular diseases, cancers, diabetes, or diseases of the central nervous system; 3) variants from cancer genome-wide association studies are enriched near allosteric proteins, indicating their importance to polygenic traits; and 4) the importance of allosteric proteins in disease is due, at least partly, to their central positions in protein-protein interaction networks, and less due to their dynamical properties.


Subject(s)
Genome-Wide Association Study , Proteins , Allosteric Regulation/genetics , Proteins/chemistry , Allosteric Site , Catalytic Domain
3.
Nucleic Acids Res ; 50(5): 2807-2825, 2022 03 21.
Article in English | MEDLINE | ID: mdl-35188569

ABSTRACT

The Sleeping Beauty (SB) transposon system is a popular tool for genome engineering, but random integration into the genome carries a certain genotoxic risk in therapeutic applications. Here we investigate the role of amino acids H187, P247 and K248 in target site selection of the SB transposase. Structural modeling implicates these three amino acids located in positions analogous to amino acids with established functions in target site selection in retroviral integrases and transposases. Saturation mutagenesis of these residues in the SB transposase yielded variants with altered target site selection properties. Transposon integration profiling of several mutants reveals increased specificity of integrations into palindromic AT repeat target sequences in genomic regions characterized by high DNA bendability. The H187V and K248R mutants redirect integrations away from exons, transcriptional regulatory elements and nucleosomal DNA in the human genome, suggesting enhanced safety and thus utility of these SB variants in gene therapy applications.


Subject(s)
Transposases , Amino Acids/genetics , DNA Transposable Elements/genetics , Humans , Integrases/metabolism , Protein Engineering , Transposases/genetics , Transposases/metabolism
5.
J Mol Biol ; 431(19): 3871-3888, 2019 09 06.
Article in English | MEDLINE | ID: mdl-31306664

ABSTRACT

Ligand binding site structure has profound consequences for the evolution of function of protein complexes, particularly in homomers-complexes comprising multiple copies of the same protein. Previously, we have shown that homomers with multichain binding sites (MBSs) are characterized by more conserved binding sites and quaternary structure, and qualitatively different allosteric pathways than homomers with single-chain binding sites (SBSs) or monomers. Here, using computational methods, we show that the folds of single-domain MBS and SBS homomers are different, and SBS homomers are likely to be folded cotranslationally, while MBS homomers are more likely to form post-translationally and rely on more advanced folding-assistance and quality control mechanisms, which include chaperonins. In addition, our findings demonstrate that MBS homomers are qualitatively different from monomers, while SBS homomers are much less distinct, supporting the hypothesis that the evolution of quaternary structure in SBS homomers is significantly influenced by stochastic processes.


Subject(s)
Multiprotein Complexes/chemistry , Multiprotein Complexes/metabolism , Protein Folding , Proteolysis , Amino Acid Sequence , Binding Sites , Conserved Sequence , Disulfides/metabolism , Gene Ontology , Humans , Ligands , Models, Molecular , Molecular Chaperones/metabolism , Protein Binding , Protein Domains , Protein Interaction Mapping , THP-1 Cells
6.
J Med Chem ; 62(21): 9357-9374, 2019 11 14.
Article in English | MEDLINE | ID: mdl-31188598

ABSTRACT

The spread of antibiotic resistance is one of the most serious global public-health problems. Here we show that a particular class of homomers with binding sites spanning multiple protein chains is particularly suitable for targeting by broad-spectrum antibacterial agents because due to the slow evolutionary change of such binding pockets, ligands of such homomers are much more likely to bind their homologs than ligands of monomers, or homomers with a single-chain binding site. Additionally, using de novo ligand design and deep learning, we show that the chemical compounds that can bind several different receptors have common structural characteristics and that halogens and fragments similar to the building blocks existing antimicrobials are overrepresented in them. Finally, we show that binding multiple receptors selects for flexible compounds, which are less likely to accumulate in Gram-negative bacteria; thus there is trade-off between reducing the emergence of resistance by multitargeting and broad-spectrum antibacterial activity.


Subject(s)
Anti-Bacterial Agents/metabolism , Anti-Bacterial Agents/pharmacology , Drug Design , Gram-Negative Bacteria/drug effects , Gram-Negative Bacteria/metabolism , Ligands , Protein Binding , Protein Multimerization/drug effects
7.
Mol Biol Evol ; 36(8): 1711-1727, 2019 08 01.
Article in English | MEDLINE | ID: mdl-31004156

ABSTRACT

The structure of ligand-binding sites has been shown to profoundly influence the evolution of function in homomeric protein complexes. Complexes with multichain binding sites (MBSs) have more conserved quaternary structure, more similar binding sites and ligands between homologs, and evolve new functions slower than homomers with single-chain binding sites (SBSs). Here, using in silico analyses of protein dynamics, we investigate whether ligand-binding-site structure shapes allosteric signal transduction pathways, and whether the structural similarity of binding sites influences the evolution of allostery. Our analyses show that: 1) allostery is more frequent among MBS complexes than in SBS complexes, particularly in homomers; 2) in MBS homomers, semirigid communities and critical residues frequently connect interfaces and thus they are characterized by signal transduction pathways that cross protein-protein interfaces, whereas SBS homomers usually not; 3) ligand binding alters community structure differently in MBS and SBS homomers; and 4) except MBS homomers, allosteric proteins are more likely to have homologs with similar binding site than nonallosteric proteins, suggesting that binding site similarity is an important factor driving the evolution of allostery.


Subject(s)
Allosteric Site/genetics , Evolution, Molecular , Signal Transduction , Computer Simulation , Humans , Ligands
8.
Cell Rep ; 22(12): 3265-3276, 2018 03 20.
Article in English | MEDLINE | ID: mdl-29562182

ABSTRACT

It has been suggested that the evolution of protein complexes is significantly influenced by stochastic, non-adaptive processes. Using ligand binding as a proxy of function, we show that the structure of ligand-binding sites significantly influences the evolution of protein complexes. We show that homomers with multi-chain binding sites (MBSs) evolve new functions slower than monomers or other homomers, and those binding cofactors and metals have more conserved quaternary structure than other homomers. Moreover, the ligands and ligand-binding pockets of homologous MBS homomers are more similar than monomers and other homomers. Our results suggest strong evolutionary selection for quaternary structure in cofactor-binding MBS homomers, whereas neutral processes are more important in complexes with single-chain binding sites. They also have pharmacological implications, suggesting that complexes with single-chain binding sites are better targets for selective drugs, whereas MBS homomers are good candidates for broad-spectrum antibiotic and multitarget drug design.


Subject(s)
Binding Sites/genetics , Ligands , Protein Binding/genetics , Humans
9.
PLoS Comput Biol ; 12(12): e1005242, 2016 12.
Article in English | MEDLINE | ID: mdl-27935949

ABSTRACT

The rapidly increasing amount of data on human genetic variation has resulted in a growing demand to identify pathogenic mutations computationally, as their experimental validation is currently beyond reach. Here we show that alpha helices and beta strands differ significantly in their ability to tolerate mutations: helices can accumulate more mutations than strands without change, due to the higher numbers of inter-residue contacts in helices. This results in two patterns: a) the same number of mutations causes less structural change in helices than in strands; b) helices diverge more rapidly in sequence than strands within the same domains. Additionally, both helices and strands are significantly more robust than coils. Based on this observation we show that human missense mutations that change secondary structure are more likely to be pathogenic than those that do not. Moreover, inclusion of predicted secondary structure changes shows significant utility for improving upon state-of-the-art pathogenicity predictions.


Subject(s)
Models, Molecular , Mutation , Protein Conformation, alpha-Helical , Protein Conformation, beta-Strand , Amino Acid Sequence , Computational Biology , Humans , Sequence Analysis, Protein
10.
Mol Ther ; 24(8): 1369-77, 2016 08.
Article in English | MEDLINE | ID: mdl-27401040

ABSTRACT

Transposases are important tools in genome engineering, and there is considerable interest in engineering more efficient ones. Here, we seek to understand the factors determining their activity using the Sleeping Beauty transposase. Recent work suggests that protein coevolutionary information can be used to classify groups of physically connected, coevolving residues into elements called "sectors", which have proven useful for understanding the folding, allosteric interactions, and enzymatic activity of proteins. Using extensive mutagenesis data, protein modeling and analysis of folding energies, we show that (i) The Sleeping Beauty transposase contains two sectors, which span across conserved domains, and are enriched in DNA-binding residues, indicating that the DNA binding and endonuclease functions of the transposase coevolve; (ii) Sector residues are highly sensitive to mutations, and most mutations of these residues strongly reduce transposition rate; (iii) Mutations with a strong effect on free energy of folding in the DDE domain of the transposase significantly reduce transposition rate. (iv) Mutations that influence DNA and protein-protein interactions generally reduce transposition rate, although most hyperactive mutants are also located on the protein surface, including residues with protein-protein interactions. This suggests that hyperactivity results from the modification of protein interactions, rather than the stabilization of protein fold.


Subject(s)
DNA Transposable Elements , Transposases/genetics , Transposases/metabolism , Carrier Proteins/metabolism , Conserved Sequence , Genetic Vectors/genetics , Homologous Recombination , Models, Molecular , Mutagenesis , Mutation , Protein Binding , Protein Conformation , Protein Interaction Domains and Motifs , Protein Multimerization , Protein Stability , Structure-Activity Relationship , Transposases/chemistry
11.
Genetics ; 195(4): 1407-17, 2013 Dec.
Article in English | MEDLINE | ID: mdl-24056411

ABSTRACT

It has been recently discovered that new genes can originate de novo from noncoding DNA, and several biological traits including expression or sequence composition form a continuum from noncoding sequences to conserved genes. In this article, using yeast genes I test whether the integration of new genes into cellular networks and their structural maturation shows such a continuum by analyzing their changes with gene age. I show that 1) The number of regulatory, protein-protein, and genetic interactions increases continuously with gene age, although with very different rates. New regulatory interactions emerge rapidly within a few million years, while the number of protein-protein and genetic interactions increases slowly, with a rate of 2-2.25 × 10(-8)/year and 4.8 × 10(-8)/year, respectively. 2) Gene essentiality evolves relatively quickly: the youngest essential genes appear in proto-genes ∼14 MY old. 3) In contrast to interactions, the secondary structure of proteins and their robustness to mutations indicate that new genes face a bottleneck in their evolution: proto-genes are characterized by high ß-strand content, high aggregation propensity, and low robustness against mutations, while conserved genes are characterized by lower strand content and higher stability, most likely due to the higher probability of gene loss among young genes and accumulation of neutral mutations.


Subject(s)
Ascomycota/genetics , Evolution, Molecular , Fungal Proteins/metabolism , Gene Regulatory Networks , Genes, Fungal , Amino Acid Sequence , Ascomycota/classification , Ascomycota/metabolism , Binding Sites , Fungal Proteins/chemistry , Fungal Proteins/genetics , Molecular Sequence Data , Phylogeny , Protein Binding , Protein Conformation
12.
J Biol Chem ; 288(22): 16127-38, 2013 May 31.
Article in English | MEDLINE | ID: mdl-23530042

ABSTRACT

Despite the considerable amount of research on transposable elements, no large-scale structural analyses of the TE proteome have been performed so far. We predicted the structures of hundreds of proteins from a representative set of DNA and LINE transposable elements and used the obtained structural data to provide the first general structural characterization of TE proteins and to estimate the frequency of TE domestication and horizontal transfer events. We show that 1) ORF1 and Gag proteins of retrotransposons contain high amounts of structural disorder; thus, despite their very low conservation, the presence of disordered regions and probably their chaperone function is conserved. 2) The distribution of SCOP classes in DNA transposons and LINEs indicates that the proteins of DNA transposons are more ancient, containing folds that already existed when the first cellular organisms appeared. 3) DNA transposon proteins have lower contact order than randomly selected reference proteins, indicating rapid folding, most likely to avoid protein aggregation. 4) Structure-based searches for TE homologs indicate that the overall frequency of TE domestication events is low, whereas we found a relatively high number of cases where horizontal transfer, frequently involving parasites, is the most likely explanation for the observed homology.


Subject(s)
DNA Transposable Elements , Gene Products, gag/genetics , Long Interspersed Nucleotide Elements , Protein Folding , Sequence Analysis, Protein , Structural Homology, Protein , Gene Products, gag/chemistry , Humans , Protein Structure, Tertiary
13.
Nucleic Acids Res ; 41(5): 3190-200, 2013 Mar 01.
Article in English | MEDLINE | ID: mdl-23341038

ABSTRACT

The numerous discovered cases of domesticated transposable element (TE) proteins led to the recognition that TEs are a significant source of evolutionary innovation. However, much less is known about the reverse process, whether and to what degree the evolution of TEs is influenced by the genome of their hosts. We addressed this issue by searching for cases of incorporation of host genes into the sequence of TEs and examined the systems-level properties of these genes using the Saccharomyces cerevisiae and Drosophila melanogaster genomes. We identified 51 cases where the evolutionary scenario was the incorporation of a host gene fragment into a TE consensus sequence, and we show that both the yeast and fly homologues of the incorporated protein sequences have central positions in the cellular networks. An analysis of selective pressure (Ka/Ks ratio) detected significant selection in 37% of the cases. Recent research on retrovirus-host interactions shows that virus proteins preferentially target hubs of the host interaction networks enabling them to take over the host cell using only a few proteins. We propose that TEs face a similar evolutionary pressure to evolve proteins with high interacting capacities and take some of the necessary protein domains directly from their hosts.


Subject(s)
DNA Transposable Elements/genetics , Drosophila melanogaster/genetics , Gene Regulatory Networks , Saccharomyces cerevisiae/genetics , Amino Acid Sequence , Animals , Drosophila Proteins/genetics , Evolution, Molecular , Genome, Fungal , Genome, Insect , Likelihood Functions , Models, Genetic , Models, Molecular , Molecular Sequence Data , Monte Carlo Method , Phylogeny , Protein Structure, Tertiary , Saccharomyces cerevisiae Proteins/genetics
14.
Biol Direct ; 7: 41, 2012 Nov 23.
Article in English | MEDLINE | ID: mdl-23176288

ABSTRACT

UNLABELLED: It has been recently discovered that transposable elements show high activity in the brain of mammals, however, the magnitude of their influence on its functioning is unclear so far. In this paper, I use flux balance analysis to examine the influence of somatic retrotransposition on brain metabolism, and the biosynthesis of its key metabolites, including neurotransmitters. The analysis shows that somatic transposition in the human brain can influence the biosynthesis of more than 250 metabolites, including dopamine, serotonin and glutamate, shows large inter-individual variability in metabolic effects, and may contribute to the development of Parkinson's disease and schizophrenia. REVIEWERS: This article was reviewed by Dr Kenji Kojima (nominated by Dr Jerzy Jurka) and Dr Eugene Koonin.


Subject(s)
Brain/metabolism , DNA Transposable Elements , Neurotransmitter Agents/biosynthesis , Parkinson Disease/genetics , Parkinson Disease/metabolism , Schizophrenia/genetics , Schizophrenia/metabolism , Computational Biology , Dopamine/biosynthesis , Dopamine/metabolism , Glutamic Acid/biosynthesis , Glutamic Acid/metabolism , Humans , Neurotransmitter Agents/metabolism , Organ Specificity , Serotonin/biosynthesis , Serotonin/metabolism
15.
Bioinformatics ; 25(10): 1329-30, 2009 May 15.
Article in English | MEDLINE | ID: mdl-19349283

ABSTRACT

MOTIVATION: The large number of sequenced genomes required the development of software that reconstructs the consensus sequences of transposons and other repetitive elements. However, the available tools usually focus on the accurate identification of raw repeats and provide no information about the taxonomic position of the reconstructed consensi. TEclass is a tool to classify unknown transposable elements into their four main functional categories, which reflect their mode of transposition: DNA transposons, long terminal repeats (LTRs), long interspersed nuclear elements (LINEs) and short interspersed nuclear elements (SINEs). TEclass uses machine learning support vector machine (SVM) for classification based on oligomer frequencies. It achieves 90-97% accuracy in the classification of novel DNA and LTR repeats, and 75% for LINEs and SINEs. AVAILABILITY: http://www.compgen.uni-muenster.de/teclass, stand alone program upon request.


Subject(s)
Computational Biology/methods , DNA Transposable Elements , Software , Eukaryotic Cells , Repetitive Sequences, Nucleic Acid , Short Interspersed Nucleotide Elements , Terminal Repeat Sequences
16.
BMC Genomics ; 9: 533, 2008 Nov 07.
Article in English | MEDLINE | ID: mdl-18992157

ABSTRACT

BACKGROUND: Tandemly Repeated DNA represents a large portion of the human genome, and accounts for a significant amount of copy number variation. Here we present a genome wide analysis of the largest tandem repeats found in the human genome sequence. RESULTS: Using Tandem Repeats Finder (TRF), tandem repeat arrays greater than 10 kb in total size were identified, and classified into simple sequence e.g. GAATG, classical satellites e.g. alpha satellite DNA, and locus specific VNTR arrays. Analysis of these large sequenced regions revealed that several "simple sequence" arrays actually showed complex domain and/or higher order repeat organization. Using additional methods, we further identified a total of 96 additional arrays with tandem repeat units greater than 2 kb (the detection limit of TRF), 53 of which contained genes or repeated exons. The overall size of an array of tandem 12 kb repeats which spanned a gap on chromosome 8 was found to be 600 kb to 1.7 Mbp in size, representing one of the largest non-centromeric arrays characterized. Several novel megasatellite tandem DNA families were observed that are characterized by repeating patterns of interspersed transposable elements that have expanded presumably by unequal crossing over. One of these families is found on 11 different chromosomes in >25 arrays, and represents one of the largest most widespread megasatellite DNA families. CONCLUSION: This study represents the most comprehensive genome wide analysis of large tandem repeats in the human genome, and will serve as an important resource towards understanding the organization and copy number variation of these complex DNA families.


Subject(s)
Genome, Human , Tandem Repeat Sequences/genetics , Chromosomes, Human/genetics , DNA/chemistry , DNA, Satellite/genetics , Gene Dosage , Genetic Variation , Humans , In Situ Hybridization, Fluorescence , Retroelements
17.
PLoS Genet ; 4(8): e1000172, 2008 Aug 29.
Article in English | MEDLINE | ID: mdl-18769724

ABSTRACT

It has been hypothesised that the massive accumulation of L1 transposable elements on the X chromosome is due to their function in X inactivation, and that the accumulation of Alu elements near genes is adaptive. We tested the possible selective advantage of these two transposable element (TE) families with a novel method, interruption analysis. In mammalian genomes, a large number of TEs interrupt other TEs due to the high overall abundance and age of repeats, and these interruptions can be used to test whether TEs are selectively neutral. Interruptions of TEs, which are beneficial for the host, are expected to be deleterious and underrepresented compared with neutral ones. We found that L1 elements in the regions of the X chromosome that contain the majority of the inactivated genes are significantly less frequently interrupted than on the autosomes, while L1s near genes that escape inactivation are interrupted with higher frequency, supporting the hypothesis that L1s on the X chromosome play a role in its inactivation. In addition, we show that TEs are less frequently interrupted in introns than in intergenic regions, probably due to selection against the expansion of introns, but the insertion pattern of Alus is comparable to other repeats.


Subject(s)
DNA Transposable Elements , Long Interspersed Nucleotide Elements , Selection, Genetic , X Chromosome/genetics , Alu Elements , Animals , Chromosomes, Human, X , Gene Silencing , Humans , Introns , Molecular Sequence Data , Opossums/genetics
18.
Genetics ; 178(1): 573-81, 2008 Jan.
Article in English | MEDLINE | ID: mdl-17947446

ABSTRACT

The genomes of birds are much smaller than mammalian genomes, and transposable elements (TEs) make up only 10% of the chicken genome, compared with the 45% of the human genome. To study the mechanisms that constrain the copy numbers of TEs, and as a consequence the genome size of birds, we analyzed the distributions of LINEs (CR1's) and SINEs (MIRs) on the chicken autosomes and Z chromosome. We show that (1) CR1 repeats are longest on the Z chromosome and their length is negatively correlated with the local GC content; (2) the decay of CR1 elements is highly biased, and the 5'-ends of the insertions are lost much faster than their 3'-ends; (3) the GC distribution of CR1 repeats shows a bimodal pattern with repeats enriched in both AT-rich and GC-rich regions of the genome, but the CR1 families show large differences in their GC distribution; and (4) the few MIRs in the chicken are most abundant in regions with intermediate GC content. Our results indicate that the primary mechanism that removes repeats from the chicken genome is ectopic exchange and that the low abundance of repeats in avian genomes is likely to be the consequence of their high recombination rates.


Subject(s)
Chickens/genetics , Genome/genetics , Long Interspersed Nucleotide Elements/genetics , Animals , Base Composition/genetics , Bias , Biological Evolution , Chromosomes/genetics , Short Interspersed Nucleotide Elements/genetics
19.
PLoS Comput Biol ; 3(7): e137, 2007 Jul.
Article in English | MEDLINE | ID: mdl-17630829

ABSTRACT

The constant bombardment of mammalian genomes by transposable elements (TEs) has resulted in TEs comprising at least 45% of the human genome. Because of their great age and abundance, TEs are important in comparative phylogenomics. However, estimates of TE age were previously based on divergence from derived consensus sequences or phylogenetic analysis, which can be unreliable, especially for older more diverged elements. Therefore, a novel genome-wide analysis of TE organization and fragmentation was performed to estimate TE age independently of sequence composition and divergence or the assumption of a constant molecular clock. Analysis of TEs in the human genome revealed approximately 600,000 examples where TEs have transposed into and fragmented other TEs, covering >40% of all TEs or approximately 542 Mbp of genomic sequence. The relative age of these TEs over evolutionary time is implicit in their organization, because newer TEs have necessarily transposed into older TEs that were already present. A matrix of the number of times that each TE has transposed into every other TE was constructed, and a novel objective function was developed that derived the chronological order and relative ages of human TEs spanning >100 million years. This method has been used to infer the relative ages across all four major TE classes, including the oldest, most diverged elements. Analysis of DNA transposons over the history of the human genome has revealed the early activity of some MER2 transposons, and the relatively recent activity of MER1 transposons during primate lineages. The TEs from six additional mammalian genomes were defragmented and analyzed. Pairwise comparison of the independent chronological orders of TEs in these mammalian genomes revealed species phylogeny, the fact that transposons shared between genomes are older than species-specific transposons, and a subset of TEs that were potentially active during periods of speciation.


Subject(s)
DNA Mutational Analysis/methods , DNA Transposable Elements/genetics , Evolution, Molecular , Mammals/genetics , Animals , Base Pairing , Base Sequence , Cattle , Chronobiology Phenomena/genetics , Dogs , Genetic Speciation , Genome, Human , Haplorhini/genetics , Humans , Mice , Models, Genetic , Molecular Sequence Data , Phylogeny , Rats
20.
Genome Biol ; 8(7): R148, 2007.
Article in English | MEDLINE | ID: mdl-17651496

ABSTRACT

BACKGROUND: Mammalian centromere formation is dependent on chromatin that contains centromere protein (CENP)-A, which is the centromere-specific histone H3 variant. Human neocentromeres have acquired CENP-A chromatin epigenetically in ectopic chromosomal locations on low-copy complex DNA. Neocentromeres permit detailed investigation of centromeric chromatin organization that is not possible in the highly repetitive alpha satellite DNA present at endogenous centromeres. RESULTS: We have examined the distribution of CENP-A, as well as two additional centromeric chromatin-associated proteins (CENP-C and CENP-H), across neocentromeric DNA using chromatin immunoprecipitation (ChIP) on CHIP assays on custom genomic microarrays at three different resolutions. Analysis of two neocentromeres using a contiguous bacterial artificial chromosome (BAC) microarray spanning bands 13q31.3 to 13q33.1 shows that both CENP-C and CENP-H co-localize to the CENP-A chromatin domain. Using a higher resolution polymerase chain reaction (PCR)-amplicon microarray spanning the neocentromere, we find that the CENP-A chromatin is discontinuous, consisting of a major domain of about 87.8 kilobases (kb) and a minor domain of about 13.2 kb, separated by an approximately 158 kb region devoid of CENPs. Both CENP-A domains exhibit co-localization of CENP-C and CENP-H, defining a distinct inner kinetochore chromatin structure that is consistent with higher order chromatin looping models at centromeres. The PCR microarray data suggested varying density of CENP-A nucleosomes across the major domain, which was confirmed using a higher resolution oligo-based microarray. CONCLUSION: Centromeric chromatin consists of several CENP-A subdomains with highly discontinuous CENP-A chromatin at both the level of individual nucleosomes and at higher order chromatin levels, raising questions regarding the overall structure of centromeric chromatin.


Subject(s)
Autoantigens/analysis , Centromere/chemistry , Chromatin/chemistry , Chromosomal Proteins, Non-Histone/analysis , Base Sequence , Centromere Protein A , Chromatin Immunoprecipitation , Chromosomes, Human, Pair 13/chemistry , Humans , Nucleosomes/chemistry , Sequence Analysis, DNA
SELECTION OF CITATIONS
SEARCH DETAIL
...