Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 124
Filter
Add more filters










Publication year range
1.
Theor Appl Genet ; 108(2): 238-42, 2004 Jan.
Article in English | MEDLINE | ID: mdl-13679980

ABSTRACT

We developed SCAR primers based on isolated and sequenced male-specific fragments as identified in an AFLP analysis of the dioecious plant Rumex nivalis. PCR amplification using these primers on females and males resulted in fragments exclusively present in males. Co-amplification of the nuclear rDNA internal transcribed spacer 2 together with the male-specific fragment was applied as an internal control for successful PCR reactions to avoid false-negative sex scoring. With a length of about 164 bp, the AFLP fragment was of a similar size as the tandemly arranged, repetitive sequences of 180 bp located on the Y chromosomes of Rumex acetosa. The genetic distances between the Y-chromosomal sequences of R. nivalis and R. acetosa, both members of the section Acetosa, were substantial. We found intra-individual divergence among cloned sequences of the male-specific fragment in R. nivalis. The patterns of interspecific and intra-individual sequence variation found are in accordance with proposed modes of the evolution of sex chromosomes. Y chromosomes possibly arose only once in the genus Rumex and consist mainly of heterochromatic DNA. Due to the almost complete absence of selection on them, Y chromosomes are likely to accumulate large numbers of mutations.


Subject(s)
Evolution, Molecular , Genetic Markers , Rumex/genetics , Sex Chromosomes , DNA, Plant/genetics , Phylogeny
2.
J Exp Bot ; 54(388): 1645-54, 2003 Jul.
Article in English | MEDLINE | ID: mdl-12754262

ABSTRACT

When seeds germinate nearly all the proteins are degraded in senescing storage tissue cells. All these proteins act as amino acid reserves which are mobilized to nourish the seedling. Nevertheless, the major amount of the seeds' protein reserve consists of a few enzymatically inactive, abundant, genuine storage proteins. In their metabolism the conflicting processes of biosynthesis, protein turnover and breakdown, are temporally separated. No degradation of correctly formed storage proteins was observed at the time of synthesis and accumulation during seed maturation. Breakdown takes place after a (long) period of rest when seeds germinate and seedlings start growing. At that time genuine storage proteins are no longer synthesized. Genuine storage proteins have evolved structural features permitting controlled temporal patterns of protection and proteolysis. The acquisition of inserted sequence stretches as sites accessible to limited proteolysis played a key role in the evolution of this control system and happened in coevolution of genuine storage proteins with specific proteinases. This can be deduced from the results of current research on the mechanisms of limited and unlimited proteolysis of storage globulins and on storage globulin evolution. The evolved system of controlled structure-function interplay between storage globulins and proteinases is part of a syndrome that, in addition, comprises differential compartmentation and gene expression of storage proteins and proteinases for controlling the total spatial and temporal patterns of globulin storage and mobilization in maturing and germinating seeds.


Subject(s)
Evolution, Molecular , Globulins/metabolism , Plant Proteins/genetics , Seeds/metabolism , Amino Acid Sequence , Globulins/chemistry , Globulins/genetics , Glycoproteins/chemistry , Glycoproteins/genetics , Glycoproteins/metabolism , Molecular Sequence Data , Phylogeny , Plant Proteins/chemistry , Plant Proteins/metabolism , Seed Storage Proteins , Seeds/genetics , Seeds/growth & development , Sequence Homology, Amino Acid , Legumins
3.
Infect Immun ; 71(5): 2775-86, 2003 May.
Article in English | MEDLINE | ID: mdl-12704152

ABSTRACT

We determined the complete genome sequence of Shigella flexneri serotype 2a strain 2457T (4,599,354 bp). Shigella species cause >1 million deaths per year from dysentery and diarrhea and have a lifestyle that is markedly different from those of closely related bacteria, including Escherichia coli. The genome exhibits the backbone and island mosaic structure of E. coli pathogens, albeit with much less horizontally transferred DNA and lacking 357 genes present in E. coli. The strain is distinctive in its large complement of insertion sequences, with several genomic rearrangements mediated by insertion sequences, 12 cryptic prophages, 372 pseudogenes, and 195 S. flexneri-specific genes. The 2457T genome was also compared with that of a recently sequenced S. flexneri 2a strain, 301. Our data are consistent with Shigella being phylogenetically indistinguishable from E. coli. The S. flexneri-specific regions contain many genes that could encode proteins with roles in virulence. Analysis of these will reveal the genetic basis for aspects of this pathogenic organism's distinctive lifestyle that have yet to be explained.


Subject(s)
Genome, Bacterial , Genomics , Shigella flexneri/genetics , Base Sequence , DNA Transposable Elements , Genes, Bacterial , Molecular Sequence Data , Phylogeny , Plasmids , Shigella flexneri/classification , Shigella flexneri/pathogenicity
4.
Proc Natl Acad Sci U S A ; 99(26): 17020-4, 2002 Dec 24.
Article in English | MEDLINE | ID: mdl-12471157

ABSTRACT

We present the complete genome sequence of uropathogenic Escherichia coli, strain CFT073. A three-way genome comparison of the CFT073, enterohemorrhagic E. coli EDL933, and laboratory strain MG1655 reveals that, amazingly, only 39.2% of their combined (nonredundant) set of proteins actually are common to all three strains. The pathogen genomes are as different from each other as each pathogen is from the benign strain. The difference in disease potential between O157:H7 and CFT073 is reflected in the absence of genes for type III secretion system or phage- and plasmid-encoded toxins found in some classes of diarrheagenic E. coli. The CFT073 genome is particularly rich in genes that encode potential fimbrial adhesins, autotransporters, iron-sequestration systems, and phase-switch recombinases. Striking differences exist between the large pathogenicity islands of CFT073 and two other well-studied uropathogenic E. coli strains, J96 and 536. Comparisons indicate that extraintestinal pathogenic E. coli arose independently from multiple clonal lineages. The different E. coli pathotypes have maintained a remarkable synteny of common, vertically evolved genes, whereas many islands interrupting this common backbone have been acquired by different horizontal transfer events in each strain.


Subject(s)
Escherichia coli/genetics , Genome, Bacterial , Pyelonephritis/microbiology , Acute Disease , Base Sequence , Escherichia coli/pathogenicity , Female , Genetic Structures , Humans , Molecular Sequence Data , Open Reading Frames
5.
Mol Ecol ; 11(10): 2027-36, 2002 Oct.
Article in English | MEDLINE | ID: mdl-12296946

ABSTRACT

Polymerase chain reaction-restriction fragment length polymorphisms (PCR-RFLPs) and sequence analysis of noncoding regions of chloroplast DNA were used to investigate 37 populations of Eritrichium nanum covering its total distribution area, the European Alps. There was no haplotypic variation within the populations, and most haplotypes were restricted to single sites or to neighbouring populations, suggesting low levels of long distance gene flow via seeds. The present geographical distribution of haplotypes probably reflects an ancient geographical pattern within two regions in the intensely glaciated western and eastern central Alps identified as genetic hotspot areas. These two regions contained seven of the total of 11 haplotypes, including many of the most derived ones. The divergent haplotypes formed closely related groups, which supported a separate evolution of these haplotypes in these two regions and, more importantly, gave strong evidence for the in situ survival of these populations on nunataks within the western and eastern central Alps during Pleistocene glaciation. This result is in concordance with a previous study on E. nanum using nuclear markers. Only one haplotype was common and widespread throughout the distributional range of E. nanum. At the same time, it was the evolutionarily basal-most and all other haplotypes were best described as its descendants. This haplotype is hypothesized to be genetically identical to a Tertiary Alpine colonizing ancestor, whose distribution was secondarily fragmented and infiltrated by derived haplotypes originating through local mutations.


Subject(s)
Altitude , DNA, Chloroplast/genetics , Genetics, Population , Ice , Magnoliopsida/genetics , Europe , Haplotypes , Magnoliopsida/growth & development , Molecular Sequence Data , Polymerase Chain Reaction , Polymorphism, Restriction Fragment Length , Sequence Analysis, DNA
6.
Genome Res ; 11(9): 1584-93, 2001 Sep.
Article in English | MEDLINE | ID: mdl-11544203

ABSTRACT

We have constructed NheI and XhoI optical maps of Escherichia coli O157:H7 solely from genomic DNA molecules to provide a uniquely valuable scaffold for contig closure and sequence validation. E. coli O157:H7 is a common pathogen found in contaminated food and water. Our approach obviated the need for the analysis of clones, PCR products, and hybridizations, because maps were constructed from ensembles of single DNA molecules. Shotgun sequencing of bacterial genomes remains labor-intensive, despite advances in sequencing technology. This is partly due to manual intervention required during the last stages of finishing. The applicability of optical mapping to this problem was enhanced by advances in machine vision techniques that improved mapping throughput and created a path to full automation of mapping. Comparisons were made between maps and sequence data that characterized sequence gaps and guided nascent assemblies.


Subject(s)
Contig Mapping/methods , Escherichia coli O157/genetics , Genome, Bacterial , Restriction Mapping/methods , Sequence Analysis, DNA/methods , Software
7.
Mol Phylogenet Evol ; 19(3): 331-44, 2001 Jun.
Article in English | MEDLINE | ID: mdl-11399144

ABSTRACT

Many species of the paleotropical pioneer tree genus Macaranga Thou. (Euphorbiaceae) live in association with ants. Various types of mutualistic interactions exist, ranging from the attraction of unspecific ant visitors to obligate myrmecophytism. In the latter, nesting space and food bodies are exchanged for protection by highly specific ant partners (mainly species of the myrmicine genus Crematogaster). As a first step toward elucidating the coevolution of ant-plant interactions in the Macaranga-Crematogaster system, we have initiated a molecular investigation of the plant partners' phylogeny. Nuclear ribosomal DNA internal transcribed spacer (ITS) sequences were analyzed for 73 accessions from 47 Macaranga species, representing 17 sections or informally described species groups. Three accessions from the putative sister taxon Mallotus Lour, were included as outgroups. Cladograms of the ITS data revealed Macaranga to be nested within Mallotus. ITS sequences are highly similar within section Pachystemon s.str., suggesting a relatively recent and rapid radiation of obligate myrmecophytes within this section. Forty-three accessions, mainly of ant-inhabited species, were additionally investigated by random amplified polymorphic DNA (RAPD) and microsatellite-primed PCR (MP-PCR) techniques. Phenetic analysis of RAPD and MP-PCR banding profiles generally confirmed the ITS results. Best resolutions for individual clades were obtained when ITS and RAPD/MP-PCR data were combined into a single matrix and analyzed phenetically. The combined analysis suggests multiple (four) rather than a single evolutionary origin of myrmecophytism, at least one reversal from obligate myrmecophytism to nonmyrmecophytism, and one loss of mutualistic specifity.


Subject(s)
Euphorbiaceae/genetics , Phylogeny , DNA, Plant/chemistry , DNA, Plant/genetics , DNA, Ribosomal Spacer/genetics , Euphorbiaceae/classification , Evolution, Molecular , Molecular Sequence Data , RNA, Ribosomal, 5.8S/genetics , Random Amplified Polymorphic DNA Technique , Sequence Analysis, DNA
8.
J Comput Biol ; 8(1): 37-52, 2001.
Article in English | MEDLINE | ID: mdl-11339905

ABSTRACT

We consider the problem of inferring fold changes in gene expression from cDNA microarray data. Standard procedures focus on the ratio of measured fluorescent intensities at each spot on the microarray, but to do so is to ignore the fact that the variation of such ratios is not constant. Estimates of gene expression changes are derived within a simple hierarchical model that accounts for measurement error and fluctuations in absolute gene expression levels. Significant gene expression changes are identified by deriving the posterior odds of change within a similar model. The methods are tested via simulation and are applied to a panel of Escherichia coli microarrays.


Subject(s)
Gene Expression , Models, Theoretical , Oligonucleotide Array Sequence Analysis/methods , Algorithms , Bayes Theorem , Escherichia coli/genetics , Gene Expression Profiling/methods , Models, Statistical
9.
Infect Immun ; 69(5): 3271-85, 2001 May.
Article in English | MEDLINE | ID: mdl-11292750

ABSTRACT

The complete sequence analysis of the 210-kb Shigella flexneri 5a virulence plasmid was determined. Shigella spp. cause dysentery and diarrhea by invasion and spread through the colonic mucosa. Most of the known Shigella virulence determinants are encoded on a large plasmid that is unique to virulent strains of Shigella and enteroinvasive Escherichia coli; these known genes account for approximately 30 to 35% of the virulence plasmid. In the complete sequence of the virulence plasmid, 286 open reading frames (ORFs) were identified. An astonishing 153 (53%) of these were related to known and putative insertion sequence (IS) elements; no known bacterial plasmid has previously been described with such a high proportion of IS elements. Four new IS elements were identified. Fifty putative proteins show no significant homology to proteins of known function; of these, 18 have a G+C content of less than 40%, typical of known virulence genes on the plasmid. These 18 constitute potentially unknown virulence genes. Two alleles of shet2 and five alleles of ipaH were also identified on the plasmid. Thus, the plasmid sequence suggests a remarkable history of IS-mediated acquisition of DNA across bacterial species. The complete sequence will permit targeted characterization of potential new Shigella virulence determinants.


Subject(s)
DNA, Bacterial/chemistry , Plasmids , Shigella flexneri/genetics , Shigella flexneri/pathogenicity , Amino Acid Sequence , Base Sequence , Biological Evolution , DNA Transposable Elements , Humans , Molecular Sequence Data , Open Reading Frames , Replicon , Virulence
10.
Appl Environ Microbiol ; 67(4): 1911-21, 2001 Apr.
Article in English | MEDLINE | ID: mdl-11282649

ABSTRACT

In an effort to efficiently discover genes in the diazotrophic endophyte of maize, Klebsiella pneumoniae 342, DNA from strain 342 was hybridized to a microarray containing 96% (n = 4,098) of the annotated open reading frames from Escherichia coli K-12. Using a criterion of 55% identity or greater, 3,000 (70%) of the E. coli K-12 open reading frames were also found to be present in strain 342. Approximately 24% (n = 1,030) of the E. coli K-12 open reading frames are absent in strain 342. For 1.6% (n = 68) of the open reading frames, the signal was too low to make a determination regarding the presence or absence of the gene. Genes with high identity between the two organisms are those involved in energy metabolism, amino acid metabolism, fatty acid metabolism, cofactor synthesis, cell division, DNA replication, transcription, translation, transport, and regulatory proteins. Functions that were less highly conserved included carbon compound metabolism, membrane proteins, structural proteins, putative transport proteins, cell processes such as adaptation and protection, and central intermediary metabolism. Open reading frames of E. coli K-12 with little or no identity in strain 342 included putative regulatory proteins, putative chaperones, surface structure proteins, mobility proteins, putative enzymes, hypothetical proteins, and proteins of unknown function, as well as genes presumed to have been acquired by lateral transfer from sources such as phage, plasmids, or transposons. The results were in agreement with the physiological properties of the two strains. Whole genome comparisons by genomic interspecies microarray hybridization are shown to rapidly identify thousands of genes in a previously uncharacterized bacterial genome provided that the genome of a close relative has been fully sequenced. This approach will become increasingly more useful as more full genome sequences become available.


Subject(s)
Escherichia coli/genetics , Klebsiella pneumoniae/genetics , Oligonucleotide Array Sequence Analysis , Open Reading Frames , Zea mays/microbiology , Escherichia coli/metabolism , Gene Expression Profiling , Genes, Bacterial/genetics , Genome, Bacterial , Klebsiella pneumoniae/metabolism
11.
Nature ; 409(6819): 529-33, 2001 Jan 25.
Article in English | MEDLINE | ID: mdl-11206551

ABSTRACT

The bacterium Escherichia coli O157:H7 is a worldwide threat to public health and has been implicated in many outbreaks of haemorrhagic colitis, some of which included fatalities caused by haemolytic uraemic syndrome. Close to 75,000 cases of O157:H7 infection are now estimated to occur annually in the United States. The severity of disease, the lack of effective treatment and the potential for large-scale outbreaks from contaminated food supplies have propelled intensive research on the pathogenesis and detection of E. coli O157:H7 (ref. 4). Here we have sequenced the genome of E. coli O157:H7 to identify candidate genes responsible for pathogenesis, to develop better methods of strain detection and to advance our understanding of the evolution of E. coli, through comparison with the genome of the non-pathogenic laboratory strain E. coli K-12 (ref. 5). We find that lateral gene transfer is far more extensive than previously anticipated. In fact, 1,387 new genes encoded in strain-specific clusters of diverse sizes were found in O157:H7. These include candidate virulence factors, alternative metabolic capacities, several prophages and other new functions--all of which could be targets for surveillance.


Subject(s)
Escherichia coli O157/genetics , Genome, Bacterial , Base Sequence , Chromosome Mapping , Chromosomes, Bacterial , Escherichia coli Infections/microbiology , Escherichia coli O157/pathogenicity , Genetic Variation , Humans , Molecular Sequence Data , Polymorphism, Genetic , Sequence Analysis, DNA , Species Specificity , Virulence/genetics
12.
J Bacteriol ; 183(2): 545-56, 2001 Jan.
Article in English | MEDLINE | ID: mdl-11133948

ABSTRACT

A nearly complete collection of 4,290 Escherichia coli open reading frames was amplified and arrayed in high density on glass slides. To exploit this reagent, conditions for RNA isolation from E. coli cells, cDNA production with attendant fluorescent dye incorporation, DNA-DNA hybridization, and hybrid quantitation have been established. A brief isopropyl-beta-D-thiogalactopyranoside (IPTG) treatment elevated lacZ, lacY, and lacA transcript content about 30-fold; in contrast, most other transcript titers remained unchanged. Distinct RNA expression patterns between E. coli cultures in the exponential and transitional phases of growth were catalogued, as were differences associated with culturing in minimal and rich media. The relative abundance of each transcript was estimated by using hybridization of a genomic DNA-derived, fluorescently labeled probe as a correction factor. This inventory provided a quantitative view of the steady-state level of each mRNA species. Genes the expression of which was detected by this method were enumerated, and results were compared with the current understanding of E. coli physiology.


Subject(s)
Escherichia coli/genetics , Gene Expression Profiling/methods , Oligonucleotide Array Sequence Analysis , Amino Acids/biosynthesis , Culture Media , Gene Expression Regulation, Bacterial , Genes, Bacterial , Isopropyl Thiogalactoside , Operon , RNA, Bacterial/analysis , RNA, Messenger/analysis , Ribosomal Proteins/genetics
13.
Nat Biotechnol ; 18(12): 1262-8, 2000 Dec.
Article in English | MEDLINE | ID: mdl-11101804

ABSTRACT

We have developed a high-resolution "genome array" for the study of gene expression and regulation in Escherichia coli. This array contains on average one 25-mer oligonucleotide probe per 30 base pairs over the entire genome, with one every 6 bases for the intergenic regions and every 60 bases for the 4,290 open reading frames (ORFs). Twofold concentration differences can be detected at levels as low as 0.2 messenger RNA (mRNA) copies per cell, and differences can be seen over a dynamic range of three orders of magnitude. In rich medium we detected transcripts for 97% and 87% of the ORFs in stationary and log phases, respectively. We found that 1, 529 transcripts were differentially expressed under these conditions. As expected, genes involved in translation were expressed at higher levels in log phase, whereas many genes known to be involved in the starvation response were expressed at higher levels in stationary phase. Many previously unrecognized growth phase-regulated genes were identified, such as a putative receptor (b0836) and a 30S ribosomal protein subunit (S22), both of which are highly upregulated in stationary phase. Transcription of between 3,000 and 4,000 predicted ORFs was observed from the antisense strand, indicating that most of the genome is transcribed at a detectable level. Examples are also presented for high-resolution array analysis of transcript start and stop sites and RNA secondary structure.


Subject(s)
Escherichia coli/genetics , Gene Expression Profiling/methods , Genome, Bacterial , Oligonucleotide Array Sequence Analysis/methods , RNA, Messenger/metabolism , Escherichia coli/growth & development , Escherichia coli/metabolism , Gene Expression Regulation, Bacterial , Open Reading Frames/genetics , RNA, Antisense/metabolism , RNA, Bacterial/genetics , RNA, Bacterial/metabolism , RNA, Messenger/genetics , Transcription, Genetic
14.
Mol Phylogenet Evol ; 17(2): 209-18, 2000 Nov.
Article in English | MEDLINE | ID: mdl-11083935

ABSTRACT

Phylogenetic relationships between Allium and the monotypic Himalayan genus Milula were analyzed using sequences of the nuclear ribosomal DNA internal transcribed spacer (ITS) region and of the intergenic spacers from the chloroplast trnD(GUC)-trnT(GGU) region. Both marker systems unambiguously placed Milula spicata within Allium subgenus Rhizirideum, close to A. cyathophorum. Morphologically, the main difference between Allium and Milula is the conspicuous spicate inflorescence of Milula vs the mostly capitate or umbellate inflorescences in Allium. Anatomical investigations of leaf characters support a close relationship of Milula with A. cyathophorum and A. mairei, whereas root characters are distinctive from other species of section Cyathophora. To maintain Allium as monophyletic, Milula has been included as A. spicatum in Allium subgenus Rhizirideum.


Subject(s)
Allium/genetics , Evolution, Molecular , Liliaceae/genetics , Allium/classification , DNA, Chloroplast/chemistry , DNA, Chloroplast/genetics , DNA, Plant/chemistry , DNA, Plant/genetics , DNA, Ribosomal Spacer/genetics , Liliaceae/anatomy & histology , Liliaceae/classification , Molecular Sequence Data , Phylogeny , Plant Leaves/anatomy & histology , Plant Roots/anatomy & histology , RNA, Ribosomal, 5.8S/genetics , RNA, Transfer/genetics , Sequence Analysis, DNA
15.
Appl Environ Microbiol ; 66(8): 3310-29, 2000 Aug.
Article in English | MEDLINE | ID: mdl-10919786

ABSTRACT

Photorhabdus luminescens is a pathogenic bacterium that lives in the guts of insect-pathogenic nematodes. After invasion of an insect host by a nematode, bacteria are released from the nematode gut and help kill the insect, in which both the bacteria and the nematodes subsequently replicate. However, the bacterial virulence factors associated with this "symbiosis of pathogens" remain largely obscure. In order to identify genes encoding potential virulence factors, we performed approximately 2,000 random sequencing reads from a P. luminescens W14 genomic library. We then compared the sequences obtained to sequences in existing gene databases and to the Escherichia coli K-12 genome sequence. Here we describe the different classes of potential virulence factors found. These factors include genes that putatively encode Tc insecticidal toxin complexes, Rtx-like toxins, proteases and lipases, colicin and pyocins, and various antibiotics. They also include a diverse array of secretion (e.g., type III), iron uptake, and lipopolysaccharide production systems. We speculate on the potential functions of each of these gene classes in insect infection and also examine the extent to which the invertebrate pathogen P. luminescens shares potential antivertebrate virulence factors. The implications for understanding both the biology of this insect pathogen and links between the evolution of vertebrate virulence factors and the evolution of invertebrate virulence factors are discussed.


Subject(s)
Genome, Bacterial , Insecta/microbiology , Nematoda/microbiology , Photorhabdus/genetics , Photorhabdus/pathogenicity , Animals , Bacterial Adhesion/genetics , Bacterial Proteins/genetics , Bacterial Proteins/metabolism , Bacterial Toxins/genetics , Bacterial Toxins/metabolism , Escherichia coli/genetics , Escherichia coli/metabolism , Escherichia coli/pathogenicity , Extrachromosomal Inheritance , Genomic Library , Insecta/parasitology , Iron/metabolism , Molecular Sequence Data , Polysaccharides, Bacterial/genetics , Polysaccharides, Bacterial/metabolism , Sequence Analysis, DNA , Symbiosis , Virulence/genetics
16.
Nucleic Acids Res ; 28(10): 2177-86, 2000 May 15.
Article in English | MEDLINE | ID: mdl-10773089

ABSTRACT

Salmonella typhi, the causative agent of typhoid fever, annually infects 16 million people and kills 600 000 world wide. Plasmid-encoded multiple drug resistance in S. typhi is always encoded by plasmids of incompatibility group H (IncH). The complete DNA sequence of the large temperature-sensitive conjugative plasmid R27, the prototype for the IncHI1 family of plasmids, has been compiled and analyzed. This 180 kb plasmid contains 210 open reading frames (ORFs), of which 14 have been previously identified and 56 exhibit similarity to other plasmid and prokaryotic ORFs. A number of insertion elements were found, including the full Tn 10 transposon, which carries tetracycline resistance genes. Two transfer regions, Tra1 and Tra2, are present, which are separated by a minimum of 64 kb. Homologs of the DNA-binding proteins TlpA and H-NS that act as temperature-regulated repressors in other systems have been located in R27. Sequence analysis of transfer and replication regions supports a mosaic-like structure for R27. The genes responsible for conjugation and plasmid maintenance have been identified and mechanisms responsible for thermosensitive transfer are discussed.


Subject(s)
Drug Resistance, Multiple/genetics , R Factors/chemistry , Salmonella typhi/genetics , Amino Acid Sequence , Base Sequence , Conjugation, Genetic , DNA Nucleotidyltransferases/chemistry , DNA Nucleotidyltransferases/genetics , Deoxyribonuclease I/chemistry , Deoxyribonuclease I/genetics , Molecular Sequence Data , Open Reading Frames , Sequence Alignment , Sequence Homology, Amino Acid , Temperature
17.
Nucleic Acids Res ; 28(1): 65-7, 2000 Jan 01.
Article in English | MEDLINE | ID: mdl-10592182

ABSTRACT

RegulonDB is a database on transcription regulation and operon organization in Escherichia coli. The current version describes regulatory signals of transcription initiation, promoters, regulatory binding sites of specific regulators, ribosome binding sites and terminators, as well as information on genes clustered in operons. These specific annotations have been gathered from a constant search in the literature, as well as based on computational sequence predictions. The genomic coordinates of all these objects in the E.coli K-12 chromosome are clearly indicated. Every known object has a link to at least one MEDLINE reference. We have also added direct links to recent expression data of E.coli K-12. The version presented here has important modifications both in the structure of the database, as well as in the amount and type of information encoded in the database. RegulonDB can be accessed on the web at URL: http://www.cifn.unam. mx/Computational_Biology/regulondb/


Subject(s)
Databases, Factual , Escherichia coli/genetics , Gene Expression Regulation, Bacterial , Operon , Regulon , Transcription, Genetic , Internet
18.
J Bacteriol ; 181(20): 6425-40, 1999 Oct.
Article in English | MEDLINE | ID: mdl-10515934

ABSTRACT

DNA arrays of the entire set of Escherichia coli genes were used to measure the genomic expression patterns of cells growing in late logarithmic phase on minimal glucose medium and on Luria broth containing glucose. Ratios of the transcript levels for all 4,290 E. coli protein-encoding genes (cds) were obtained, and analysis of the expression ratio data indicated that the physiological state of the cells under the two growth conditions could be ascertained. The cells in the rich medium grew faster, and expression of the majority of the translation apparatus genes was significantly elevated under this growth condition, consistent with known patterns of growth rate-dependent regulation and increased rate of protein synthesis in rapidly growing cells. The cells grown on minimal medium showed significantly elevated expression of many genes involved in biosynthesis of building blocks, most notably the amino acid biosynthetic pathways. Nearly half of the known RpoS-dependent genes were expressed at significantly higher levels in minimal medium than in rich medium, and rpoS expression was similarly elevated. The role of RpoS regulation in these logarithmic phase cells was suggested by the functions of the RpoS dependent genes that were induced. The hallmark features of E. coli cells growing on glucose minimal medium appeared to be the formation and excretion of acetate, metabolism of the acetate, and protection of the cells from acid stress. A hypothesis invoking RpoS and UspA (universal stress protein, also significantly elevated in minimal glucose medium) as playing a role in coordinating these various aspects and consequences of glucose and acetate metabolism was generated. This experiment demonstrates that genomic expression assays can be applied in a meaningful way to the study of whole-bacterial-cell physiology for the generation of hypotheses and as a guide for more detailed studies of particular genes of interest.


Subject(s)
Escherichia coli/genetics , Gene Expression Profiling , Genome, Bacterial , Oligonucleotide Array Sequence Analysis , Amino Acids/biosynthesis , Amino Acyl-tRNA Synthetases/biosynthesis , Bacterial Physiological Phenomena , Bacterial Proteins , Culture Media , Energy Metabolism , Fatty Acids/metabolism , Gene Expression Regulation, Bacterial , Gluconates , Glucose , Nucleotides/biosynthesis , Protein Biosynthesis , Ribosomal Proteins/biosynthesis , Sigma Factor , Vitamins/biosynthesis
19.
Nucleic Acids Res ; 27(22): 4409-15, 1999 Nov 15.
Article in English | MEDLINE | ID: mdl-10536150

ABSTRACT

A simple and efficient gene replacement method, based on the recombination and repair activities of the cell, was developed. The method permits the targeted construction of markerless deletions, insertions and point mutations in the Escherichia coli chromosome. A suicide plasmid, carrying the mutant allele and the recognition site of meganuclease I- Sce I, is inserted into the genome by homologous recombination between the mutant and the wild-type (wt) alleles. Resolution of this cointegrate by intramolecular recombination of the allele pair results in either a mutant or a wt chromosome which can be distinguished by allele-specific PCR screening. The resolution process is stimulated by introducing a unique double-strand break (DSB) into the chromosome at the I- Sce I site. Cleavage by the nuclease not only enhances the frequency of resolution by two to three orders of magnitude, but also selects for the resolved products. The DSB-stimulated gene replacement method can be used in recombination-proficient E.coli cells, does not require specific growth conditions, and is potentially applicable in other microorganisms. Use of the method was demonstrated by constructing a 17-bp and a 62-kb deletion in the MG1655 chromosome. Cleavage of the chromosome induces the SOS response but does not lead to an increased mutation rate.


Subject(s)
DNA, Bacterial/genetics , Escherichia coli/genetics , Gene Transfer Techniques , Chromosome Deletion , Chromosomes, Bacterial , DNA/metabolism , DNA, Bacterial/metabolism , Deoxyribonucleases, Type II Site-Specific/biosynthesis , Deoxyribonucleases, Type II Site-Specific/genetics , Electroporation , Escherichia coli/metabolism , Molecular Sequence Data , Mutation , Recombination, Genetic , Saccharomyces cerevisiae Proteins
20.
Bioinformatics ; 15(9): 723-8, 1999 Sep.
Article in English | MEDLINE | ID: mdl-10498772

ABSTRACT

MOTIVATION: Given inputs extracted from an aligned column of DNA bases and the underlying Perkin Elmer Applied Biosystems (ABI) fluorescent traces, our goal is to train a neural network to determine correctly the consensus base for the column. Choosing an appropriate network input representation is critical to success in this task. We empirically compare five representations; one uses only base calls and the others include trace information. RESULTS: We attained the most accurate results from networks that incorporate trace information into their input representations. Based on estimates derived from using 10-fold cross-validation, the best network topology produces consensus accuracies ranging from 99.26% to >99.98% for coverages from two to six aligned sequences. With a coverage of six, it makes only three errors in 20 000 consensus calls. In contrast, the network that only uses base calls in its input representation has over double that error rate: eight errors in 20 000 consensus calls. CONTACT: allex@cs.wisc.edu


Subject(s)
Consensus Sequence , Neural Networks, Computer , Sequence Analysis, DNA , Base Sequence , DNA Fragmentation , DNA, Bacterial/analysis , Escherichia coli/genetics , Molecular Sequence Data , Sequence Analysis, DNA/methods
SELECTION OF CITATIONS
SEARCH DETAIL
...