Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 29
Filter
Add more filters










Publication year range
1.
Nucleic Acids Res ; 45(5): 2776-2785, 2017 03 17.
Article in English | MEDLINE | ID: mdl-28076288

ABSTRACT

We report the identification of novel tRNA species with 12-base pair amino-acid acceptor branches composed of longer acceptor stem and shorter T-stem. While canonical tRNAs have a 7/5 configuration of the branch, the novel tRNAs have either 8/4 or 9/3 structure. They were found during the search for selenocysteine tRNAs in terabytes of genome, metagenome and metatranscriptome sequences. Certain bacteria and their phages employ the 8/4 structure for serine and histidine tRNAs, while minor cysteine and selenocysteine tRNA species may have a modified 8/4 structure with one bulge nucleotide. In Acidobacteria, tRNAs with 8/4 and 9/3 structures may function as missense and nonsense suppressor tRNAs and/or regulatory noncoding RNAs. In δ-proteobacteria, an additional cysteine tRNA with an 8/4 structure mimics selenocysteine tRNA and may function as opal suppressor. We examined the potential translation function of suppressor tRNA species in Escherichia coli; tRNAs with 8/4 or 9/3 structures efficiently inserted serine, alanine and cysteine in response to stop and sense codons, depending on the identity element and anticodon sequence of the tRNA. These findings expand our view of how tRNA, and possibly the genetic code, is diversified in nature.


Subject(s)
RNA, Bacterial/chemistry , RNA, Transfer/chemistry , Anticodon , Bacteria/genetics , Bacterial Toxins/genetics , Nucleic Acid Conformation , Protein Biosynthesis , RNA, Transfer, Amino Acid-Specific/chemistry , RNA, Transfer, Cys/chemistry , RNA, Transfer, Cys/metabolism
4.
Angew Chem Int Ed Engl ; 55(17): 5337-41, 2016 Apr 18.
Article in English | MEDLINE | ID: mdl-26991476

ABSTRACT

Selenocysteine (Sec or U) is encoded by UGA, a stop codon reassigned by a Sec-specific elongation factor and a distinctive RNA structure. To discover possible code variations in extant organisms we analyzed 6.4 trillion base pairs of metagenomic sequences and 24 903 microbial genomes for tRNA(Sec) species. As expected, UGA is the predominant Sec codon in use. We also found tRNA(Sec) species that recognize the stop codons UAG and UAA, and ten sense codons. Selenoprotein synthesis programmed by UAG in Geodermatophilus and Blastococcus, and by the Cys codon UGU in Aeromonas salmonicida was confirmed by metabolic labeling with (75) Se or mass spectrometry. Other tRNA(Sec) species with different anticodons enabled E. coli to synthesize active formate dehydrogenase H, a selenoenzyme. This illustrates the ease by which the genetic code may evolve new coding schemes, possibly aiding organisms to adapt to changing environments, and show the genetic code is much more flexible than previously thought.


Subject(s)
Bacteria/genetics , Codon, Terminator/genetics , Genome, Bacterial , Selenocysteine/genetics , Base Sequence , Evolution, Molecular , Genetic Code , Metagenome
5.
Stand Genomic Sci ; 11: 17, 2016.
Article in English | MEDLINE | ID: mdl-26918089

ABSTRACT

The DOE-JGI Metagenome Annotation Pipeline (MAP v.4) performs structural and functional annotation for metagenomic sequences that are submitted to the Integrated Microbial Genomes with Microbiomes (IMG/M) system for comparative analysis. The pipeline runs on nucleotide sequences provided via the IMG submission site. Users must first define their analysis projects in GOLD and then submit the associated sequence datasets consisting of scaffolds/contigs with optional coverage information and/or unassembled reads in fasta and fastq file formats. The MAP processing consists of feature prediction including identification of protein-coding genes, non-coding RNAs and regulatory RNAs, as well as CRISPR elements. Structural annotation is followed by functional annotation including assignment of protein product names and connection to various protein family databases.

6.
Environ Microbiol ; 18(4): 1122-36, 2016 Apr.
Article in English | MEDLINE | ID: mdl-26487573

ABSTRACT

Hydrothermal vents represent a deep, hot, aphotic biosphere where chemosynthetic primary producers, fuelled by chemicals from Earth's subsurface, form the basis of life. In this study, we examined microbial mats from two distinct volcanic sites within the Hellenic Volcanic Arc (HVA). The HVA is geologically and ecologically unique, with reported emissions of CO2 -saturated fluids at temperatures up to 220°C and a notable absence of macrofauna. Metagenomic data reveals highly complex prokaryotic communities composed of chemolithoautotrophs, some methanotrophs, and to our surprise, heterotrophs capable of anaerobic degradation of aromatic hydrocarbons. Our data suggest that aromatic hydrocarbons may indeed be a significant source of carbon in these sites, and instigate additional research into the nature and origin of these compounds in the HVA. Novel physiology was assigned to several uncultured prokaryotic lineages; most notably, a SAR406 representative is attributed with a role in anaerobic hydrocarbon degradation. This dataset, the largest to date from submarine volcanic ecosystems, constitutes a significant resource of novel genes and pathways with potential biotechnological applications.


Subject(s)
Archaea/classification , Archaea/genetics , Bacteria/classification , Bacteria/genetics , Ecosystem , Hydrothermal Vents/microbiology , Archaea/isolation & purification , Bacteria/isolation & purification , Base Sequence , Geology , Metagenomics , RNA, Ribosomal, 16S/genetics , Temperature
7.
Stand Genomic Sci ; 10: 86, 2015.
Article in English | MEDLINE | ID: mdl-26512311

ABSTRACT

The DOE-JGI Microbial Genome Annotation Pipeline performs structural and functional annotation of microbial genomes that are further included into the Integrated Microbial Genome comparative analysis system. MGAP is applied to assembled nucleotide sequence datasets that are provided via the IMG submission site. Dataset submission for annotation first requires project and associated metadata description in GOLD. The MGAP sequence data processing consists of feature prediction including identification of protein-coding genes, non-coding RNAs and regulatory RNA features, as well as CRISPR elements. Structural annotation is followed by assignment of protein product names and functions.

8.
Stand Genomic Sci ; 10: 45, 2015.
Article in English | MEDLINE | ID: mdl-26380633

ABSTRACT

BACKGROUND: In an effort to identify the best practice for finding genes in prokaryotic genomes and propose it as a standard for automated annotation pipelines, 1,004,576 peptides were collected from various publicly available resources, and were used as a basis to evaluate various gene-calling methods. The peptides came from 45 bacterial replicons with an average GC content from 31 % to 74 %, biased toward higher GC content genomes. Automated, manual, and semi-manual methods were used to tally errors in three widely used gene calling methods, as evidenced by peptides mapped outside the boundaries of called genes. RESULTS: We found that the consensus set of identical genes predicted by the three methods constitutes only about 70 % of the genes predicted by each individual method (with start and stop required to coincide). Peptide data was useful for evaluating some of the differences between gene callers, but not reliable enough to make the results conclusive, due to limitations inherent in any proteogenomic study. CONCLUSIONS: A single, unambiguous, unanimous best practice did not emerge from this analysis, since the available proteomics data were not adequate to provide an objective measurement of differences in the accuracy between these methods. However, as a result of this study, software, reference data, and procedures have been better matched among participants, representing a step toward a much-needed standard. In the absence of sufficient amount of exprimental data to achieve a universal standard, our recommendation is that any of these methods can be used by the community, as long as a single method is employed across all datasets to be compared.

9.
PLoS One ; 9(10): e109738, 2014.
Article in English | MEDLINE | ID: mdl-25340743

ABSTRACT

Synechococcus sp. PCC 7942 and Synechocystis sp. PCC 6803 are model cyanobacteria from which the metabolism and adaptive responses of other cyanobacteria are inferred. Using stranded and 5' enriched libraries, we measured the gene expression response of cells transferred from reference conditions to stress conditions of decreased inorganic carbon, increased salinity, increased pH, and decreased illumination at 1-h and 24-h after transfer. We found that the specific responses of the two strains were by no means identical. Transcriptome profiles allowed us to improve the structural annotation of the genome i.e. identify possible missed genes (including anti-sense), alter gene coordinates and determine transcriptional units (operons). Finally, we predicted associations between proteins of unknown function and biochemical pathways by revealing proteins of known functions that are co-regulated with the unknowns. Future studies of these model organisms will benefit from the cataloging of their responses to environmentally relevant stresses, and improvements in their genome annotations found here.


Subject(s)
Acclimatization/genetics , Stress, Physiological/genetics , Synechococcus/genetics , Synechocystis/genetics , Transcriptome , Aerobiosis , Cluster Analysis , Down-Regulation/genetics , Gene Expression Regulation, Bacterial , Genome, Bacterial , Molecular Sequence Annotation , Operon/genetics , Photosynthesis/genetics , RNA, Messenger/genetics , RNA, Messenger/metabolism , Synechococcus/growth & development , Synechococcus/physiology , Synechocystis/growth & development , Synechocystis/physiology , Up-Regulation/genetics
10.
Environ Microbiol ; 16(10): 3211-23, 2014 Oct.
Article in English | MEDLINE | ID: mdl-25314671

ABSTRACT

A dataset of 87 020 nifH reads and 16 782 unique nifH protein sequences obtained over 2 years from four locations across a gradient of agricultural soil types in Argentina were analysed to provide a detailed and comprehensive picture of the diversity, abundance and responses of the N2 -fixing community in relation to differences in soil chemistry and agricultural practices. Phylogenetic analysis revealed an expected high proportion of Alphaproteobacteria, Betaproteobacteria and Deltaproteobacteria, mainly relatives to Bradyrhizobium and Methylosinus/Methylocystis, but a surprising paucity of Gammaproteobacteria. Analysis of variance and stepwise regression modelling suggested location and treatment-specific influences of soil type on diazotrophic community composition and organic carbon concentrations on nifH diversity. nifH gene abundance, determined by quantitative real-time polymerase chain reaction, was higher in agricultural soils than in non-agricultural soils, and was influenced by soil chemistry under intensive crop rotation but not under monoculture. At some locations, sustainable increased crop yields might be possible through the management of soil chemistry to improve the abundance and diversity of N2 -fixing bacteria.


Subject(s)
Nitrogen Fixation , Oxidoreductases/genetics , Proteobacteria/metabolism , Soil Microbiology , Soil/chemistry , Agriculture , Argentina , Bacteria/classification , Bacteria/genetics , Bacteria/isolation & purification , Nitrogen Fixation/genetics , Phylogeny , Proteobacteria/classification , Proteobacteria/genetics , Proteobacteria/isolation & purification , Sequence Analysis, DNA
11.
Bioinformatics ; 30(20): 2883-90, 2014 Oct 15.
Article in English | MEDLINE | ID: mdl-24990605

ABSTRACT

MOTIVATION: Studies of the biochemical functions and activities of uncultivated microorganisms in the environment require analysis of DNA sequences for phylogenetic characterization and for the development of sequence-based assays for the detection of microorganisms. The numbers of sequences for genes that are indicators of environmentally important functions such as nitrogen (N2) fixation have been rapidly growing over the past few decades. Obtaining these sequences from the National Center for Biotechnology Information's GenBank database is problematic because of annotation errors, nomenclature variation and paralogues; moreover, GenBank's structure and tools are not conducive to searching solely by function. For some genes, such as the nifH gene commonly used to assess community potential for N2 fixation, manual collection and curation are becoming intractable because of the large number of sequences in GenBank and the large number of highly similar paralogues. If analysis is to keep pace with sequence discovery, an automated retrieval and curation system is necessary. RESULTS: ARBitrator uses a two-step process composed of a broad collection of potential homologues followed by screening with a best hit strategy to conserved domains. 34 420 nifH sequences were identified in GenBank as of November 20, 2012. The false-positive rate is ∼0.033%. ARBitrator rapidly updates a public nifH sequence database, and we show that it can be adapted for other genes. AVAILABILITY AND IMPLEMENTATION: Java source and executable code are freely available to non-commercial users at http://pmc.ucsc.edu/∼wwwzehr/research/database/. CONTACT: zehrj@ucsc.edu SUPPLEMENTARY INFORMATION: SUPPLEMENTARY INFORMATION is available at Bioinformatics online.


Subject(s)
Computational Biology/methods , Databases, Nucleic Acid , Information Storage and Retrieval/methods , Oxidoreductases/genetics , Software , False Positive Reactions , Phylogeny , Sequence Homology, Nucleic Acid
12.
Science ; 344(6186): 909-13, 2014 May 23.
Article in English | MEDLINE | ID: mdl-24855270

ABSTRACT

The canonical genetic code is assumed to be deeply conserved across all domains of life with very few exceptions. By scanning 5.6 trillion base pairs of metagenomic data for stop codon reassignment events, we detected recoding in a substantial fraction of the >1700 environmental samples examined. We observed extensive opal and amber stop codon reassignments in bacteriophages and of opal in bacteria. Our data indicate that bacteriophages can infect hosts with a different genetic code and demonstrate phage-host antagonism based on code differences. The abundance and diversity of genetic codes present in environmental organisms should be considered in the design of engineered organisms with altered genetic codes in order to preclude the exchange of genetic information with naturally occurring species.


Subject(s)
Bacteria/genetics , Bacteria/virology , Bacteriophages/genetics , Bacteriophages/physiology , Codon, Terminator/genetics , Genetic Variation , Genome, Bacterial , Consensus Sequence , Humans , Likelihood Functions , Phylogeny , Protein Biosynthesis/genetics
13.
Nat Commun ; 4: 1767, 2013.
Article in English | MEDLINE | ID: mdl-23612308

ABSTRACT

Diatoms with symbiotic N2-fixing cyanobacteria are often abundant in the oligotrophic open ocean gyres. The most abundant cyanobacterial symbionts form heterocysts (specialized cells for N2 fixation) and provide nitrogen (N) to their hosts, but their morphology, cellular locations and abundances differ depending on the host. Here we show that the location of the symbiont and its dependency on the host are linked to the evolution of the symbiont genome. The genome of Richelia (found inside the siliceous frustule of Hemiaulus) is reduced and lacks ammonium transporters, nitrate/nitrite reductases and glutamine:2-oxoglutarate aminotransferase. In contrast, the genome of the closely related Calothrix (found outside the frustule of Chaetoceros) is more similar to those of free-living heterocyst-forming cyanobacteria. The genome of Richelia is an example of metabolic streamlining that has implications for the evolution of N2-fixing symbiosis and potentially for manipulating plant-cyanobacterial interactions.


Subject(s)
Cyanobacteria/genetics , Diatoms/microbiology , Genome, Bacterial/genetics , Metabolic Networks and Pathways/genetics , Nitrogen/metabolism , Sequence Deletion/genetics , Symbiosis/genetics , Flow Cytometry , Genes, Bacterial/genetics , Nitrogenase/genetics , Operon/genetics , Phylogeny
14.
J Microbiol ; 51(2): 147-53, 2013 Apr.
Article in English | MEDLINE | ID: mdl-23625213

ABSTRACT

The deeply branching clade of abundant, globally distributed aquatic α-Proteobacteria known as "SAR11", are adapted to nutrient-poor environments such as the surface waters of the open ocean. Unknown prior to 1990, uncultured until 2002, members of the SAR11 clade can now be cultured in artificial, defined media to densities three orders of magnitude higher than in unamended natural media. Cultivation in natural and defined media has confirmed genomic and metagenomic predictions such as an inability to reduce sulfate to sulfide, a requirement for pyruvate, an ability to oxidize a wide variety of methylated and one-carbon compounds for energy, and an unusual form of conditional glycine auxotrophy. Here we describe the metabolism of the SAR11 type strain Candidatus "Pelagibacter ubique" str. HTCC1062, as revealed by genome-assisted studies of laboratory cultures. We also describe the discovery of SAR11 and field studies that have been done on natural populations.


Subject(s)
Alphaproteobacteria/metabolism , Seawater/microbiology , Alphaproteobacteria/classification , Alphaproteobacteria/genetics , Alphaproteobacteria/isolation & purification , Oceans and Seas , Seawater/chemistry
15.
Nucleic Acids Res ; 39(20): 8792-802, 2011 Nov 01.
Article in English | MEDLINE | ID: mdl-21771858

ABSTRACT

In the course of analyzing 9,522,746 pyrosequencing reads from 23 stations in the Southwestern Pacific and equatorial Atlantic oceans, it came to our attention that misannotations of rRNA as proteins is now so widespread that false positive matching of rRNA pyrosequencing reads to the National Center for Biotechnology Information (NCBI) non-redundant protein database approaches 90%. One conserved portion of 23S rRNA was consistently misannotated often enough to prompt curators at Pfam to create a spurious protein family. Detailed examination of the annotation history of each seed sequence in the spurious Pfam protein family (PF10695, 'Cw-hydrolase') uncovered issues in the standard operating procedures and quality assurance programs of major sequencing centers, and other issues relating to the curation practices of those managing public databases such as GenBank and SwissProt. We offer recommendations for all these issues, and recommend as well that workers in the field of metatranscriptomics take extra care to avoid including false positive matches in their datasets.


Subject(s)
Genes, rRNA , Metagenomics , Molecular Sequence Annotation , Transcriptome , Alphaproteobacteria/genetics , Archaea/genetics , Bacteria/genetics , Databases, Nucleic Acid , Hydrolases/genetics , Operon , Pseudogenes , RNA, Antisense/chemistry , Sequence Analysis, DNA
16.
Front Microbiol ; 2: 261, 2011.
Article in English | MEDLINE | ID: mdl-22232617

ABSTRACT

Unicellular nitrogen-fixing cyanobacteria are important components of marine phytoplankton. Although non-nitrogen-fixing marine phytoplankton generally exhibit high gene sequence and genomic diversity, gene sequences of natural populations and isolated strains of Crocosphaerawatsonii, one of the two most abundant open ocean unicellular cyanobacteria groups, have been shown to be 98-100% identical. The low sequence diversity in Crocosphaera is a dramatic contrast to sympatric species of Prochlorococcus and Synechococcus, and raises the question of how genome differences can explain observed phenotypic diversity among Crocosphaera strains. Here we show, through whole genome comparisons of two phenotypically different strains, that there are strain-specific sequences in each genome, and numerous genome rearrangements, despite exceptionally low sequence diversity in shared genomic regions. Some of the strain-specific sequences encode functions that explain observed phenotypic differences, such as exopolysaccharide biosynthesis. The pattern of strain-specific sequences distributed throughout the genomes, along with rearrangements in shared sequences is evidence of significant genetic mobility that may be attributed to the hundreds of transposase genes found in both strains. Furthermore, such genetic mobility appears to be the main mechanism of strain divergence in Crocosphaera which do not accumulate DNA microheterogeneity over the vast majority of their genomes. The strain-specific sequences found in this study provide tools for future physiological studies, as well as genetic markers to help determine the relative abundance of phenotypes in natural populations.

17.
Arch Microbiol ; 192(10): 783-90, 2010 Oct.
Article in English | MEDLINE | ID: mdl-20803290

ABSTRACT

Some unicellular N(2)-fixing cyanobacteria have recently been found to lack a functional photosystem II of photosynthesis. Such organisms, provisionally termed UCYN-A, of the oceanic picoplanktion are major contributors to the global marine N-input by N(2)-fixation. Since their photosystem II is inactive, they can perform N(2)-fixation during the day. UCYN-A organisms cannot be cultivated as yet. Their genomic analysis indicates that they lack genes coding for enzymes of the Calvin cycle, the tricarboxylic acid cycle and for the biosynthesis of several amino acids. The carbon source in the ocean that allows them to thrive in such high abundance has not been identified. Their genomic analysis implies that they metabolize organic carbon by a new mode of life. These unicellular N(2)-fixing cyanobacteria of the oceanic picoplankton are evolutionarily related to spheroid bodies present in diatoms of the family Epithemiaceae, such as Rhopalodia gibba. More recently, spheroid bodies were ultimately proven to be related to cyanobacteria and to express nitrogenase. They have been reported to be completely inactive in all photosynthetic reactions despite the presence of thylakoids. Sequence data show that R. gibba and its spheroid bodies are an evolutionarily young symbiosis that might serve as a model system to unravel early events in the evolution of chloroplasts. The cell metabolism of UCYN-A and the spheroid bodies may be related to that of the acetate photoassimilating green alga Chlamydobotrys.


Subject(s)
Cyanobacteria/genetics , Nitrogen Fixation , Oxygen/metabolism , Photosynthesis , Carbon/metabolism , Chlorophyta/metabolism , Cyanobacteria/cytology , Cyanobacteria/metabolism , Diatoms/genetics , Diatoms/metabolism , Diatoms/ultrastructure , Genes, Bacterial , Nitrogenase/genetics , Nitrogenase/metabolism , Photosystem II Protein Complex/genetics , Photosystem II Protein Complex/metabolism , Phylogeny , Symbiosis/genetics , Thylakoids/genetics , Thylakoids/metabolism
18.
Environ Microbiol ; 12(7): 1940-56, 2010 Jul.
Article in English | MEDLINE | ID: mdl-20406287

ABSTRACT

Because bacterioplankton production rates do not vary strongly across vast expanses of the ocean, it is unclear how variability in community structure corresponds with functional variability in the open ocean. We surveyed community transcript functional profiles at eight locations in the open ocean, in both the light and in the dark, using the genomic subsystems approach, to understand variability in gene expression patterns in surface waters. Metatranscriptomes from geographically distinct areas and collected during the day and night shared a large proportion of metabolic functional similarity (74%) at the finest metabolic resolution possible. The variability between metatranscriptomes could be explained by phylogenetic differences between libraries (Mantel test, P < 0.0001). Several key gene expression pathways, including Photosystem I, Photosystem II and ammonium uptake, demonstrated the most variability both geographically and between light and dark. Libraries were dominated by transcripts of the cyanobacterium Prochlorocococcus marinus, where most geographical and diel variability between metatranscriptomes reflected between-station differences in cyanobacterial phototrophic metabolism. Our results demonstrate that active genetic machinery in surface waters of the ocean is dominated by photosynthetic microorganisms and their site-to-site variability, while variability in the remainder of assemblages is dependent on local taxonomic composition.


Subject(s)
Cyanobacteria/genetics , Darkness , Gene Expression Profiling , Light , Seawater/microbiology , Geography , Metabolic Networks and Pathways/genetics , Oceans and Seas
19.
Nature ; 464(7285): 90-4, 2010 Mar 04.
Article in English | MEDLINE | ID: mdl-20173737

ABSTRACT

Nitrogen (N(2))-fixing marine cyanobacteria are an important source of fixed inorganic nitrogen that supports oceanic primary productivity and carbon dioxide removal from the atmosphere. A globally distributed, periodically abundant N(2)-fixing marine cyanobacterium, UCYN-A, was recently found to lack the oxygen-producing photosystem II complex of the photosynthetic apparatus, indicating a novel metabolism, but remains uncultivated. Here we show, from metabolic reconstructions inferred from the assembly of the complete UCYN-A genome using massively parallel pyrosequencing of paired-end reads, that UCYN-A has a photofermentative metabolism and is dependent on other organisms for essential compounds. We found that UCYN-A lacks a number of major metabolic pathways including the tricarboxylic acid cycle, but retains sufficient electron transport capacity to generate energy and reducing power from light. Unexpectedly, UCYN-A has a reduced genome (1.44 megabases) that is structurally similar to many chloroplasts and some bacteria, in that it contains inverted repeats of ribosomal RNA operons. The lack of biosynthetic pathways for several amino acids and purines suggests that this organism depends on other organisms, either in close association or in symbiosis, for critical nutrients. However, size fractionation experiments using natural populations have so far not provided evidence of a symbiotic association with another microorganism. The UCYN-A cyanobacterium is a paradox in evolution and adaptation to the marine environment, and is an example of the tight metabolic coupling between microorganisms in oligotrophic oceanic microbial communities.


Subject(s)
Cyanobacteria/genetics , Cyanobacteria/metabolism , Genome, Bacterial/genetics , Nitrogen Fixation/physiology , Nitrogen/metabolism , Seawater/microbiology , Carbon/metabolism , Chromosomes, Bacterial/genetics , Cyanobacteria/classification , Cyanobacteria/cytology , Electron Transport , Genomics , Marine Biology , Molecular Sequence Data , Nitrogen Fixation/genetics , Oceans and Seas , Oxidoreductases/genetics
20.
ISME J ; 3(11): 1286-300, 2009 Nov.
Article in English | MEDLINE | ID: mdl-19571897

ABSTRACT

Trichodesmium are responsible for a large fraction of open ocean nitrogen fixation, and are often found in complex consortia of other microorganisms, including viruses, prokaryotes, microbial eukaryotes and metazoa. We applied a community gene expression (metatranscriptomic) approach to study the patterns of microbial gene utilization within colonies of Trichodesmium collected during a bloom in the Southwest Pacific Ocean in April 2007. The survey generated 5711-day and 5385-night putative mRNA reads. The majority of mRNAs were from the co-occurring microorganisms and not Trichodesmium, including other cyanobacteria, heterotrophic bacteria, eukaryotes and phage. Most transcripts did not share homology with proteins from cultivated microorganisms, but were similar to shotgun sequences and unannotated proteins from open ocean metagenomic surveys. Trichodesmium transcripts were mostly expressed photosynthesis, N(2) fixation and S-metabolism genes, whereas those in the co-occurring microorganisms were mostly involved in genetic information storage and processing. Detection of Trichodesmium genes involved in P uptake and As detoxification suggest that local enrichment of N through N(2) fixation may lead to a P-stress response. Although containing similar dominant transcripts to open ocean metatranscriptomes, the overall pattern of gene expression in Trichodesmium colonies was distinct from free-living pelagic assemblages. The identifiable genes expressed by Trichodesmium and closely associated microorganisms reflect the constraints of life in well-lit and nutrient-poor waters, with biosynthetic investment in nutrient acquisition and cell maintenance, which is in contrast to gene transcription by soil and coastal seawater microbial assemblages. The results provide insight into aggregate microbial communities in contrast to planktonic free-living assemblages that are the focus of other studies.


Subject(s)
Cyanobacteria/genetics , Gene Expression Regulation, Bacterial , Seawater/microbiology , Bacterial Proteins/genetics , Bacterial Proteins/metabolism , Cyanobacteria/classification , Cyanobacteria/isolation & purification , Cyanobacteria/metabolism , Molecular Sequence Data , Pacific Ocean , Phylogeny
SELECTION OF CITATIONS
SEARCH DETAIL
...