Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 6 de 6
Filter
Add more filters










Database
Publication year range
1.
BMC Genomics ; 13: 648, 2012 Nov 21.
Article in English | MEDLINE | ID: mdl-23171398

ABSTRACT

BACKGROUND: Chinese fir (Cunninghamia lanceolata) is an important timber species that accounts for 20-30% of the total commercial timber production in China. However, the available genomic information of Chinese fir is limited, and this severely encumbers functional genomic analysis and molecular breeding in Chinese fir. Recently, major advances in transcriptome sequencing have provided fast and cost-effective approaches to generate large expression datasets that have proven to be powerful tools to profile the transcriptomes of non-model organisms with undetermined genomes. RESULTS: In this study, the transcriptomes of nine tissues from Chinese fir were analyzed using the Illumina HiSeq™ 2000 sequencing platform. Approximately 40 million paired-end reads were obtained, generating 3.62 gigabase pairs of sequencing data. These reads were assembled into 83,248 unique sequences (i.e. Unigenes) with an average length of 449 bp, amounting to 37.40 Mb. A total of 73,779 Unigenes were supported by more than 5 reads, 42,663 (57.83%) had homologs in the NCBI non-redundant and Swiss-Prot protein databases, corresponding to 27,224 unique protein entries. Of these Unigenes, 16,750 were assigned to Gene Ontology classes, and 14,877 were clustered into orthologous groups. A total of 21,689 (29.40%) were mapped to 119 pathways by BLAST comparison against the Kyoto Encyclopedia of Genes and Genomes (KEGG) database. The majority of the genes encoding the enzymes in the biosynthetic pathways of cellulose and lignin were identified in the Unigene dataset by targeted searches of their annotations. And a number of candidate Chinese fir genes in the two metabolic pathways were discovered firstly. Eighteen genes related to cellulose and lignin biosynthesis were cloned for experimental validating of transcriptome data. Overall 49 Unigenes, covering different regions of these selected genes, were found by alignment. Their expression patterns in different tissues were analyzed by qRT-PCR to explore their putative functions. CONCLUSIONS: A substantial fraction of transcript sequences was obtained from the deep sequencing of Chinese fir. The assembled Unigene dataset was used to discover candidate genes of cellulose and lignin biosynthesis. This transcriptome dataset will provide a comprehensive sequence resource for molecular genetics research of C. lanceolata.


Subject(s)
Cunninghamia/genetics , Cunninghamia/metabolism , Gene Expression Profiling , Genes, Plant/genetics , Lignin/biosynthesis , Databases, Genetic , Molecular Sequence Annotation , Plant Proteins/genetics , Sequence Analysis
2.
J Zhejiang Univ Sci B ; 9(9): 667-74, 2008 Sep.
Article in English | MEDLINE | ID: mdl-18763298

ABSTRACT

Begomoviruses are single-stranded DNA viruses and cause severe diseases in major crop plants worldwide. Based on current genome sequence analyses, we found that synonymous codon usage variations in the protein-coding genes of begomoviruses are mainly influenced by mutation bias. Base composition analysis suggested that the codon usage bias of AV1 and BV1 genes is significant and their expressions are high. Fourteen codons were determined as translational optimal ones according to the comparison of codon usage patterns between highly and lowly expressed genes. Interestingly the codon usages between begomoviruses from the Old and the New Worlds are apparently different, which supports the idea that the bipartite begomoviruses of the New World might originate from bipartite ones of the Old World, whereas the latter evolve from the Old World monopartite begomoviruses.


Subject(s)
Begomovirus/genetics , Biological Evolution , Chromosome Mapping , Codon/genetics , DNA Mutational Analysis , DNA, Viral/genetics , Evolution, Molecular
3.
Yi Chuan Xue Bao ; 33(8): 669-77, 2006 Aug.
Article in English | MEDLINE | ID: mdl-16939000

ABSTRACT

Rice is known to be one of the most important crops for human consumption. As the model cereal crop, large-scale sequencing of rice genome must play quite important roles both in theoretical research and practical application in rice breeding, which announces the opening of another new way to resolve the world food crisis. At present, the emphasis of rice genome research has been transferred from structure genomics to functional analysis. The discovery of new genes and annotation of gene function was believed to be an important issue in functional genomics research. In this article, the sequencing and functional research of the rice genome were reviewed. These results may provide some useful clues for rice genetic engineering and breeding practices.


Subject(s)
Crops, Agricultural/genetics , Genes, Plant/physiology , Genome, Plant/physiology , Genomics/trends , Oryza/genetics , Computational Biology
4.
Yi Chuan Xue Bao ; 32(8): 864-73, 2005 Aug.
Article in Chinese | MEDLINE | ID: mdl-16231742

ABSTRACT

Based on hidden Markov models (HMM), the paper utilized reported SCP (Serine Carboxypeptidases) protein sequences as datasets to build HMM profile. To search Arabidopsis thaliana whole proteome,and identified 54 SCPL (Serine Carboxypeptidase-Like) proteins coding genes. The intron-exon structure, the chromosome mapping and the characteristic of coded protein sequences of those 54 putative genes were analyzed in details, revealing seven gene clusters probably resulted from recent gene duplication. This implied that a remarkable number of Arabidopsis thaliana SCPL genes are harboring alternative splice sites. Phylogenetics evolution analysis suggested 88.9% proteins encoded by Arabidopsis genes belong to two string subfamily of carboxypeptidase, I or II, while only 11.1% proteins fall into single string carboxypeptidase III subfamily. Our results indicated besides the facts that all enzymes of this family contained a central catalytic domain of unique topology and three dimensional structure designated as "alpha/beta hydrolase fold", the DNA and their encoded protein sequences also gave clues to phylogentics studies.


Subject(s)
Arabidopsis Proteins/genetics , Arabidopsis/enzymology , Carboxypeptidases/genetics , Chromosomes, Plant/genetics , Amino Acid Sequence , Arabidopsis/genetics , Chromosome Mapping , Evolution, Molecular , Genomics/methods , Molecular Sequence Data , Multigene Family , Phylogeny , Sequence Homology, Amino Acid
5.
Yi Chuan Xue Bao ; 31(11): 1284-93, 2004 Nov.
Article in Chinese | MEDLINE | ID: mdl-15651682

ABSTRACT

PHD-finger (plant homeodomain finger), a unique Cys4-His-Cys3 pattern zinc-containing domain, is widely found in many transcriptional regulation proteins from plants or animals. This protein has many different family members distributing in plant or animal proteome,and plays key roles in the development. Here, we identified 44 different putative PHD-finger proteins in the rice (Oryza sativa) genome and performed a phylogenetic analysis with 45 Arabidopsis thaliana proteins. The analysis led to a classification of the 89 proteins into two major groups based on the number of PHD-fingers. Phylogentic analysis of these proteins indicated that there are common ancestors in both plant and human genomes, from which two or three PHD-finger containing proteins might duplicate.


Subject(s)
Arabidopsis Proteins/genetics , Arabidopsis/genetics , Homeodomain Proteins/genetics , Oryza/genetics , Plant Proteins/genetics , Zinc Fingers/genetics , Amino Acid Sequence , Molecular Sequence Data , Phylogeny
6.
Yi Chuan Xue Bao ; 30(4): 335-40, 2003 Apr.
Article in Chinese | MEDLINE | ID: mdl-12812058

ABSTRACT

By using the whole genome sequences and EST data from the indica rice cultivar 93-11, a detailed relative analysis is made of the effect of some impact factors on synonymous codon usage. The results showed that the gene expression level assessed by mRNA abundance is positive relative to the "codon adaptation index" (CAI, 0.227**), and "codon preference parameter" (CPP, 0.145**), but negative relative to "effective number of codons" (ENC, -0.147**), indicating that genes with higher expression showed more significant variation in codon usage. There are significant negative correlations between gene length and CAI, CPP (r = -0.413** and -0.480** respectively), but a positive correlation between gene length and ENC(r = 0.210**), which suggested a tendency of shorter genes to higher expression of the transcriptional activity in 93-11. From the results that a higher negative correlation between GC content and ENC(r = -0.740**), but higher positive correlations between GC content and CAI, CPP (r = 0.877** and 0.832**, respectively), we can concluded that the GC content in coding region gave far more contribution to codon usage bias than that mRNA abundance and gene length. Four kinds of bases showed a three-period distribution in the translation initiation region, the bias at the first codon sites, which located +4, and +6, in the downstream of ATG being the largest. That suggested that there was a strong action of natural selection on these specific positions in the 93-11 genome. In this paper twenty-five codons defined firstly as "optimal codons" in 93-11 may provide some more useful information for rice gene-transformation.


Subject(s)
Codon/genetics , Oryza/genetics , Base Composition/genetics , Genes, Plant/genetics , Genome, Plant , RNA, Messenger/genetics
SELECTION OF CITATIONS
SEARCH DETAIL
...