1.
Biotechnol Adv ; 75: 108417, 2024 Jul 20.
Article in English | MEDLINE | ID: mdl-39038691

ABSTRACT

Protein expression is a critical process in diverse biological systems. For Escherichia coli, a widely employed microbial host in industrial catalysis and healthcare, researchers often face significant challenges in constructing recombinant expression systems. To maximize the potential of E. coli expression systems, it is essential to address problems regarding the low or absent production of certain target proteins. This article presents viable solutions to the main factors posing challenges to heterologous protein expression in E. coli, which include protein toxicity, the intrinsic influence of gene sequences, and mRNA structure. These strategies include specialized approaches for managing toxic protein expression, addressing issues related to mRNA structure and codon bias, advanced codon optimization methodologies that consider multiple factors, and emerging optimization techniques facilitated by big data and machine learning.

2.
Front Genet ; 15: 1395224, 2024.
Article in English | MEDLINE | ID: mdl-38836039

ABSTRACT

The NODULE-INCEPTION-like protein (NLP) family is a plant-specific transcription factor (TF) family involved in nitrate transport and assimilation in plants, which are essential for improving plant nitrogen use efficiency. Currently, the molecular nature and evolutionary trajectory of NLP genes in the C4 model crop foxtail millet are unknown. Therefore, we performed a comprehensive analysis of NLP genes and their molecular evolution in foxtail millet by scanning the genomes of foxtail millet and representative species of the plant kingdom. We identified seven NLP genes in the foxtail millet genome, each located on a different chromosome. They were not structurally identical to each other and were mainly expressed in root tissues. We unearthed two key genes (Si5G004100.1 and Si6G248300.1) with a variety of excellent characteristics. Regarding their molecular evolution, we found that NLP genes in Gramineae mainly underwent dispersed duplication, but maize NLP genes were mainly generated via whole-genome duplication (WGD) events. Other factors such as base mutations and natural selection have combined to promote the evolution of NLP genes. Intriguingly, the family in plants showed a gradual expansion during evolution with more duplications than losses, contrary to most gene families. In conclusion, this study advances the use of NLP genetic resources and the understanding of molecular evolution in cereals.

3.
Cell Rep ; 43(5): 114203, 2024 May 28.
Article in English | MEDLINE | ID: mdl-38722744

ABSTRACT

Leishmania is the causative agent of cutaneous and visceral diseases affecting millions of individuals worldwide. Pseudouridine (Ψ), the most abundant modification on rRNA, changes during the parasite life cycle. Alterations in the level of a specific Ψ in helix 69 (H69) affected ribosome function. To decipher the molecular mechanism of this phenotype, we determine the structure of ribosomes lacking the single Ψ and its parental strain at ∼2.4-3 Å resolution using cryo-EM. Our findings demonstrate the significance of a single Ψ on H69 to its structure and the importance for its interactions with helix 44 and specific tRNAs. Our study suggests that rRNA modification affects translation of mRNAs carrying codon bias due to selective accommodation of tRNAs by the ribosome. Based on the high-resolution structures, we propose a mechanism explaining how the ribosome selects specific tRNAs.


Subject(s)
Pseudouridine , RNA, Transfer , Ribosomes , Pseudouridine/metabolism , Ribosomes/metabolism , RNA, Transfer/metabolism , RNA, Transfer/genetics , Leishmania/metabolism , Leishmania/genetics , Cryoelectron Microscopy , RNA, Ribosomal/metabolism , RNA, Ribosomal/chemistry , RNA, Ribosomal/genetics , Nucleic Acid Conformation , Models, Molecular
4.
Viruses ; 16(2)2024 02 10.
Article in English | MEDLINE | ID: mdl-38400051

ABSTRACT

The rapid evolution of SARS-CoV-2 has fueled its global proliferation since its discovery in 2019, with several notable variants having been responsible for increases in cases of coronavirus disease 2019 (COVID-19). Analyses of codon bias and usage in these variants between phylogenetic clades or lineages may grant insights into the evolution of SARS-CoV-2 and identify target codons indicative of evolutionary or mutative trends that may prove useful in tracking or defending oneself against emerging strains. We processed a cohort of 120 SARS-CoV-2 genome sequences through a statistical and bioinformatic pipeline to identify codons presenting evidence of selective pressure as well as codon coevolution. We report the identification of two codon sites in the orf8 and N genes demonstrating such evidence with real-world impacts on pathogenicity and transmissivity.


Subject(s)
COVID-19 , SARS-CoV-2 , Humans , SARS-CoV-2/genetics , COVID-19/genetics , Phylogeny , Genome, Viral , Genomics , Codon
5.
Microb Genom ; 10(1)2024 Jan.
Article in English | MEDLINE | ID: mdl-38270515

ABSTRACT

African swine fever (ASF) is a severe haemorrhagic disease caused by the African swine fever virus (ASFV), transmitted by ticks, resulting in high mortality among domestic pigs and wild boars. The global spread of ASFV poses significant economic threats to the swine industry. This study employs diverse analytical methods to explore ASFV's evolution and host adaptation, focusing on codon usage patterns and associated factors. Utilizing phylogenetic analysis methods including neighbour-joining and maximum-likelihood, 64 ASFV strains were categorized into four clades. Codon usage bias (CUB) is modest in ASFV coding sequences. This research identifies multiple factors - such as nucleotide composition, mutational pressures, natural selection and geographical diversity - contributing to the formation of CUB in ASFV. Analysis of relative synonymous codon usage reveals CUB variations within clades and among ASFVs and their hosts. Both Codon Adaptation Index and Similarity Index analyses confirm that ASFV strains are highly adapted to soft ticks (Ornithodoros moubata) but less so to domestic pigs, which could be a result of the long-term co-evolution of ASFV with ticks. This study sheds light on the factors influencing ASFV's codon usage and fitness dynamics, enriching our understanding of its evolution, adaptation and host interactions.
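The relative synonymous codon usage (RSCU) mentioned in this abstract can be illustrated with a minimal sketch. It assumes the standard genetic code and the usual definition (observed count of a codon divided by the count expected if all synonymous codons of that amino acid were used equally). The function name `rscu` and the toy sequence are hypothetical and are not taken from the study.

```python
from collections import Counter
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumption: no
# alternative translation table is needed for the sequences of interest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def rscu(cds: str) -> dict:
    """Return RSCU values for every sense codon whose amino acid occurs in `cds`."""
    cds = cds.upper().replace("U", "T")
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    codons = [cds[i:i + 3] for i in range(0, usable, 3)]
    counts = Counter(c for c in codons if c in CODON_TABLE)

    # Group synonymous codons by the amino acid they encode.
    families = {}
    for codon, aa in CODON_TABLE.items():
        families.setdefault(aa, []).append(codon)

    result = {}
    for aa, family in families.items():
        total = sum(counts[c] for c in family)
        if aa == "*" or total == 0:
            continue  # skip stop codons and amino acids absent from the sequence
        for c in family:
            # Observed count over the count expected under equal usage.
            result[c] = counts[c] * len(family) / total
    return result


# Toy example: four leucine codons, three of them CTG.
values = rscu("CTGCTGCTGCTA")
print(values["CTG"], values["CTA"])
```

An RSCU above 1 marks a codon used more often than expected under equal usage, and a value below 1 marks one used less often. Single-codon amino acids (Met, Trp) always score 1 and carry no information about bias.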


Subject(s)
African Swine Fever Virus , African Swine Fever , Ornithogalum , Animals , Swine , African Swine Fever Virus/genetics , Codon Usage , Host Adaptation , Phylogeny , Sus scrofa
6.
New Phytol ; 241(3): 1144-1160, 2024 Feb.
Article in English | MEDLINE | ID: mdl-38072860

ABSTRACT

Chlorella ohadii was isolated from desert biological soil crusts, one of the harshest habitats on Earth, and is emerging as an exciting new green model for studying growth, photosynthesis and metabolism under a wide range of conditions. Here, we compared the genome of C. ohadii, the fastest growing alga on record, to that of other green algae, to reveal the genomic imprints empowering its unparalleled growth rate and resistance to various stressors, including extreme illumination. This included the genome of its close relative, but slower growing and photodamage sensitive, C. sorokiniana UTEX 1663. A larger number of ribosome-encoding genes, high intron abundance, increased codon bias and unique genes potentially involved in metabolic flexibility and resistance to photodamage are all consistent with the faster growth of C. ohadii. Some of these characteristics highlight general trends in Chlorophyta and Chlorella spp. evolution, and others open new broad avenues for mechanistic exploration of their relationship with growth. This work entails a unique case study for the genomic adaptations and costs of exceptionally fast growth and sheds light on the genomic signatures of fast growth in photosynthetic cells. It also provides an important resource for future studies leveraging the unique properties of C. ohadii for photosynthesis and stress response research alongside their utilization for synthetic biology and biotechnology aims.


Subject(s)
Chlorella , Chlorella/genetics , Photosynthesis , Genomics
7.
Biosystems ; 235: 105102, 2024 Jan.
Article in English | MEDLINE | ID: mdl-38092331

ABSTRACT

Analyzing the codon usage frequencies of a sample of 20 plants, for which the codon-anticodon pattern is known, we have remarked that the hierarchy of the usage frequencies presents an almost "universal" behavior. Seeking to explain this behavior, we assume that the codon usage probability results from the sum of two contributions: the first, dominant term is an almost "universal" one and depends on the codon-anticodon interaction; the second term is a local one, i.e. it depends on the biological species. The codon-anticodon interaction is written as a spin-spin plus a z-spin term in the formalism of the crystal basis model. From general considerations, in particular from the choice of the signs and some constraints on the parameters defining the interaction, we are able to explain most of the observed data.


Subject(s)
Anticodon , RNA, Transfer , Anticodon/genetics , Codon Usage , Codon/genetics
8.
Cells ; 12(20)2023 10 13.
Article in English | MEDLINE | ID: mdl-37887289

ABSTRACT

The complexities of translational strategies make this stage of implementing genetic information one of the most challenging to comprehend and, simultaneously, perhaps the most engaging. It is evident that this diverse range of strategies results not only from a long evolutionary history, but is also of paramount importance for refining gene expression and metabolic modulation. This notion is particularly accurate for organisms that predominantly exhibit biochemical and physiological reactions with a lack of behavioural ones. Plants are a group of organisms that exhibit such features. Addressing unfavourable environmental conditions plays a pivotal role in plant physiology. This is particularly evident with the changing conditions of global warming and the irrevocable loss or depletion of natural ecosystems. In conceptual terms, the plant response to abiotic stress comprises a set of elaborate and intricate strategies. This is influenced by a range of abiotic factors that cause stressful conditions, and molecular genetic mechanisms that fine-tune metabolic pathways allowing the plant organism to overcome non-standard and non-optimal conditions. This review aims to focus on the current state of the art in the field of translational regulation in plants under abiotic stress conditions. Different regulatory elements and patterns are being assessed chronologically. We deem it important to focus on significant high-performance techniques for studying the genetic information dynamics during the translation phase.


Subject(s)
Ecosystem , Plants , Plants/genetics , Plants/metabolism , Plant Physiological Phenomena , Metabolic Networks and Pathways , Stress, Physiological/genetics
9.
Planta ; 258(5): 87, 2023 Sep 26.
Article in English | MEDLINE | ID: mdl-37750983

ABSTRACT

MAIN CONCLUSION: TL63 orthologs were angiosperm specific and had undergone motif loss and gain and increased purifying selection. AtTL63 was involved in the response of yeast and Arabidopsis plants to oxidative stress. The Tóxicos en Levadura (TL) family, a class of E3 ubiquitin ligases with a typical RING-H2 type zinc finger structure, plays a pivotal role in mediating physiological processes and responding to stress in plants. However, the evolution and function of TL63 remain unclear. In this study, TL63 homologs were dated roughly back to the origin of land plants and confirmed to have been subjected to the gain and loss of motifs and increased purifying selection. Phylogenetic analysis showed that 279 TL63s could be divided into four main clades (Clade A-D). Notably, the ancestral tandem TL40/41 cluster contributed to the expansion of modern Brassicaceae TL40/41. The substitution rate tests revealed that the TL63 lineage was evidently different from other lineages. The codon usage indices showed that monocotyledons favored C3s, G3s, CAI, CBI and Fop over A3s and T3s. Sequence analysis showed that the TL63 homologs had conserved TM and GLD motifs and a RING-H2 domain whose key amino acid residues accounted for the high average abundance. In particular, Arabidopsis thaliana TL63 (AtTL63) was located in the nuclei, cell membranes and peroxisomes and was expressed universally and significantly throughout A. thaliana development. Under H2O2 treatment, low or moderate expression of AtTL63 had beneficial effects on the growth and viability of yeast cells, and the mutation or overexpression of AtTL63 positively affected the growth of A. thaliana plants. In brief, this study provides useful insight into the evolution of the plant TL63s and the functions of AtTL63 under oxidative stress.


Subject(s)
Arabidopsis , Arabidopsis/genetics , Phylogeny , Hydrogen Peroxide , Saccharomyces cerevisiae , Oxidative Stress/genetics , Gene Expression Regulation, Plant/genetics
10.
Sheng Wu Gong Cheng Xue Bao ; 39(7): 2939-2953, 2023 Jul 25.
Article in Chinese | MEDLINE | ID: mdl-37584141

ABSTRACT

The genomic DNA of Rubus rosaefolius was extracted and sequenced on the Illumina NovaSeq platform to obtain the complete chloroplast genome sequence, and the sequence characteristics and phylogeny of the chloroplast genes were analyzed. The results showed that the complete chloroplast genome of R. rosaefolius was 155 650 bp in length and had a typical quadripartite structure, including two inverted repeats (25 748 bp each), a large single-copy region (85 443 bp) and a small single-copy region (18 711 bp). A total of 131 genes were identified in the R. rosaefolius chloroplast genome, including 86 protein-coding genes, 37 tRNA genes and 8 rRNA genes. The GC content of the whole genome was 36.9%. The chloroplast genome of R. rosaefolius contains 47 dispersed repeats and 72 simple sequence repeat (SSR) loci. Leucine codons were the most frequently used, and codons ending in A/U were preferred. Phylogenetic analysis showed that R. rosaefolius had the closest relationship with R. taiwanicola, followed by R. rubraangustifolius and R. glandulosopunctatus. The chloroplast genome characteristics and phylogenetic analysis of R. rosaefolius provide a theoretical basis for its genetic diversity research and chloroplast development and utilization.
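The region sizes and gene counts reported in this abstract can be checked for internal consistency with a few lines of arithmetic. The sketch below assumes the usual circular plastome layout (one large region, one small region and two copies of the repeat). The helper names `quadripartite_length` and `gc_content` are hypothetical, and `gc_content` is shown only to illustrate how a figure such as 36.9% would be obtained from a sequence.

```python
def quadripartite_length(lsc: int, ssc: int, ir: int) -> int:
    """Total plastome length: large region + small region + two repeat copies."""
    return lsc + ssc + 2 * ir


def gc_content(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T bases of a sequence."""
    seq = seq.upper()
    acgt = sum(seq.count(base) for base in "ACGT")
    if acgt == 0:
        raise ValueError("sequence contains no A/C/G/T bases")
    return 100.0 * (seq.count("G") + seq.count("C")) / acgt


# Figures reported in the abstract above.
total = quadripartite_length(lsc=85_443, ssc=18_711, ir=25_748)
genes = 86 + 37 + 8  # protein-coding + tRNA + rRNA
print(total, genes)  # 155650 131
```

Both totals match the reported genome length (155 650 bp) and gene count (131).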


Subject(s)
Genome, Chloroplast , Rubus , Phylogeny , Rubus/genetics , Fruit/genetics , Codon/genetics
11.
Alcohol Clin Exp Res (Hoboken) ; 47(8): 1478-1493, 2023 Aug.
Article in English | MEDLINE | ID: mdl-37336636

ABSTRACT

BACKGROUND: The basis for familial alcohol use disorder (AUD) remains an enigma due to various biological and societal confounds. The present study used three of the most adopted and documented rat models, combining the alcohol-preferring/non-alcohol-preferring (P/NP) lines and high alcohol-drinking/low alcohol-drinking (HAD/LAD) replicated lines, of AUD as examined through the lens of whole genomic analyses. METHODS: We used complete genome sequencing of the P/NP lines and previously published sequences of the HAD/LAD replicates to enhance the discovery of variants associated with AUD and to remove confounding with genetic background and random genetic drift. Specifically, we used high-order statistical methods to search for genetic variants whose frequency changes in whole sets of gene ontologies corresponded with phenotypic changes in the direction of selection, that is, ethanol-drinking preference. RESULTS: Our first finding was that in addition to variants causing translational changes, the principal genetic changes associated with drinking predisposition were silent mutations and mutations in the 3' untranslated regions (3'UTR) of genes. Neither of these types of mutations alters the amino acid sequence of the translated protein but they influence both the rate and conformation of gene transcription, including its stability and posttranslational events that alter gene efficacy. This finding argues for refocusing human genomic studies on changes in gene efficacy. Among the key ontologies identified were the central genes associated with the Na+ voltage-gated channels of neurons and glia (including the Scn1a, Scn2a, Scn2b, Scn3a, Scn7a, and Scn9a subtypes) and excitatory glutamatergic secretion (including Grm2 and Myo6), both of which are essential in neuroplasticity. 
In addition, we identified "Nociception or Sensory Perception of Pain," which contained variants in nociception (Arrb1, Ccl3, Ephb1) and enlist sodium (Scn1a, Scn2a, Scn2b, Scn3a, Scn7a), pain activation (Scn9a), and potassium channel (Kcna1) genes. CONCLUSION: The multi-model analyses used herein reduced the confounding effects of random drift and the "founders" genetic background. The most differentiated bidirectionally selected genes across all three animal models were Scn9a, Scn1a, and Kcna, all of which are annotated in the nociception ontology. The complexity of neuroplasticity and nociception adds strength to the hypothesis that neuroplasticity and pain (physical or psychological) are prominent phenotypes genetically linked to the development of AUD.

12.
Genetics ; 224(4)2023 08 09.
Article in English | MEDLINE | ID: mdl-37310925

ABSTRACT

Codon bias and mRNA folding strength (mF) are hypothesized molecular mechanisms by which polymorphisms in genes modify protein expression. Natural patterns of codon bias and mF across genes as well as effects of altering codon bias and mF suggest that the influence of these 2 mechanisms may vary depending on the specific location of polymorphisms within a transcript. Despite the central role codon bias and mF may play in natural trait variation within populations, systematic studies of how polymorphic codon bias and mF relate to protein expression variation are lacking. To address this need, we analyzed genomic, transcriptomic, and proteomic data for 22 Saccharomyces cerevisiae isolates, estimated protein accumulation for each allele of 1,620 genes as the log of protein molecules per RNA molecule (logPPR), and built linear mixed-effects models associating allelic variation in codon bias and mF with allelic variation in logPPR. We found that codon bias and mF interact synergistically in a positive association with logPPR, and this interaction explains almost all the effects of codon bias and mF. We examined how the locations of polymorphisms within transcripts influence their effects and found that codon bias primarily acts through polymorphisms in domain-encoding and 3' coding sequences, while mF acts most significantly through coding sequences with weaker effects from untranslated regions. Our results present the most comprehensive characterization to date of how polymorphisms in transcripts influence protein expression.
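The logPPR quantity defined in this abstract (the log of protein molecules per RNA molecule) can be written as a short function. This is a minimal sketch: the abstract does not state the base of the logarithm, so the natural log used here is an assumption, and the function name `log_ppr` and the example numbers are hypothetical.

```python
import math


def log_ppr(protein_molecules: float, rna_molecules: float) -> float:
    """Log of protein molecules per RNA molecule (natural log assumed)."""
    if protein_molecules <= 0 or rna_molecules <= 0:
        raise ValueError("molecule counts must be positive")
    return math.log(protein_molecules / rna_molecules)


# Hypothetical counts for two alleles of one gene.
allele_a = log_ppr(protein_molecules=1000, rna_molecules=10)
allele_b = log_ppr(protein_molecules=500, rna_molecules=10)
print(allele_a - allele_b)  # difference in logPPR between the two alleles
```

Working on the log scale turns a fold-change in protein per RNA into an additive difference, which is the kind of response variable a linear mixed-effects model expects.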


Subject(s)
Codon Usage , Saccharomyces cerevisiae , RNA, Messenger/genetics , RNA, Messenger/metabolism , Saccharomyces cerevisiae/genetics , Saccharomyces cerevisiae/metabolism , Proteomics , RNA Folding , Codon/genetics
13.
Mol Biol Evol ; 40(6)2023 06 01.
Article in English | MEDLINE | ID: mdl-37221009

ABSTRACT

The rampant variability in codon bias existing between bacterial genomes is expected to interfere with horizontal gene transfer (HGT), a phenomenon that drives bacterial adaptation. However, delineating the constraints imposed by codon bias on functional integration of the transferred genes is complicated by multiple genomic and functional barriers controlling HGT, and by the dependence of the evolutionary outcomes of HGT on the host's environment. Here, we designed an experimental system in which codon composition of the transferred genes is the only variable triggering fitness change of the host. We replaced Escherichia coli's chromosomal folA gene encoding dihydrofolate reductase, an essential enzyme that constitutes a target for trimethoprim, with combinatorial libraries of synonymous codons of folA genes from trimethoprim-sensitive Listeria grayi and trimethoprim-resistant Neisseria sicca. The resulting populations underwent selection at a range of trimethoprim concentrations, and the ensuing changes in variant frequencies were used to infer the fitness effects of the individual combinations of codons. We found that when HGT causes overstabilization of the 5'-end mRNA, the fitness contribution of mRNA folding stability dominates over that of codon optimality. The 5'-end overstabilization can also lead to mRNA accumulation outside of the polysome, thus preventing the decay of the foreign transcripts despite the codon composition-driven reduction in translation efficiency. Importantly, the fitness effects of mRNA stability or codon optimality become apparent only at sub-lethal levels of trimethoprim individually tailored for each library, emphasizing the central role of the host's environment in shaping the codon bias compatibility of horizontally transferred genes.
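One common way to turn changes in variant frequencies into fitness estimates is a log enrichment ratio relative to a reference variant. The sketch below illustrates that general idea only; it is not the authors' method, and the function name `relative_log_enrichment`, the read counts and the choice of reference are all hypothetical.

```python
import math


def relative_log_enrichment(before: dict, after: dict, reference: str) -> dict:
    """Log change in frequency of each variant, relative to a reference variant.

    `before` and `after` map variant names to read counts measured before and
    after selection. Variants with a zero count at either time point are omitted.
    """
    total_before = sum(before.values())
    total_after = sum(after.values())

    def log_ratio(variant: str) -> float:
        freq_before = before[variant] / total_before
        freq_after = after[variant] / total_after
        return math.log(freq_after / freq_before)

    ref_score = log_ratio(reference)
    return {
        variant: log_ratio(variant) - ref_score
        for variant in before
        if before[variant] > 0 and after.get(variant, 0) > 0
    }


# Hypothetical read counts for a reference and two synonymous variants.
before = {"ref": 100, "A": 100, "B": 100}
after = {"ref": 100, "A": 200, "B": 50}
scores = relative_log_enrichment(before, after, reference="ref")
print(scores)
```

A positive score means the variant gained frequency relative to the reference during selection, and a negative score means it lost frequency. Real analyses would also need to handle sequencing noise, replicates and the number of generations elapsed.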


Subject(s)
Anti-Bacterial Agents , Trimethoprim , Anti-Bacterial Agents/pharmacology , Codon , RNA, Messenger , Drug Resistance, Microbial/genetics , Trimethoprim/pharmacology
14.
BMC Genomics ; 24(1): 290, 2023 May 30.
Article in English | MEDLINE | ID: mdl-37254040

ABSTRACT

BACKGROUND: In the evolutionary study of gene families, exploring the duplication mechanisms of gene families helps researchers understand their evolutionary history. The tubby-like protein (TLP) family is essential for growth and development in plants and animals. Much research has been done on its function; however, limited information is available with regard to the evolution of the TLP gene family. Herein, we systematically investigated the evolution of TLP genes in seven representative Poaceae lineages. RESULTS: Our research showed that the evolution of TLP genes was influenced not only by whole-genome duplication (WGD) and dispersed duplication (DSD) but also by transposed duplication (TRD), which has been neglected in previous research. For TLP family size, we found an evolutionary pattern of progressive shrinking in the grass family. Furthermore, the evolution of the TLP gene family was at least affected by evolutionary driving forces such as duplication, purifying selection, and base mutations. CONCLUSIONS: This study presents the first comprehensive evolutionary analysis of the TLP gene family in grasses. We demonstrated that the TLP gene family is also influenced by a transposed duplication mechanism. Several new insights into the evolution of the TLP gene family are presented. This work provides a good reference for studying gene evolution and the origin of duplication.


Subject(s)
Gene Duplication , Poaceae , Evolution, Molecular , Genome, Plant , Phylogeny , Poaceae/genetics
15.
Zhongguo Zhong Yao Za Zhi ; 48(1): 52-59, 2023 Jan.
Article in Chinese | MEDLINE | ID: mdl-36725258

ABSTRACT

This study investigated the chloroplast genome sequence of wild Atractylodes lancea from Yuexi in Anhui province by high-throughput sequencing, followed by characterization of the genome structure, which laid a foundation for the species identification, analysis of genetic diversity, and resource conservation of A. lancea. Specifically, the total genomic DNA was extracted from the leaves of A. lancea with the improved CTAB method. The chloroplast genome of A. lancea was sequenced by high-throughput sequencing technology, assembled with metaSPAdes and annotated with CPGAVAS2. Bioinformatics methods were employed for the analysis of simple sequence repeats (SSRs), inverted repeat (IR) borders, codon bias, and phylogeny. The results showed that the whole chloroplast genome of A. lancea was 153 178 bp, with an 84 226 bp large single copy (LSC) and an 18 658 bp small single copy (SSC) separated by a pair of IRs (25 147 bp). The genome had a GC content of 37.7% and 124 genes: 87 protein-coding genes, 8 rRNA genes, and 29 tRNA genes. It had 26 287 codons and encoded 20 amino acids. Phylogenetic analysis showed that Atractylodes species clustered into one clade and that A. lancea had a close genetic relationship with A. koreana. This study established a method for sequencing the chloroplast genome of A. lancea and enriched the genetic resources of Compositae. The findings are expected to lay a foundation for species identification, analysis of genetic diversity, and resource conservation of A. lancea.


Subject(s)
Atractylodes , Genome, Chloroplast , Lamiales , Phylogeny , Atractylodes/genetics , Whole Genome Sequencing , Microsatellite Repeats
16.
Sheng Wu Gong Cheng Xue Bao ; 39(2): 670-684, 2023 Feb 25.
Article in Chinese | MEDLINE | ID: mdl-36847097

ABSTRACT

The structure and size of the chloroplast genome of Castanopsis hystrix were determined on the Illumina HiSeq 2500 sequencing platform to understand how the chloroplast genome of C. hystrix differs from those of other species in the genus, and the evolutionary position of C. hystrix in the genus, so as to facilitate species identification, genetic diversity analysis and resource conservation of the genus. Bioinformatics analysis was used to perform sequence assembly, annotation and characteristic analysis. R, Python, MISA, CodonW and MEGA 6 were used to analyze the genome structure and gene number, codon bias, sequence repeats, simple sequence repeat (SSR) loci and phylogeny. The chloroplast genome of C. hystrix was 153 754 bp in size, with a quadripartite structure. A total of 130 genes were identified, including 85 protein-coding genes, 37 tRNA genes and 8 rRNA genes. According to codon bias analysis, the average effective number of codons was 55.5, indicating that codon usage was highly random and low in bias. Forty-five repeats and 111 SSR loci were detected by SSR and long repeat fragment analysis. Compared with related species, the chloroplast genome sequences were highly conserved, especially the protein-coding sequences. Phylogenetic analysis showed that C. hystrix is closely related to the Hainanese cone. In summary, we obtained the basic information and phylogenetic position of the chloroplast genome of C. hystrix, which will provide a preliminary basis for species identification, genetic diversity of natural populations and functional genomics research of C. hystrix.


Subject(s)
Genome, Chloroplast , Phylogeny , Codon/genetics , Genomics , Chloroplasts/genetics
17.
Chinese Journal of Biotechnology ; (12): 2939-2953, 2023.
Article in Chinese | WPRIM (Western Pacific) | ID: wpr-981242

ABSTRACT

The genomic DNA of Rubus rosaefolius was extracted and sequenced on the Illumina NovaSeq platform to obtain the complete chloroplast genome sequence, and the sequence characteristics and phylogeny of the chloroplast genes were analyzed. The results showed that the complete chloroplast genome of R. rosaefolius was 155 650 bp in length and had a typical quadripartite structure, including two inverted repeats (25 748 bp each), a large single-copy region (85 443 bp) and a small single-copy region (18 711 bp). A total of 131 genes were identified in the R. rosaefolius chloroplast genome, including 86 protein-coding genes, 37 tRNA genes and 8 rRNA genes. The GC content of the whole genome was 36.9%. The chloroplast genome of R. rosaefolius contains 47 dispersed repeats and 72 simple sequence repeat (SSR) loci. Leucine codons were the most frequently used, and codons ending in A/U were preferred. Phylogenetic analysis showed that R. rosaefolius had the closest relationship with R. taiwanicola, followed by R. rubraangustifolius and R. glandulosopunctatus. The chloroplast genome characteristics and phylogenetic analysis of R. rosaefolius provide a theoretical basis for its genetic diversity research and chloroplast development and utilization.


Subject(s)
Phylogeny , Rubus/genetics , Genome, Chloroplast , Fruit/genetics , Codon/genetics
18.
Article in Chinese | WPRIM (Western Pacific) | ID: wpr-970501

ABSTRACT

This study investigated the chloroplast genome sequence of wild Atractylodes lancea from Yuexi in Anhui province by high-throughput sequencing, followed by characterization of the genome structure, which laid a foundation for the species identification, analysis of genetic diversity, and resource conservation of A. lancea. Specifically, the total genomic DNA was extracted from the leaves of A. lancea with the improved CTAB method. The chloroplast genome of A. lancea was sequenced by high-throughput sequencing technology, assembled with metaSPAdes and annotated with CPGAVAS2. Bioinformatics methods were employed for the analysis of simple sequence repeats (SSRs), inverted repeat (IR) borders, codon bias, and phylogeny. The results showed that the whole chloroplast genome of A. lancea was 153 178 bp, with an 84 226 bp large single copy (LSC) and an 18 658 bp small single copy (SSC) separated by a pair of IRs (25 147 bp). The genome had a GC content of 37.7% and 124 genes: 87 protein-coding genes, 8 rRNA genes, and 29 tRNA genes. It had 26 287 codons and encoded 20 amino acids. Phylogenetic analysis showed that Atractylodes species clustered into one clade and that A. lancea had a close genetic relationship with A. koreana. This study established a method for sequencing the chloroplast genome of A. lancea and enriched the genetic resources of Compositae. The findings are expected to lay a foundation for species identification, analysis of genetic diversity, and resource conservation of A. lancea.


Subject(s)
Phylogeny , Atractylodes/genetics , Genome, Chloroplast , Whole Genome Sequencing , Microsatellite Repeats , Lamiales
19.
Chinese Journal of Biotechnology ; (12): 670-684, 2023.
Article in Chinese | WPRIM (Western Pacific) | ID: wpr-970399

ABSTRACT

The structure and size of the chloroplast genome of Castanopsis hystrix were determined on the Illumina HiSeq 2500 sequencing platform to understand how the chloroplast genome of C. hystrix differs from those of other species in the genus, and the evolutionary position of C. hystrix in the genus, so as to facilitate species identification, genetic diversity analysis and resource conservation of the genus. Bioinformatics analysis was used to perform sequence assembly, annotation and characteristic analysis. R, Python, MISA, CodonW and MEGA 6 were used to analyze the genome structure and gene number, codon bias, sequence repeats, simple sequence repeat (SSR) loci and phylogeny. The chloroplast genome of C. hystrix was 153 754 bp in size, with a quadripartite structure. A total of 130 genes were identified, including 85 protein-coding genes, 37 tRNA genes and 8 rRNA genes. According to codon bias analysis, the average effective number of codons was 55.5, indicating that codon usage was highly random and low in bias. Forty-five repeats and 111 SSR loci were detected by SSR and long repeat fragment analysis. Compared with related species, the chloroplast genome sequences were highly conserved, especially the protein-coding sequences. Phylogenetic analysis showed that C. hystrix is closely related to the Hainanese cone. In summary, we obtained the basic information and phylogenetic position of the chloroplast genome of C. hystrix, which will provide a preliminary basis for species identification, genetic diversity of natural populations and functional genomics research of C. hystrix.


Subject(s)
Phylogeny , Genome, Chloroplast , Codon/genetics , Genomics , Chloroplasts/genetics
20.
Genes (Basel) ; 13(12)2022 11 27.
Article in English | MEDLINE | ID: mdl-36553496

ABSTRACT

Paeonia suffruticosa Andr., a member of Paeoniaceae, is native to China. In its 1600 years' cultivation, more than 2000 cultivars for different purposes (ornamental, medicinal and oil use) have been inbred. However, there are still some controversies regarding the provenance of tree peony cultivars and the phylogenetic relationships between and within different cultivar groups. In this study, plastid genome sequencing was performed on 10 representative tree peony cultivars corresponding to 10 different flower types. Structure and comparative analyses of the plastid genomes showed that the total lengths of the chloroplast genome of the 10 cultivars ranged from 152,153 to 152,385 bp and encoded 84-88 protein-coding genes, 8 rRNAs and 31-40 tRNAs. The number of simple sequence repeats and interspersed repeat sequences of the 10 cultivars ranged from 65-68 and 40-42, respectively. Plastid phylogenetic relationships of Paeonia species/cultivars were reconstructed incorporating data from our newly sequenced plastid genomes and 15 published species, and results showed that subsect. Vaginatae was the closest relative to the central plains cultivar group with robust support, and that it may be involved in the formation of the group. Paeonia ostii was recovered as a successive sister group to this lineage. Additionally, eleven morphological characteristics of flowers were mapped to the phylogenetic skeleton to reconstruct the evolutionary trajectory of flower architecture in Paeoniaceae.


Subject(s)
Paeonia , Paeonia/genetics , Phylogeny , Flowers/genetics , Chromosome Mapping , Plastids/genetics