Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 4 de 4
Filter
Add more filters










Database
Language
Publication year range
1.
Biology (Basel) ; 11(10)2022 Oct 12.
Article in English | MEDLINE | ID: mdl-36290398

ABSTRACT

Mitragyna speciosa (Kratom) is a tropical narcotic plant native to Southeast Asia with unique pharmacological properties. Here, we report the first chromosome-scale assembly of the M. speciosa genome. We employed PacBio sequencing to obtain a preliminary assembly, which was subsequently scaffolded using the chromatin contact mapping technique (Hi-C) into 22 pseudomolecules. The final assembly was 692 Mb with a scaffold N50 of 26 Mb. We annotated a total of 39,708 protein-coding genes, and our gene predictions recovered 98.4% of the highly conserved orthologs based on the BUSCO analysis. The phylogenetic analysis revealed that M. speciosa diverged from the last common ancestors of Coffea arabica and Coffea canephora approximately 47.6 million years ago. Our analysis of the sequence divergence at fourfold-degenerate sites from orthologous gene pairs provided evidence supporting a genome-wide duplication in M. speciosa, agreeing with the report that members of the genus Mitragyna are tetraploid. The STRUCTURE and principal component analyses demonstrated that the 85 M. speciosa accessions included in this study were an admixture of two subpopulations. The availability of our high-quality chromosome-level genome assembly and the transcriptomic resources will be useful for future studies on the alkaloid biosynthesis pathway, as well as comparative phylogenetic studies in Mitragyna and related species.

2.
BMC Mol Cell Biol ; 21(1): 28, 2020 Apr 15.
Article in English | MEDLINE | ID: mdl-32295515

ABSTRACT

BACKGROUND: The details of the folding mechanisms have not yet been fully understood for many proteins, and it is believed that the information on the folding mechanism of a protein is encoded in its amino acid sequence. ß-trefoil proteins are known to have the same 3D scaffold, namely, a three-fold symmetric scaffold, despite the proteins' low sequence identity among superfamilies. In this study, we extract an initial folding unit from the amino acid sequences of irregular ß-trefoil proteins by constructing an average distance map (ADM) and utilizing inter-residue average distance statistics to determine the relative contact frequencies for residue pairs in terms of F values. We compare our sequence-based prediction results with the packing between hydrophobic residues in native 3D structures and a Go-model simulation. RESULTS: The ADM and F-value analyses predict that the N-terminal and C-terminal regions are compact and that the hydrophobic residues at the central region can be regarded as an interaction center with other residues. These results correspond well to those of the Go-model simulations. Moreover, our results indicate that the irregular parts in the ß-trefoil proteins do not hinder the protein formation. Conserved hydrophobic residues on the ß5 strand are always the interaction center of packing between the conserved hydrophobic residues in both regular and irregular ß-trefoil proteins. CONCLUSIONS: We revealed that the ß5 strand plays an important role in ß-trefoil protein structure construction. The sequence-based methods used in this study can extract the protein folding information from only amino acid sequence data, and well corresponded to 3D structure-based Go-model simulation and available experimental results.


Subject(s)
Models, Molecular , Protein Folding , Trefoil Factors/chemistry , Amino Acid Sequence , Computer Simulation
3.
Proteins ; 88(6): 740-758, 2020 06.
Article in English | MEDLINE | ID: mdl-31833097

ABSTRACT

Describing the whole story of protein folding is currently the main enigmatic problem in molecular bioinformatics study. Protein folding mechanisms have been intensively investigated with experimental as well as simulation techniques. Since a protein folds into its specific 3D structure from a unique amino acid sequence, it is interesting to extract as much information as possible from the amino acid sequence of a protein. Analyses based on inter-residue average distance statistics and a coarse-grained Go-model simulation were conducted on Ig and FN3 domains of a titin protein to decode the folding mechanisms from their sequence data and native structure data, respectively. The central region of all domains was predicted to be an initial folding unit, that is, stable in an early state of folding. This common feature coincides well with the experimental results and underscores the significance of the ß-sandwich proteins' common structure, namely, the key strands for folding and the Greek-key motif, which is located in the central region. We confirmed that our sequence-based techniques were able to predict the initial folding event just next to the denatured state and that a 3D-based Go-model simulation can be used to investigate the whole process of protein folding.


Subject(s)
Amino Acids/chemistry , Connectin/chemistry , Protein Folding , Amino Acid Sequence , Amino Acids/metabolism , Connectin/metabolism , Humans , Kinetics , Models, Molecular , Protein Conformation, alpha-Helical , Protein Conformation, beta-Strand , Protein Interaction Domains and Motifs , Sequence Alignment , Sequence Homology, Amino Acid , Thermodynamics
4.
Proteins ; 87(2): 120-135, 2019 02.
Article in English | MEDLINE | ID: mdl-30520530

ABSTRACT

To understand the folding mechanism of a protein is one of the goals in bioinformatics study. Nowadays, it is enigmatic and difficult to extract folding information from amino acid sequence using standard bioinformatics techniques or even experimental protocols which can be time consuming. To overcome these problems, we aim to extract the initial folding unit for titin protein (Ig and fnIII domains) by means of inter-residue average distance statistics, Average Distance Map (ADM) and contact frequency analysis (F-value). TI I27 and TNfn3 domains are used to represent the Ig-domain and fnIII-domain, respectively. Beta-strands 2, 3, 5, and 6 are significant for the initial folding processes of TI I27. The central strands of TNfn3 were predicted as a primary folding segment. Known 3D structure and unknown 3D structure domains were investigated by structure or non-structure based multiple sequence alignment, respectively, to learn the conserved hydrophobic residues and predicted compact region relevant to evolution. Our results show good correspondence to experimental data, phi-value and protection factor from H-D exchange experiments. The significance of conserved hydrophobic residues near F-value peaks for structural stability using hydrophobic packing is confirmed. Our prediction methods once again could extract a folding mechanism only knowing the amino acid sequence.


Subject(s)
Computational Biology/methods , Immunoglobulins/chemistry , Protein Folding , Protein Structure, Tertiary , Proteins/chemistry , Algorithms , Amino Acid Sequence , Connectin/chemistry , Connectin/genetics , Connectin/metabolism , Humans , Hydrophobic and Hydrophilic Interactions , Immunoglobulins/genetics , Immunoglobulins/metabolism , Models, Molecular , Models, Theoretical , Protein Stability , Proteins/genetics , Proteins/metabolism , Sequence Homology, Amino Acid , Tenascin/chemistry , Tenascin/genetics , Tenascin/metabolism
SELECTION OF CITATIONS
SEARCH DETAIL
...