Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 20
Filter
Add more filters










Publication year range
1.
Article in English | MEDLINE | ID: mdl-38673307

ABSTRACT

For a long time, traditional medicine has relied on the use of medicinal plants and herbal products which have served as the basis for numerous pharmaceuticals. Parkia biglobosa (Jacq) R.Br.ex. G. Don., commonly called the African locust bean tree, is a perennial deciduous plant native to West Africa where it is highly esteemed for its nutritional and traditional medicinal benefits. Parkia biglobosa's ethnomedicinal uses include microbial infections such as diarrhea and chronic diseases like hypertension and type 2 diabetes mellitus. This article presents the current understanding of the molecular mechanisms underlying Parkia biglobosa's biological effects. An electronic database search was conducted using P. biglobosa and its synonyms as keywords in Scientific Electronic Library Online, ISI Web of Knowledge, PubMed, Scopus, Science Direct, and Google Scholar. Consistently, scientific research has confirmed the medicinal effects of the plant's extracts and active phytochemicals, including antimicrobial, analgesic, antidiabetic, antihypertensive, hypolipidemic, and neuroprotective properties, among others. It highlights the contributions of identified innate phytochemicals and existing limitations to therapeutic applications, as well as the need for and prospects for further research. Advancing our understanding of the medicinal plant's biological mechanisms and the contributions of the active phytochemicals would allow for more effective exploration of its vast pharmacological potential and facilitate clinical applications.


Subject(s)
Fabaceae , Humans , Plant Extracts/pharmacology , Plant Extracts/therapeutic use , Phytochemicals/pharmacology , Phytochemicals/therapeutic use , Animals , Plants, Medicinal/chemistry , Medicine, African Traditional
2.
Anal Methods ; 9(4): 724-728, 2017 Jan 28.
Article in English | MEDLINE | ID: mdl-28217148

ABSTRACT

Cordyceps sinensis (C. sinensis) is a natural product that has diverse nutritional and medicinal values. Since the availability of natural C. sinensis becomes limited its authentication and quality control is of high significance. Herein we report on profiling of metals in C. sinensis by using inductively coupled plasma mass spectrometry (ICP-MS). The analysis reveals that C. sinensis contains a wide array of essential elements, including P, Mg, Zn, Cu, Fe, etc. Toxic metals detected are Cd, Pb, and As. In all five samples analyzed Pb contents are below 2.0 ppm. Arsenic level in C. sinensis caterpillar is significantly higher than that in its mycelium and varies from 3.0 to 32 ppm likely due to soil contamination. It's for the first time demonstrated in this work that clustering analysis on the proposed metal profiles consisting of 24 elements is very useful to identify "abnormal" C. sinensis samples, thus adding another dimension to the effective means for authentication and quality assessment of this highly demanded previous natural product.

3.
Int J Environ Res Public Health ; 13(4): 379, 2016 Mar 29.
Article in English | MEDLINE | ID: mdl-27043589

ABSTRACT

Exposure to environmental hazards has been associated with diseases in humans. The identification of single nucleotide polymorphisms (SNPs) in human populations exposed to different environmental hazards, is vital for detecting the genetic risks of some important human diseases. Several studies in this field have been conducted on glutathione S-transferases (GSTs), a phase II detoxification superfamily, to investigate its role in the occurrence of diseases. Human GSTs consist of cytosolic and microsomal superfamilies that are further divided into subfamilies. Based on scientific search engines and a review of the literature, we have found a large amount of published articles on human GST super- and subfamilies that have greatly assisted in our efforts to examine their role in health and disease. Because of its polymorphic variations in relation to environmental hazards such as air pollutants, cigarette smoke, pesticides, heavy metals, carcinogens, pharmaceutical drugs, and xenobiotics, GST is considered as a significant biomarker. This review examines the studies on gene-environment interactions related to various diseases with respect to single nucleotide polymorphisms (SNPs) found in the GST superfamily. Overall, it can be concluded that interactions between GST genes and environmental factors play an important role in human diseases.


Subject(s)
Gene-Environment Interaction , Genetic Predisposition to Disease , Glutathione Transferase/genetics , Humans , Polymorphism, Single Nucleotide
4.
Cell Rep ; 13(9): 1868-80, 2015 Dec 01.
Article in English | MEDLINE | ID: mdl-26655902

ABSTRACT

Genes expressing circadian RNA rhythms are enriched for metabolic pathways, but the adaptive significance of cyclic gene expression remains unclear. We estimated the genome-wide synthetic and degradative cost of transcription and translation in three organisms and found that the cost of cycling genes is strikingly higher compared to non-cycling genes. Cycling genes are expressed at high levels and constitute the most costly proteins to synthesize in the genome. We demonstrate that metabolic cycling is accelerated in yeast grown under higher nutrient flux and the number of cycling genes increases ∼40%, which are achieved by increasing the amplitude and not the mean level of gene expression. These results suggest that rhythmic gene expression optimizes the metabolic cost of global gene expression and that highly expressed genes have been selected to be downregulated in a cyclic manner for energy conservation.


Subject(s)
Energy Metabolism/genetics , Gene Regulatory Networks , Genome , Animals , Circadian Rhythm/physiology , Drosophila/metabolism , Glucose/metabolism , Liver/metabolism , Metabolic Networks and Pathways/genetics , Mice , Proteome/metabolism , RNA, Messenger/metabolism , Saccharomyces cerevisiae/genetics , Saccharomyces cerevisiae/metabolism , Transcriptome
5.
Genes Dev ; 29(20): 2081-96, 2015 Oct 15.
Article in English | MEDLINE | ID: mdl-26494785

ABSTRACT

Mutations in the transcription factor Forkhead box p1 (FOXP1) are causative for neurodevelopmental disorders such as autism. However, the function of FOXP1 within the brain remains largely uncharacterized. Here, we identify the gene expression program regulated by FoxP1 in both human neural cells and patient-relevant heterozygous Foxp1 mouse brains. We demonstrate a role for FoxP1 in the transcriptional regulation of autism-related pathways as well as genes involved in neuronal activity. We show that Foxp1 regulates the excitability of striatal medium spiny neurons and that reduction of Foxp1 correlates with defects in ultrasonic vocalizations. Finally, we demonstrate that FoxP1 has an evolutionarily conserved role in regulating pathways involved in striatal neuron identity through gene expression studies in human neural progenitors with altered FOXP1 levels. These data support an integral role for FoxP1 in regulating signaling pathways vulnerable in autism and the specific regulation of striatal pathways important for vocal communication.


Subject(s)
Autism Spectrum Disorder/physiopathology , Corpus Striatum/physiopathology , Forkhead Transcription Factors/metabolism , Repressor Proteins/metabolism , Signal Transduction/genetics , Animals , Autism Spectrum Disorder/genetics , Cells, Cultured , Disease Models, Animal , Forkhead Transcription Factors/genetics , Gene Expression Regulation/genetics , Haploinsufficiency , Hippocampus/physiopathology , Humans , Mice , Mice, Inbred C57BL , Mutation , Neurons/pathology , Repressor Proteins/genetics , Verbal Behavior/physiology
6.
Methods Enzymol ; 551: 285-321, 2015.
Article in English | MEDLINE | ID: mdl-25662462

ABSTRACT

Genome-wide analyses have revolutionized our ability to study the transcriptional regulation of circadian rhythms. The advent of next-generation sequencing methods has facilitated the use of two such technologies, ChIP-seq and RNA-seq. In this chapter, we describe detailed methods and protocols for these two techniques, with emphasis on their usage in circadian rhythm experiments in the mouse liver, a major target organ of the circadian clock system. Critical factors for these methods are highlighted and issues arising with time series samples for ChIP-seq and RNA-seq are discussed. Finally, detailed protocols for library preparation suitable for Illumina sequencing platforms are presented.


Subject(s)
Circadian Clocks , Gene Expression Regulation , Acetylation , Animals , CLOCK Proteins/isolation & purification , CLOCK Proteins/metabolism , Cell Line , Chromatin Immunoprecipitation , Histones/metabolism , Humans , Methylation , Protein Processing, Post-Translational , Sequence Analysis, RNA , Transcription, Genetic
7.
Science ; 342(6165): 1508-12, 2013 Dec 20.
Article in English | MEDLINE | ID: mdl-24357318

ABSTRACT

The inbred mouse C57BL/6J is the reference strain for genome sequence and for most behavioral and physiological phenotypes. However, the International Knockout Mouse Consortium uses an embryonic stem cell line derived from a related C57BL/6N substrain. We found that C57BL/6N has a lower acute and sensitized response to cocaine and methamphetamine. We mapped a single causative locus and identified a nonsynonymous mutation of serine to phenylalanine (S968F) in Cytoplasmic FMRP interacting protein 2 (Cyfip2) as the causative variant. The S968F mutation destabilizes CYFIP2, and deletion of the C57BL/6N mutant allele leads to acute and sensitized cocaine-response phenotypes. We propose that CYFIP2 is a key regulator of cocaine response in mammals and present a framework to use mouse substrains to identify previously unknown genes and alleles regulating behavior.


Subject(s)
Cocaine-Related Disorders/genetics , Cocaine-Related Disorders/psychology , Cocaine/administration & dosage , Drug-Seeking Behavior , Nerve Tissue Proteins/physiology , Adaptor Proteins, Signal Transducing , Amino Acid Substitution , Animals , Central Nervous System Stimulants/administration & dosage , Methamphetamine/administration & dosage , Mice , Mice, Inbred C57BL , Mice, Knockout , Motor Activity/drug effects , Mutation , Nerve Tissue Proteins/genetics , Phenylalanine/genetics , Polymorphism, Single Nucleotide , Psychomotor Performance/drug effects , Quantitative Trait Loci , Serine/genetics
8.
Lymphat Res Biol ; 11(3): 155-71, 2013 Sep.
Article in English | MEDLINE | ID: mdl-24044756

ABSTRACT

The principal function of the lymphatic system is to transport lymph from the interstitium to the nodes and then from the nodes to the blood. In doing so lymphatics play important roles in fluid homeostasis, macromolecular/antigen transport and immune cell trafficking. To better understand the genes that contribute to their unique physiology, we compared the transcriptional profile of muscular lymphatics (prenodal mesenteric microlymphatics and large, postnodal thoracic duct) to axillary and mesenteric arteries and veins isolated from rats. Clustering of the differentially expressed genes demonstrated that the lymph versus blood vessel differences were more profound than between blood vessels, particularly the microvessels. Gene ontology functional category analysis indicated that microlymphatics were enriched in antigen processing/presentation, IgE receptor signaling, catabolic processes, translation and ribosome; while they were diminished in oxygen transport, regulation of cell proliferation, glycolysis and inhibition of adenylate cyclase activity by G-proteins. We evaluated the differentially expressed microarray genes/products by qPCR and/or immunofluorescence. Immunofluorescence documented that multiple MHC class II antigen presentation proteins were highly expressed by an antigen-presenting cell (APC) type found resident within the lymphatic wall. These APCs also expressed CD86, a co-stimulatory protein necessary for T-cell activation. We evaluated the distribution and phenotype of APCs within the pre and postnodal lymphatic network. This study documents a novel population of APCs resident within the walls of muscular, prenodal lymphatics that indicates novel roles in antigen sampling and immune responses. In conclusion, these prenodal lymphatics exhibit a unique profile that distinguishes them from blood vessels and highlights the role of the lymphatic system as an immunovascular system linking the parenchymal interstitium, lymph nodes and the blood.


Subject(s)
Arteries/metabolism , Gene Expression Profiling/methods , Immunity/genetics , Lymphatic System/metabolism , Veins/metabolism , Animals , Antigen Presentation/genetics , Antigen-Presenting Cells/metabolism , Cluster Analysis , Fluorescent Antibody Technique , Gene Ontology , Histocompatibility Antigens Class II/genetics , Histocompatibility Antigens Class II/metabolism , Male , Mesentery/metabolism , Oligonucleotide Array Sequence Analysis , Rats , Rats, Sprague-Dawley , Reverse Transcriptase Polymerase Chain Reaction
9.
Science ; 338(6105): 349-54, 2012 Oct 19.
Article in English | MEDLINE | ID: mdl-22936566

ABSTRACT

The mammalian circadian clock involves a transcriptional feed back loop in which CLOCK and BMAL1 activate the Period and Cryptochrome genes, which then feedback and repress their own transcription. We have interrogated the transcriptional architecture of the circadian transcriptional regulatory loop on a genome scale in mouse liver and find a stereotyped, time-dependent pattern of transcription factor binding, RNA polymerase II (RNAPII) recruitment, RNA expression, and chromatin states. We find that the circadian transcriptional cycle of the clock consists of three distinct phases: a poised state, a coordinated de novo transcriptional activation state, and a repressed state. Only 22% of messenger RNA (mRNA) cycling genes are driven by de novo transcription, suggesting that both transcriptional and posttranscriptional mechanisms underlie the mammalian circadian clock. We also find that circadian modulation of RNAPII recruitment and chromatin remodeling occurs on a genome-wide scale far greater than that seen previously by gene expression profiling.


Subject(s)
Chromatin/metabolism , Circadian Clocks/genetics , Cryptochromes/genetics , Epigenesis, Genetic , Liver/physiology , Transcription, Genetic , Transcriptional Activation , ARNTL Transcription Factors/metabolism , Animals , CLOCK Proteins/metabolism , Chromatin Assembly and Disassembly/genetics , DNA, Intergenic , Enhancer Elements, Genetic , Gene Expression Profiling , Genetic Loci , Histones/metabolism , Liver/metabolism , Male , Mice , Mice, Inbred C57BL , Period Circadian Proteins/genetics , RNA Polymerase II/metabolism , RNA, Messenger/genetics
10.
Biopolymers ; 97(8): 617-28, 2012 Aug.
Article in English | MEDLINE | ID: mdl-22605553

ABSTRACT

The structures of four small RNAs each containing a different version of the UNAC loop were determined in solution using NMR spectroscopy and restrained molecular dynamics. The UMAC tetraloops (where M is A or C) exhibited a typical GNRA fold including at least one hydrogen bond between the first U and fourth C. In contrast, UGAC and UUAC tetraloops have a different orientation of the first and fourth residues, such that they do not closely mimic the GNRA fold. Although the UMAC tetraloops are excellent structural mimics of the GNRA tetraloop backbone, sequence comparisons typically do not reveal co-variation between the two loop types. The limited covariation is attributed to differences in the location of potential hydrogen bond donors and acceptors as a result of the replacement of the terminal A of GNRA with C in the UMAC version. Thus, UMAC loops do not readily form the common GNRA tetraloop-receptor interaction. The loop at positions 863-866 in E. coli 16S ribosomal RNA appears to be a major exception. However, in this case the GNRA loop does not in fact engage in the usual base to backbone tertiary interactions. In summary, UMAC loops are not just an alternative sequence version of the GNRA loop family, but instead they expand the types of interactions, or lack thereof, that are possible. From a synthetic biology perspective their inclusion in an artificial RNA may allow the establishment of a stable loop structure while minimizing unwanted long range interactions or permitting alternative long-range interactions. © 2012 Wiley Periodicals, Inc. Biopolymers 97: 617-628, 2012.


Subject(s)
Models, Molecular , Nucleic Acid Conformation , RNA Folding , RNA, Ribosomal/chemistry , Bacteria , Escherichia coli , Hydrogen Bonding , Magnetic Resonance Spectroscopy , Molecular Dynamics Simulation , RNA, Ribosomal/chemical synthesis
11.
Technol Cancer Res Treat ; 9(3): 219-30, 2010 Jun.
Article in English | MEDLINE | ID: mdl-20441232

ABSTRACT

The large amounts of microarray data provide us a great opportunity to identify gene expression profiles (GEPs) in different tissues or disease states. Disease-specific biomarker genes likely share GEPs that are distinct in disease samples as compared with normal samples. The similarity of the GEPs may be evaluated by Pearson Correlation Coefficient (PCC) and the distinctness of GEPs may be assessed by Kolmogorov-Smirnov distance (KSD). In this study, we used the PCC and KSD metrics for GEPs to identify disease-specific (cancer-specific) biomarkers. We first analyzed and compared GEPs using microarray datasets for smoking and lung cancer. We found that the number of genes with highly different GEPs between comparing groups in smoking dataset was much larger than that in lung cancer dataset; this observation was further verified when we compared GEPs in smoking dataset with prostate cancer datasets. Moreover, our Gene Ontology analysis revealed that the top ranked biomarker candidate genes for prostate cancer were highly enriched in molecular function categories such as 'cytoskeletal protein binding' and biological process categories such as 'muscle contraction'. Finally, we used two genes, ACTC1 (encoding an actin subunit) and HPN (encoding hepsin), to demonstrate the feasibility of diagnosing and monitoring prostate cancer using the expression intensity histograms of marker genes. In summary, our results suggested that this approach might prove promising and powerful for diagnosing and monitoring the patients who come to the clinic for screening or evaluation of a disease state including cancer.


Subject(s)
Biomarkers, Tumor/genetics , Gene Expression Profiling/methods , Neoplasms/diagnosis , Neoplasms/genetics , Humans , Lung Neoplasms/etiology , Lung Neoplasms/genetics , Male , Prognosis , Prostatic Neoplasms/genetics , Smoking/adverse effects
12.
PLoS One ; 5(2): e9056, 2010 Feb 04.
Article in English | MEDLINE | ID: mdl-20140228

ABSTRACT

BACKGROUND: Identification of genes with switch-like properties will facilitate discovery of regulatory mechanisms that underlie these properties, and will provide knowledge for the appropriate application of Boolean networks in gene regulatory models. As switch-like behavior is likely associated with tissue-specific expression, these gene products are expected to be plausible candidates as tissue-specific biomarkers. METHODOLOGY/PRINCIPAL FINDINGS: In a systematic classification of genes and search for biomarkers, gene expression profiles (GEPs) of more than 16,000 genes from 2,145 mouse array samples were analyzed. Four distribution metrics (mean, standard deviation, kurtosis and skewness) were used to classify GEPs into four categories: predominantly-off, predominantly-on, graded (rheostatic), and switch-like genes. The arrays under study were also grouped and examined by tissue type. For example, arrays were categorized as 'brain group' and 'non-brain group'; the Kolmogorov-Smirnov distance and Pearson correlation coefficient were then used to compare GEPs between brain and non-brain for each gene. We were thus able to identify tissue-specific biomarker candidate genes. CONCLUSIONS/SIGNIFICANCE: The methodology employed here may be used to facilitate disease-specific biomarker discovery.


Subject(s)
Biomarkers/analysis , Gene Expression Profiling/statistics & numerical data , Oligonucleotide Array Sequence Analysis/statistics & numerical data , Proteins/genetics , Algorithms , Animals , Cluster Analysis , Computational Biology , Databases, Genetic , Gene Expression Regulation, Developmental , Gene Regulatory Networks , Genetic Predisposition to Disease/classification , Genetic Predisposition to Disease/genetics , Heart/embryology , Mice , Models, Genetic , Myocardium/metabolism , Proteins/classification , Proteins/metabolism , Software
13.
Biopolymers ; 89(3): 210-9, 2008 Mar.
Article in English | MEDLINE | ID: mdl-18023020

ABSTRACT

Structural data produced by a 2-ns molecular dynamics (MD) simulation on Geobacillus alanine racemase (AlaR; PDB: 1SFT) was used to study hydration around the two AlaR active sites. AlaR is a crucial enzyme for bacterial cell wall biosynthesis. It has been shown previously that the potency of an inhibitor can be increased by incorporating a functional group or atom that displaces hydration sites close to the substrate binding pocket of its target enzyme. The complete linkage algorithm was used for cluster analysis of the active site water positions from 126 solvent configurations sampled at regular intervals from the 2-ns MD simulation. Crystal waters in the 1SFT X-ray structure occupy most of the tightly bound water sites that were discovered. We show here that tightly bound water sites can be identified by cluster analysis of MD-generated coordinates starting with data supplied by a single X-ray structure, and we predict a highly conserved hydration site close to the carboxyl oxygen of L-Ala substrate. This approach holds promise for accelerating the drug design process. We also discuss an analysis of the well-known notion of residence time and introduce a new measure called retention time.


Subject(s)
Alanine Racemase/chemistry , Geobacillus stearothermophilus/enzymology , Models, Molecular , Water/chemistry , Alanine Racemase/metabolism , Binding Sites , Catalysis , Cluster Analysis , Computer Simulation , Geobacillus stearothermophilus/chemistry , Hydrogen Bonding , Models, Chemical , Motion , Protein Structure, Secondary , Solvents/chemistry
14.
J Bioinform Comput Biol ; 5(2a): 251-79, 2007 Apr.
Article in English | MEDLINE | ID: mdl-17589961

ABSTRACT

In most real-world gene expression data sets, there are often multiple sample classes with ordinals, which are categorized into the normal or diseased type. The traditional feature or attribute selection methods consider multiple classes equally without paying attention to the up/down regulation across the normal and diseased types of classes, while the specific gene selection methods particularly consider the differential expressions across the normal and diseased, but ignore the existence of multiple classes. In this paper, to improve the biomarker discovery, we propose to make the best use of these two aspects: the differential expressions (that can be viewed as the domain knowledge of gene expression data) and the multiple classes (that can be viewed as a kind of data set characteristic). Therefore, we simultaneously take into account these two aspects by employing the 1-rank generalized matrix approximations (GMA). Our results show that GMA cannot only improve the accuracy of classifying the samples, but also provide a visualization method to effectively analyze the gene expression data on both genes and samples. Based on the mechanism of matrix approximation, we further propose an algorithm, CBiomarker, to discover compact biomarker by reducing the redundancy.


Subject(s)
Algorithms , Biomarkers/metabolism , Computer Graphics , Gene Expression Profiling/methods , Models, Biological , Oligonucleotide Array Sequence Analysis/methods , User-Computer Interface , Computer Simulation , Databases, Protein , Humans
15.
Article in English | MEDLINE | ID: mdl-17473317

ABSTRACT

Recent advances in high throughput experiments and annotations via published literature have provided a wealth of interaction maps of several biomolecular networks, including metabolic, protein-protein, and protein-DNA interaction networks. The architecture of these molecular networks reveals important principles of cellular organization and molecular functions. Analyzing such networks, i.e., discovering dense regions in the network, is an important way to identify protein complexes and functional modules. This task has been formulated as the problem of finding heavy subgraphs, the Heaviest k-Subgraph Problem (k-HSP), which itself is NP-hard. However, any method based on the k-HSP requires the parameter k and an exact solution of k-HSP may still end up as a "spurious" heavy subgraph, thus reducing its practicability in analyzing large scale biological networks. We proposed a new formulation, called the rank-HSP, and two dynamical systems to approximate its results. In addition, a novel metric, called the Standard deviation and Mean Ratio (SMR), is proposed for use in "spurious" heavy subgraphs to automate the discovery by setting a fixed threshold. Empirical results on both the simulated graphs and biological networks have demonstrated the efficiency and effectiveness of our proposal.


Subject(s)
Algorithms , Gene Expression/physiology , Models, Biological , Protein Interaction Mapping/methods , Proteome/metabolism , Signal Transduction/physiology , Computer Simulation , Data Interpretation, Statistical
16.
Bioinformatics ; 23(11): 1339-47, 2007 Jun 01.
Article in English | MEDLINE | ID: mdl-17237040

ABSTRACT

MOTIVATION: Dilution design (Mixed tissue RNA) has been utilized by some researchers to evaluate and assess the performance of multiple microarray platforms. Current microarray data analysis approaches assume that the quantified signal intensities are linearly related to the expression of the corresponding genes in the sample. However, there are sources of nonlinearity in microarray expression measurements. Such nonlinearity study in the expressions of the RNA mixtures provides a new way to analyze gene expression data, and we argue that the nonlinearity can reveal novel information for microarray data analysis. Therefore, we proposed a statistical model, called proportion model, which is based on the linear regression analysis. To approximately quantify the nonlinearity in the dilution design, a new calibration, beta ratio (BR) was derived from the proportion model. Furthermore, a new adjusted fold change (adj-FC) was proposed to predict the true FC without nonlinearity, in particular for large FC. RESULTS: We applied our method to one microarray dilution dataset. The experimental results indicated that, to some extent, there are global biases comparing with the linear assumption for the significant genes. Further analysis of those highly expressed genes with significant nonlinearity revealed some promising results, e.g. 'poison' effect was discovered for some genes in RNA mixtures. The adj-FCs of those genes with 'poison' effect, indicate that the nonlinearity can be also caused by the inherent feature of the genes besides signal noise and technical variation. Moreover, when percentage of overlapping genes (POG) was used as a cross-platform consistency measure, adj-FC outperformed simple fold change to show that Affymetrix and Illumina platforms are consistent. AVAILABILITY: The R codes which implements all described methods, and some Supplementary material, are freely available from http://www.utdallas.edu/~ying.liu/BetaRatio.htm


Subject(s)
Algorithms , Databases, Genetic , Gene Expression Profiling/methods , Models, Genetic , Oligonucleotide Array Sequence Analysis/methods , RNA/genetics , Computer Simulation , Data Interpretation, Statistical , Nonlinear Dynamics
17.
Article in English | MEDLINE | ID: mdl-17369632

ABSTRACT

In most real-life gene expression data sets, there are often multiple sample classes with ordinals, which are categorized into the normal or diseased type. The traditional feature or attribute selection methods consider multiple classes equally without paying attention to the up/down regulation across the normal and diseased types of classes, while the specific gene selection methods particularly consider the differential expressions across the normal and diseased, but ignore the existence of multiple classes. In this paper, for improving the biomarker discovery, we propose to make the best use of these two aspects: the differential expressions (that can be viewed as the domain knowledge of gene expression data) and the multiple classes (that can be viewed as a kind of data set characteristic). Therefore, we simultaneously take into account these two aspects by employing the 1-rank generalized matrix approximations (GMA). Our results show that the consideration of both aspects can not only improve the accuracy of classifying the samples, but also provide a visualization method to effectively analyze the gene expression data on both genes and samples. Based on the GMA mechanism, we further propose an algorithm for obtaining the compact biomarker by reducing the redundancy.


Subject(s)
Biomarkers, Tumor/metabolism , Biomarkers/metabolism , Computational Biology/methods , Gene Expression Profiling/methods , Gene Expression Regulation, Leukemic , Gene Expression Regulation , Algorithms , Cell Line, Tumor , Cluster Analysis , Genomics , Humans , Lupus Vulgaris/metabolism , Models, Genetic , Models, Statistical , Oligonucleotide Array Sequence Analysis
18.
Nucleic Acids Res ; 33(10): 3145-53, 2005.
Article in English | MEDLINE | ID: mdl-15939932

ABSTRACT

Helix 42 of Domain II of Escherichia coli 23S ribosomal RNA underlies the L7/L12 stalk in the ribosome and may be significant in positioning this feature relative to the rest of the 50S ribosomal subunit. Unlike the Haloarcula marismortui and Deinococcus radiodurans examples, the lower portion of helix 42 in E.coli contains two consecutive G*A oppositions with both adenines on the same side of the stem. Herein, the structure of an analog of positions 1037-1043 and 1112-1118 in the helix 42 region is reported. NMR spectra and structure calculations support a cis Watson-Crick/Watson-Crick (cis W.C.) G*A conformation for the tandem (G*A)2 in the analog and a minimally perturbed helical duplex stem. Mg2+ titration studies imply that the cis W.C. geometry of the tandem (G*A)2 probably allows O6 of G20 and N1 of A4 to coordinate with a Mg2+ ion as indicated by the largest chemical shift changes associated with the imino group of G20 and the H8 of G20 and A4. A cross-strand bridging Mg2+ coordination has also been found in a different sequence context in the crystal structure of H.marismortui 23S rRNA, and therefore it may be a rare but general motif in Mg2+ coordination.


Subject(s)
Escherichia coli/genetics , Magnesium/chemistry , Models, Molecular , RNA, Bacterial/chemistry , RNA, Ribosomal, 23S/chemistry , Adenine/chemistry , Base Pairing , Base Sequence , Binding Sites , Guanine/chemistry , Hydrogen Bonding , Molecular Sequence Data , Nuclear Magnetic Resonance, Biomolecular , Protons , RNA, Bacterial/metabolism , RNA, Ribosomal, 23S/metabolism , Ribosomal Proteins/chemistry
19.
RNA ; 11(4): 412-23, 2005 Apr.
Article in English | MEDLINE | ID: mdl-15769871

ABSTRACT

We have developed a computational approach for the comparison and classification of RNA loop structures. Hairpin or interior loops identified in atomic resolution RNA structures were intercompared by conformational matching. The root-mean-square deviation (RMSD) values between all pairs of RNA fragments of interest, even if from different molecules, are calculated. Subsequently, cluster analysis is performed on the resulting matrix of RMSD distances using the unweighted pair group method with arithmetic mean (UPGMA). The cluster analysis objectively reveals groups of folds that resemble one another. To demonstrate the utility of the approach, a comprehensive analysis of all the terminal hairpin tetraloops that have been observed in 15 RNA structures that have been determined by X-ray crystallography was undertaken. The method found major clusters corresponding to the well-known GNRA and UNCG types. In addition, two tetraloops with the unusual primary sequence UMAC (M is A or C) were successfully assigned to the GNRA cluster. Larger loop structures were also examined and the clustering results confirmed the occurrence of variations of the GNRA and UNCG tetraloops in these loops and provided a systematic means for locating them. Nineteen examples of larger loops that closely resemble either the GNRA or UNCG tetraloop were found in the large ribosomal RNAs. When the clustering approach was extended to include all structures in the SCOR database, novel relationships were detected including one between the ANYA motif and a less common folding of the GAAA tetraloop sequence.


Subject(s)
Nucleic Acid Conformation , RNA/chemistry , Cluster Analysis , Crystallography, X-Ray , Databases, Nucleic Acid
20.
Biopolymers ; 63(4): 247-60, 2002 Apr 05.
Article in English | MEDLINE | ID: mdl-11807752

ABSTRACT

Many examples exist of charged amino acids that play a role in attracting or holding a charged ligand toward or inside an oppositely charged binding pocket of the protein. For example, the enzymes superoxide dismutase, triose-phosphate isomerase, and acetylcholinesterase can steer ligands toward their oppositely charged binding pockets or gorges. Interestingly, in our Brownian dynamics simulations of a phosphate-binding protein, we discovered that negatively charged phosphate (HPO(2-)(4)) could make its way into the negatively charged binding pocket. In fact, the phosphate-binding protein exhibits counterintuitive kinetics of association. That is, one would expect that the rate of association would increase on increases to the ionic strength since the interaction between the ligand, with a charge of -2, and the electronegative binding pocket would be repulsive and greater screening should reduce this repulsion and increase the rate of association. However, the opposite is seen-i.e., the rate of association decreases on increases in the ionic strength. We used Brownian dynamics techniques to compute the diffusion limited association rate constants between the negatively charged phosphate ligand and several open forms of PBP (wild-type and several mutants based on an x-ray structure of open-form PBP, mutant T141D). With the appropriate choices of reaction criteria and molecular parameters, the ligand was able to diffuse into the binding pocket. A number of residues influence binding of the ligand within the pocket via hydrogen bonds or salt bridges. Arg135 partially neutralizes the charges on the HPO(2-)(4) ligand in the binding pocket, allowing it to enter. It is also found that the positive electrostatic patches above and below the binding entrance of PBP contribute the major attractive forces that direct the ligand toward the surface of the protein near the binding site.


Subject(s)
Phosphate-Binding Proteins/chemistry , Bacterial Proteins/chemistry , Bacterial Proteins/genetics , Binding Sites , Biopolymers/chemistry , Computer Simulation , Electrochemistry , Escherichia coli/chemistry , Escherichia coli/genetics , Ligands , Models, Molecular , Mutagenesis, Site-Directed , Phosphate-Binding Proteins/genetics , Phosphates/chemistry , Protein Conformation , Thermodynamics
SELECTION OF CITATIONS
SEARCH DETAIL
...