Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 4 de 4
Filter
Add more filters










Database
Language
Publication year range
1.
Bioinformatics ; 15(7-8): 669-86, 1999.
Article in English | MEDLINE | ID: mdl-10487874

ABSTRACT

MOTIVATION: The goal of the work was to develop a WWW-oriented computer system providing a maximal integration of informational and software resources on the regulation of gene expression and navigation through them. Rapid growth of the variety and volume of information accumulated in the databases on regulation of gene expression necessarily requires the development of computer systems for automated discovery of the knowledge that can be further used for analysis of regulatory genomic sequences. RESULTS: The GeneExpress system developed includes the following major informational and software modules: (1) Transcription Regulation (TRRD) module, which contains the databases on transcription regulatory regions of eukaryotic genes and TRRD Viewer for data visualization; (2) Site Activity Prediction (ACTIVITY), the module for analysis of functional site activity and its prediction; (3) Site Recognition module, which comprises (a) B-DNA-VIDEO system for detecting the conformational and physicochemical properties of DNA sites significant for their recognition, (b) Consensus and Weight Matrices (ConsFrec) and (c) Transcription Factor Binding Sites Recognition (TFBSR) systems for detecting conservative contextual regions of functional sites and their recognition; (4) Gene Networks (GeneNet), which contains an object-oriented database accumulating the data on gene networks and signal transduction pathways, and the Java-based Viewer for exploration and visualization of the GeneNet information; (5) mRNA Translation (Leader mRNA), designed to analyze structural and contextual properties of mRNA 5'-untranslated regions (5'-UTRs) and predict their translation efficiency; (6) other program modules designed to study the structure-function organization of regulatory genomic sequences and regulatory proteins. AVAILABILITY: GeneExpress is available at http://wwwmgs.bionet.nsc. ru/systems/GeneExpress/ and the links to the mirror site(s) can be found at http://wwwmgs.bionet.nsc.ru/mgs/links/mirrors.html+ ++.


Subject(s)
Computer Systems , Databases, Factual , Gene Expression , Algorithms , Artificial Intelligence , Base Sequence , Binding Sites/genetics , Chemical Phenomena , Chemistry, Physical , DNA/chemistry , DNA/genetics , DNA/metabolism , Eukaryotic Cells , Internet , Nucleic Acid Conformation , Promoter Regions, Genetic , Protein Biosynthesis , RNA, Messenger/genetics , Software , TATA Box , Transcription Factors/metabolism
2.
Comput Appl Biosci ; 11(5): 477-88, 1995 Oct.
Article in English | MEDLINE | ID: mdl-8590170

ABSTRACT

A method for identification of eukaryotic promoters by localization of binding sites for transcription factors has been suggested. The binding sites for a range of transcription factors have been found to be distributed unevenly. Based on these distributions, we have constructed a weight matrix of binding site localization. On the basis of the weight matrix we have, in turn, designed an algorithm for promoter recognition. To increase the accuracy of the method, we have developed a routine that breaks any promoter sample into subsamples. The method to be reported on allows much better recognition accuracy than does the approach based on detection of the TATA box. In particular, the overprediction error is three times lower following our method. The program FunSiteP recognizes promoters from newly uncovered sequences and tentatively identifies the functional class the promoters must belong to. We have introduced the notion of 'regulatory potential' for the degree to which any region of the sequences is similar to the real eukaryotic promoter. By making use of the potential, we have revealed putative transcription start sites and extended regions of transcription regulation.


Subject(s)
Promoter Regions, Genetic , Software , Transcription Factors/metabolism , Algorithms , Animals , Base Sequence , Binding Sites/genetics , DNA/genetics , DNA/metabolism , Databases, Factual , Eukaryotic Cells , Humans , Molecular Sequence Data , TATA Box/genetics
3.
Comput Appl Biosci ; 11(1): 39-44, 1995 Feb.
Article in English | MEDLINE | ID: mdl-7540937

ABSTRACT

A new version of the program PROANAL is described. A multiple linear regression analysis of the protein structure--activity relationship allows one to investigate the combinations of protein sites and factors influencing the activity. The program also provides the possibility to seek out protein sites, conservative or variable in variations of physicochemical characteristics, and regions with high or low values of these characteristics. PROANAL2 may be useful in the simulation of protein-engineering experiments and in the search of a number of protein regions such as functional sites, secondary structures, solvent-exposed regions, T- and B-cell antigenic determinants, etc.


Subject(s)
Proteins/genetics , Sequence Alignment/methods , Software , Algorithms , Amino Acid Sequence , Chemical Phenomena , Chemistry, Physical , Conserved Sequence , Disintegrins , Evaluation Studies as Topic , Interferons/chemistry , Interferons/genetics , Peptides/chemistry , Peptides/genetics , Proteins/chemistry , Regression Analysis , Sequence Alignment/statistics & numerical data , Structure-Activity Relationship
4.
Article in English | MEDLINE | ID: mdl-7584437

ABSTRACT

We present the computer tool FUNSITE for description and analysis of regulatory sequences of eukaryotic genomes. The tool consists of the following main parts: 1) An integrated database for genomic regulatory sequences. The integrated database was designed on the basis of the databases TRANSFAC (Wingender 1994) and TRRD (Kel et al. 1995) that are currently under development. The following functions are performed: i) linkage to the EMBL database; ii) preparing samples of definite types of functional sites with their flanking sequences; iii) preparing samples of promoter sequences; iv) preparing samples of transcription factors classified with regard to structural and functional features of DNA binding and activating domains, functional families of the factors, their tissue specificity and other functional features; v) access to data on mutual disposition of cis-elements within the regulatory regions. 2) The second component of FUNSITE tool is the set of programs for analysis of the structural organization of regulatory sequences: i) Program for revealing of potential transcription factors binding sites based on their consensi; ii) program for revealing of the potential binding sites using homology search with nucleotide sequences of real binding sites; iii) program for analysis of oligonucleotide context features which are characteristic of flank sequences of the binding sites; iv) program for design of recognition method for the functional sites based on generalized weight matrix; v) program for revealing potential composite elements. The results of analysis of the promoter sequences of eukaryotic genes with the FUNSITE are presented, too.


Subject(s)
DNA/chemistry , Databases, Factual , Genome , Multigene Family , Regulatory Sequences, Nucleic Acid , Software , Amino Acid Sequence , Animals , Arabidopsis/genetics , Base Sequence , Binding Sites , Consensus Sequence , DNA/genetics , DNA/metabolism , Genes , Humans , Mathematics , Mice , Models, Genetic , Models, Statistical , Molecular Sequence Data , Promoter Regions, Genetic , Saccharomyces cerevisiae/genetics , Sequence Homology, Nucleic Acid , Transcription Factors/chemistry , Transcription Factors/metabolism
SELECTION OF CITATIONS
SEARCH DETAIL
...