Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 15 de 15
Filter
Add more filters










Publication year range
1.
Nucleic Acids Res ; 36(Web Server issue): W465-9, 2008 Jul 01.
Article in English | MEDLINE | ID: mdl-18424797

ABSTRACT

Phylogenetic analyses are central to many research areas in biology and typically involve the identification of homologous sequences, their multiple alignment, the phylogenetic reconstruction and the graphical representation of the inferred tree. The Phylogeny.fr platform transparently chains programs to automatically perform these tasks. It is primarily designed for biologists with no experience in phylogeny, but can also meet the needs of specialists; the first ones will find up-to-date tools chained in a phylogeny pipeline to analyze their data in a simple and robust way, while the specialists will be able to easily build and run sophisticated analyses. Phylogeny.fr offers three main modes. The 'One Click' mode targets non-specialists and provides a ready-to-use pipeline chaining programs with recognized accuracy and speed: MUSCLE for multiple alignment, PhyML for tree building, and TreeDyn for tree rendering. All parameters are set up to suit most studies, and users only have to provide their input sequences to obtain a ready-to-print tree. The 'Advanced' mode uses the same pipeline but allows the parameters of each program to be customized by users. The 'A la Carte' mode offers more flexibility and sophistication, as users can build their own pipeline by selecting and setting up the required steps from a large choice of tools to suit their specific needs. Prior to phylogenetic analysis, users can also collect neighbors of a query sequence by running BLAST on general or specialized databases. A guide tree then helps to select neighbor sequences to be used as input for the phylogeny pipeline. Phylogeny.fr is available at: http://www.phylogeny.fr/


Subject(s)
Phylogeny , Software , Internet , Sequence Alignment , Sequence Analysis, DNA , Sequence Analysis, Protein
2.
Ann N Y Acad Sci ; 1063: 26-34, 2005 Dec.
Article in English | MEDLINE | ID: mdl-16481487

ABSTRACT

Rickettsia felis has been recently cultured in XTC2 cells. This allows production of enough bacteria to create a genomic bank and to sequence it. The chromosome of R. felis is longer than that of previously sequenced rickettsiae and it possess 2 plasmids. Microscopically, this bacterium exhibits two forms of pili: one resembles a conjugative pilus and another forms hair-like projections that may play a role in pathogenicity. R. felis also exhibits several copies of ankyrin-repeat genes and tetratricopeptide encoding gene that are specifically linked to pathogenic host-associated bacteria. It also contains toxin-antitoxin system encoding genes that are extremely rare in intracellular bacteria and may be linked to plasmid maintenance.


Subject(s)
DNA, Bacterial/analysis , Genome, Bacterial , Rickettsia felis/growth & development , Rickettsia felis/genetics , Sequence Analysis, DNA , Animals , Bacteriological Techniques , Humans , Rickettsia felis/pathogenicity , Rickettsia felis/ultrastructure
3.
Pharmacogenomics ; 2(4): 361-72, 2001 Nov.
Article in English | MEDLINE | ID: mdl-11722286

ABSTRACT

In the post-genomic era, the new discipline of functional genomics is now facing the challenge of associating a function (as well as estimating its relevance to industrial applications) to about 100,000 microbial, plant or animal genes of known sequence but unknown function. Besides the design of databases, computational methods are increasingly becoming intimately linked with the various experimental approaches. Consequently, bioinformatics is rapidly evolving into independent fields addressing the specific problems of interpreting i) genomic sequences, ii) protein sequences and 3D-structures, as well as iii) transcriptome and macromolecular interaction data. It is thus increasingly difficult for the biologist to choose the computational approaches that perform best in these various areas. This paper attempts to review the most useful developments of the last 2 years.


Subject(s)
Computational Biology/methods , Genomics/methods , Computational Biology/trends , Databases, Factual , Humans , Internet , Private Sector , Proteome , Sequence Analysis, Protein , Software Design
4.
Science ; 293(5537): 2093-8, 2001 Sep 14.
Article in English | MEDLINE | ID: mdl-11557893

ABSTRACT

Rickettsia conorii is an obligate intracellular bacterium that causes Mediterranean spotted fever in humans. We determined the 1,268,755-nucleotide complete genome sequence of R. conorii, containing 1374 open reading frames. This genome exhibits 804 of the 834 genes of the previously determined R. prowazekii genome plus 552 supplementary open reading frames and a 10-fold increase in the number of repetitive elements. Despite these differences, the two genomes exhibit a nearly perfect colinearity that allowed the clear identification of different stages of gene alterations with gene remnants and 37 genes split in 105 fragments, of which 59 are transcribed. A 38-kilobase sequence inversion was dated shortly after the divergence of the genus.


Subject(s)
Evolution, Molecular , Genome, Bacterial , Rickettsia conorii/genetics , Rickettsia prowazekii/genetics , Adaptation, Physiological , Chlamydia/genetics , Computational Biology , DNA, Bacterial/genetics , DNA, Intergenic , Gene Dosage , Gene Silencing , Gene Transfer, Horizontal , Genes, Bacterial , Open Reading Frames , Phylogeny , Polymerase Chain Reaction , Repetitive Sequences, Nucleic Acid , Rickettsia/genetics , Rickettsia conorii/physiology , Rickettsia prowazekii/physiology , Sequence Analysis, DNA , Transcription, Genetic
5.
Science ; 290(5490): 347-50, 2000 Oct 13.
Article in English | MEDLINE | ID: mdl-11030655

ABSTRACT

Rickettsia conorii, the aetiological agent of Mediterranean spotted fever, is an intracellular bacterium transmitted by ticks. Preliminary analyses of the nearly complete genome sequence of R. conorii have revealed 44 occurrences of a previously undescribed palindromic repeat (150 base pairs long) throughout the genome. Unexpectedly, this repeat was found inserted in-frame within 19 different R. conorii open reading frames likely to encode functional proteins. We found the same repeat in proteins of other Rickettsia species. The finding of a mobile element inserted in many unrelated genes suggests the potential role of selfish DNA in the creation of new protein sequences.


Subject(s)
Bacterial Proteins/genetics , DNA, Bacterial/genetics , Interspersed Repetitive Sequences , Open Reading Frames/genetics , RNA, Messenger/genetics , Rickettsia conorii/genetics , Rickettsia/genetics , Bacterial Proteins/chemistry , Base Sequence , Conserved Sequence , Evolution, Molecular , Genome, Bacterial , Molecular Sequence Data , Mutagenesis, Insertional , Nucleic Acid Conformation , Protein Biosynthesis , Protein Conformation , Protein Structure, Secondary , RNA, Bacterial/chemistry , RNA, Bacterial/genetics , RNA, Bacterial/metabolism , RNA, Messenger/chemistry , RNA, Messenger/metabolism
6.
Genome Res ; 9(10): 950-9, 1999 Oct.
Article in English | MEDLINE | ID: mdl-10523523

ABSTRACT

Large, publicly available collections of expressed sequence tags (ESTs) have been generated from Arabidopsis thaliana and rice (Oryza sativa). A potential, but relatively unexplored application of this data is in the study of plant gene expression. Other EST data, mainly from human and mouse, have been successfully used to point out genes exhibiting tissue- or disease-specific expression, as well as for identification of alternative transcripts. In this report, we go a step further in showing that computer analyses of plant EST data can be used to generate evidence of correlated expression patterns of genes across various tissues. Furthermore, tissue types and organs can be classified with respect to one another on the basis of their global gene expression patterns. As in previous studies, expression profiles are first estimated from EST counts. By clustering gene expression profiles or whole cDNA library profiles, we show that genes with similar functions, or cDNA libraries expected to share patterns of gene expression, are grouped together. Promising uses of this technique include functional genomics, in which evidence of correlated expression might complement (or substitute for) those of sequence similarity in the annotation of anonymous genes and identification of surrogate markers. The analysis presented here combines the application of a correlation-based clustering method with a graphical color map allowing intuitive visualization of patterns within a large table of expression measurements.


Subject(s)
Expressed Sequence Tags , Gene Expression Regulation, Plant , Oryza/genetics , Contig Mapping , DNA, Complementary/analysis , Databases, Factual , Models, Genetic , Models, Statistical , Multigene Family
7.
Proc Natl Acad Sci U S A ; 95(17): 10026-31, 1998 Aug 18.
Article in English | MEDLINE | ID: mdl-9707594

ABSTRACT

A new method for predicting protein-coding regions in microbial genomic DNA sequences is presented. It uses an ab initio iterative Markov modeling procedure to automatically perform the partition of genomic sequences into three subsets shown to correspond to coding, coding on the opposite strand, and noncoding segments. In contrast to current methods, such as GENEMARK [Borodovsky, M. & McIninch, J. D. (1993) Comput. Chem. 17, 123-133], no training set or prior knowledge of the statistical properties of the studied genome are required. This new method tolerates error rates of 1-2% and can process unassembled sequences. It is thus ideal for the analysis of genome survey and/or fragmented sequence data from uncharacterized microorganisms. The method was validated on 10 complete bacterial genomes (from four major phylogenetic lineages). The results show that protein-coding regions can be identified with an accuracy of up to 90% with a totally automated and objective procedure.


Subject(s)
Bacterial Proteins/genetics , Genetic Techniques , Genome, Bacterial , Algorithms , Bacteria/genetics , Base Composition , DNA, Bacterial/chemistry , DNA, Bacterial/genetics , Markov Chains , Models, Genetic , Reproducibility of Results
8.
Genome Res ; 8(5): 524-30, 1998 May.
Article in English | MEDLINE | ID: mdl-9582195

ABSTRACT

Alternate polyadenylation is an important post-transcriptional regulatory process now open to large-scale analysis by use of cDNA databases. We clustered 164,000 expressed sequence tags (ESTs) into approximately 15,000 groups and aligned each group to a putative mRNA 3' end. By use of stringent criteria to discard artifactual mRNA extremities, clear evidence for alternate polyadenylation was obtained in 189 of the 1000 EST clusters studied. A number of previously unreported polyadenylation sites were identified, together with possible instances of tissue-specific differential polyadenylation. This study demonstrates that, besides quantitative aspects of gene expression, the distribution of alternate mRNA forms can be analyzed through EST sampling.


Subject(s)
Computational Biology/methods , Databases, Factual , Gene Expression/genetics , Poly A/metabolism , RNA, Messenger/metabolism , Humans , Protein Biosynthesis , RNA, Messenger/chemistry , Regulatory Sequences, Nucleic Acid
10.
Nat Biotechnol ; 15(9): 855-8, 1997 Sep.
Article in English | MEDLINE | ID: mdl-9306399

ABSTRACT

The first successful recovery of ancient DNA, from quagga and human mummies inspired significant enough interest to open an entire field of research. Efforts from many research groups, often in a hunt for the oldest sequences, showed that ancient DNA was a poor substrate for the enzymes used in molecular biology; it is present in tiny amounts, hard to purify, and frequently damaged. These obstacles have been partially overcome by the use of drastic laboratory precautions and by the introduction of polymerase chain reaction and phylogenetic studies. Ancient DNA analysis now finds applications in many research domains.


Subject(s)
DNA/genetics , Paleontology , Animals , Archaeology , Communicable Diseases/genetics , DNA/chemistry , DNA, Bacterial/chemistry , DNA, Bacterial/genetics , DNA, Plant/chemistry , DNA, Plant/genetics , Genetic Diseases, Inborn/genetics , Humans , Polymerase Chain Reaction , Templates, Genetic
11.
Proteins ; 29(2): 252-7, 1997 Oct.
Article in English | MEDLINE | ID: mdl-9329089

ABSTRACT

SAmBA is a new software for the design of minimal experimental protocols using the notion of orthogonal arrays of strength 2. The main application of SAmBA is the search of protein crystallization conditions. Given a user input defining the relevant effectors/variables (e.g., pH, temperature, salts) and states (e.g., pH: 5, 6, 7 and 8), this software proposes an optimal set of experiments in which all tested variables and the pairwise interactions between them are symmetrically sampled. No a priori restrictions on the number and range of experimental variables is imposed. SAmBA consists of two complementary programs, SAm and BA, using a simulated annealing approach and a backtracking algorithm, respectively. The software is freely available as C code or as an interactive JAVA applet at http:/(/)igs-server.cnrs-mrs.fr.


Subject(s)
Proteins/chemistry , Software , Algorithms , Crystallization , Hydrogen-Ion Concentration , Temperature
12.
Genome Res ; 7(10): 986-95, 1997 Oct.
Article in English | MEDLINE | ID: mdl-9331369

ABSTRACT

Genes differentially expressed in different tissues, during development, or during specific pathologies are of foremost interest to both basic and pharmaceutical research. "Transcript profiles" or "digital Northerns" are generated routinely by partially sequencing thousands of randomly selected clones from relevant cDNA libraries. Differentially expressed genes can then be detected from variations in the counts of their cognate sequence tags. Here we present the first systematic study on the influence of random fluctuations and sampling size on the reliability of this kind of data. We establish a rigorous significance test and demonstrate its use on publicly available transcript profiles. The theory links the threshold of selection of putatively regulated genes (e.g., the number of pharmaceutical leads) to the fraction of false positive clones one is willing to risk. Our results delineate more precisely and extend the limits within which digital Northern data can be used.


Subject(s)
DNA, Complementary/genetics , Gene Expression , Models, Genetic , Models, Statistical , Actins/genetics , Bayes Theorem , Brain , Confidence Intervals , Dimethyl Sulfoxide/pharmacology , Humans , Liver , Pancreas , Probability , Sample Size , Tetradecanoylphorbol Acetate/pharmacology , Tumor Cells, Cultured/drug effects
13.
Comput Chem ; 21(4): 223-7, 1997.
Article in English | MEDLINE | ID: mdl-9440929

ABSTRACT

Eukaryotic promoters are among the most important functional domains yet to be characterized in a satisfactory manner in genomic sequences. Most current detection methods rely on the recognition of individual transcription elements using position-weight matrices (PWM) or consensus sequences. Here, we study a simple promoter detection algorithm based on Markov transition matrices built from sequences upward from proven transcription initiation sites. The performances have been evaluated on the training set and on a test set of promoter-containing sequences. The results on the training set are surprisingly good, given that the algorithm does not incorporate any specific knowledge about promoters. Yet, the program exhibits the pathological behaviour typical of all training set-based methods: a significant decline in performance when confronted with previously unseen sequences. Thus, the Markov algorithm, like the others presently available, does not truly capture the essence of eukaryotic promoters. A detection program based on a Markov model is likely to be blind to categories of promoters without close representatives in the training set.


Subject(s)
Markov Chains , Promoter Regions, Genetic , Animals , Base Sequence , Computer Simulation , Consensus Sequence , Eukaryotic Cells , Genetic Techniques , Models, Genetic , TATA Box , Transcription, Genetic
14.
Comput Appl Biosci ; 12(5): 431-9, 1996 Oct.
Article in English | MEDLINE | ID: mdl-8996792

ABSTRACT

MOTIVATION: To improve the detection of nucleotide sequence signals (e.g. promoter elements) by position-weight matrices (PWM) using the concept of statistically significant matches. RESULTS: The Mksite program was originally developed for analyzing protein sequences. We report NMksite, a new version adapted to the processing of nucleotide sequences. NMksite creates PWM from nucleotide sequence block alignments or occurrence tables using three weight computation schemes. An original feature of NMksite is the numerical computation of the statistical significance of PWM matches. The utility of this concept is demonstrated in the context of the prediction of splice sites and promoter regions.


Subject(s)
Mathematical Computing , Promoter Regions, Genetic , Software , Base Sequence , False Positive Reactions , Pattern Recognition, Automated , Sequence Alignment , TATA Box , Transcription, Genetic
15.
Comput Appl Biosci ; 11(5): 489-95, 1995 Oct.
Article in English | MEDLINE | ID: mdl-8590171

ABSTRACT

We describe a new technique for the automatic detection and characterization, in terms of location, size and intensity, of positive hybridization spots on hybridization filter images. We will also discuss how this information can be used to reconstruct the spot deposition pattern on the filter and thus identify the spots. The part of the algorithm dedicated to the detection of spots is derived from a technique used in astronomy for the automatic recognition of galaxies on photographic plates. This technique has been modified so that it becomes able to cope with the strongly varying background typical of hybridization filter images. The technique does not require any human intervention.


Subject(s)
Genetic Techniques , Image Processing, Computer-Assisted/methods , Nucleic Acid Hybridization , Software , Algorithms , DNA/genetics , DNA/isolation & purification , DNA Probes
SELECTION OF CITATIONS
SEARCH DETAIL
...