Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 36
Filter
Add more filters










Publication year range
1.
J Mol Graph Model ; 98: 107610, 2020 07.
Article in English | MEDLINE | ID: mdl-32302938

ABSTRACT

Hepatitis B, one of the world's most common liver infections, is caused by the Hepatitis B Virus (HBV). Via the infected cells, this virus generates non pathogen particles with similar surface structures as those found in the full virus. These particles are used in a recombinant form (HBsAg) to produce efficient vaccines. The atomic structure of the HBsAg particles is currently unsolved, and the only existing structural data for the full particle were obtained by electronic microscopy with a maximum resolution of 12 Å. As many vaccines, HBsAg is a complex bio-system. This complexity results from numerous sources of heterogeneity, and traditional bio-immuno-chemistry analytic tools are often limited in their ability to fully describe the molecular surface or the particle. For the Hepatitis B vaccine particle (HBsAg), no atomic data are available so far. In this study, we used the principal well-known elements of HBsAg structure to reconstitute and model the full HBsAg particle assembly at a molecular level (protein assembly, particle formation and maturation). Full HBsAg particle atomic models were built based on an exhaustive experimental data review, amino acid sequence analysis, iterative threading modeling, and molecular dynamic approaches.


Subject(s)
Hepatitis B Surface Antigens , Hepatitis B Vaccines , Hepatitis B virus
2.
Bioinformatics ; 33(24): 3991-3992, 2017 Dec 15.
Article in English | MEDLINE | ID: mdl-28961944

ABSTRACT

MOTIVATION: ALIGNSEC is a module within ANTHEPROT designed for the interactive display, edition and printing of large-scale multiple alignments integrating secondary structure predictions. AVAILABILITY AND IMPLEMENTATION: The ALIGNSEC module is part of the ANTHEPROT package (http://antheprot-pbil.ibcp.fr) which can be used freely for academic users. It is running on Windows Operating systems. For commercial use, please contact the author. CONTACT: gilbert.deleage@ibcp.fr.


Subject(s)
Protein Structure, Secondary , Sequence Alignment/methods , Software , Computer Simulation
3.
J Biol Chem ; 291(32): 16553-66, 2016 08 05.
Article in English | MEDLINE | ID: mdl-27311711

ABSTRACT

Thyroid hormones modulate not only multiple functions in vertebrates (energy metabolism, central nervous system function, seasonal changes in physiology, and behavior) but also in some non-vertebrates where they control critical post-embryonic developmental transitions such as metamorphosis. Despite their obvious biological importance, the thyroid hormone precursor protein, thyroglobulin (Tg), has been experimentally investigated only in mammals. This may bias our view of how thyroid hormones are produced in other organisms. In this study we searched genomic databases and found Tg orthologs in all vertebrates including the sea lamprey (Petromyzon marinus). We cloned a full-size Tg coding sequence from western clawed frog (Xenopus tropicalis) and zebrafish (Danio rerio). Comparisons between the representative mammal, amphibian, teleost fish, and basal vertebrate indicate that all of the different domains of Tg, as well as Tg regional structure, are conserved throughout the vertebrates. Indeed, in Xenopus, zebrafish, and lamprey Tgs, key residues, including the hormonogenic tyrosines and the disulfide bond-forming cysteines critical for Tg function, are well conserved despite overall divergence of amino acid sequences. We uncovered upstream sequences that include start codons of zebrafish and Xenopus Tgs and experimentally proved that these are full-length secreted proteins, which are specifically recognized by antibodies against rat Tg. By contrast, we have not been able to find any orthologs of Tg among non-vertebrate species. Thus, Tg appears to be a novel protein elaborated as a single event at the base of vertebrates and virtually unchanged thereafter.


Subject(s)
Evolution, Molecular , Lampreys/genetics , Thyroglobulin/genetics , Xenopus Proteins/genetics , Zebrafish Proteins/genetics , Zebrafish/genetics , Animals , Rats , Xenopus
4.
Database (Oxford) ; 2014: bau013, 2014.
Article in English | MEDLINE | ID: mdl-24608034

ABSTRACT

BCL2DB (http://bcl2db.ibcp.fr) is a database designed to integrate data on BCL-2 family members and BH3-only proteins. These proteins control the mitochondrial apoptotic pathway and probably many other cellular processes as well. This large protein group is formed by a family of pro-apoptotic and anti-apoptotic homologs that have phylogenetic relationships with BCL-2, and by a collection of evolutionarily and structurally unrelated proteins characterized by the presence of a region of local sequence similarity with BCL-2, termed the BH3 motif. BCL2DB is monthly built, thanks to an automated procedure relying on a set of homemade profile HMMs computed from seed reference sequences representative of the various BCL-2 homologs and BH3-only proteins. The BCL2DB entries integrate data from the Ensembl, Ensembl Genomes, European Nucleotide Archive and Protein Data Bank databases and are enriched with specific information like protein classification into orthology groups and distribution of BH motifs along the sequences. The Web interface allows for easy browsing of the site and fast access to data, as well as sequence analysis with generic and specific tools. BCL2DB provides a helpful and powerful tool to both 'BCL-2-ologists' and researchers working in the various fields of physiopathology. Database URL: http://bcl2db.ibcp.fr.


Subject(s)
Databases, Protein , Multigene Family , Proto-Oncogene Proteins c-bcl-2/metabolism , Amino Acid Sequence , Animals , Humans , Internet , Molecular Sequence Annotation , Molecular Sequence Data , Proto-Oncogene Proteins c-bcl-2/chemistry , User-Computer Interface
5.
J Virol ; 88(1): 574-82, 2014 Jan.
Article in English | MEDLINE | ID: mdl-24173223

ABSTRACT

Last-generation nucleoside/nucleotide analogues are potent against hepatitis B virus (HBV) and have a high barrier to resistance. However, delayed responses have been observed in patients previously exposed to other drugs of the same class, long-term resistance is possible, and cure of infection cannot be achieved with these therapies, emphasizing the need for alternative therapeutic approaches. The HBV RNase H represents an interesting target because its enzyme activity is essential to the HBV life cycle. The goal of our study was to characterize the structure of the HBV RNase H by computing a 3-dimensional molecular model derived from E. coli RNase H and analyzing 2,326 sequences of all HBV genotypes available in public databases and 958,000 sequences generated by means of ultradeep pyrosequencing of sequences from a homogenous population of 73 treatment-naive patients infected with HBV genotype D. Our data revealed that (i) the putative 4th catalytic residue displays unexpected variability that could be explained by the overlap of the HBx gene and has no apparent impact on HBV replicative capacity and that (ii) the C-helix-containing basic protrusion, which is required to guide the RNA/DNA heteroduplex into the catalytic site, is highly conserved and bears unique structural properties that can be used to target HBV-specific RNase H inhibitors without cross-species activity. The model shows substantial differences from other known RNases H and paves the way for functional and structural studies as a prerequisite to the development of new inhibitors of the HBV cell cycle specifically targeting RNase H activity.


Subject(s)
Hepatitis B virus/enzymology , High-Throughput Nucleotide Sequencing , Ribonuclease H/genetics , Amino Acid Sequence , Antiviral Agents/pharmacology , Genotype , Hepatitis B virus/drug effects , Hepatitis B virus/genetics , Models, Molecular , Molecular Sequence Data , Ribonuclease H/chemistry , Sequence Homology, Amino Acid
6.
Nucleic Acids Res ; 41(Database issue): D566-70, 2013 Jan.
Article in English | MEDLINE | ID: mdl-23125365

ABSTRACT

We have developed a specialized database, HBVdb (http://hbvdb.ibcp.fr), allowing the researchers to investigate the genetic variability of Hepatitis B Virus (HBV) and viral resistance to treatment. HBV is a major health problem worldwide with more than 350 million individuals being chronically infected. HBV is an enveloped DNA virus that replicates by reverse transcription of an RNA intermediate. HBV genome is optimized, being circular and encoding four overlapping reading frames. Indeed, each nucleotide of the genome takes part in the coding of at least one protein. However, HBV shows some genome variability leading to at least eight different genotypes and recombinant forms. The main drugs used to treat infected patients are nucleos(t)ides analogs (reverse transcriptase inhibitors). Unfortunately, HBV mutants resistant to these drugs may be selected and be responsible for treatment failure. HBVdb contains a collection of computer-annotated sequences based on manually annotated reference genomes. The database can be accessed through a web interface that allows static and dynamic queries and offers integrated generic sequence analysis tools and specialized analysis tools (e.g. annotation, genotyping, drug resistance profiling).


Subject(s)
Databases, Genetic , Hepatitis B virus/genetics , Drug Resistance, Viral/genetics , Genetic Variation , Genome, Viral , Genotyping Techniques , Hepatitis B virus/drug effects , Internet , Molecular Sequence Annotation , User-Computer Interface , Viral Proteins/genetics
7.
Nucleic Acids Res ; 40(Database issue): D321-4, 2012 Jan.
Article in English | MEDLINE | ID: mdl-22080550

ABSTRACT

Bacterial tyrosine-kinases share no resemblance with their eukaryotic counterparts and they have been unified in a new protein family named BY-kinases. These enzymes have been shown to control several biological functions in the bacterial cells. In recent years biochemical studies, sequence analyses and structure resolutions allowed the deciphering of a common signature. However, BY-kinase sequence annotations in primary databases remain incomplete. This prompted us to develop a specialized database of computer-annotated BY-kinase sequences: the Bacterial protein tyrosine-kinase database (BYKdb). BY-kinase sequences are first identified, thanks to a workflow developed in a previous work. A second workflow annotates the UniProtKB entries in order to provide the BYKdb entries. The database can be accessed through a web interface that allows static and dynamic queries and offers integrated sequence analysis tools. BYKdb can be found at http://bykdb.ibcp.fr.


Subject(s)
Bacteria/enzymology , Bacterial Proteins/chemistry , Databases, Protein , Protein-Tyrosine Kinases/chemistry , Molecular Sequence Annotation , Sequence Analysis, Protein , User-Computer Interface
8.
Hum Mutat ; 31(2): 127-35, 2010 Feb.
Article in English | MEDLINE | ID: mdl-19921752

ABSTRACT

Understanding how genetic alterations affect gene products at the molecular level represents a first step in the elucidation of the complex relationships between genotypic and phenotypic variations, and is thus a major challenge in the postgenomic era. Here, we present SM2PH-db (http://decrypthon.igbmc.fr/sm2ph), a new database designed to investigate structural and functional impacts of missense mutations and their phenotypic effects in the context of human genetic diseases. A wealth of up-to-date interconnected information is provided for each of the 2,249 disease-related entry proteins (August 2009), including data retrieved from biological databases and data generated from a Sequence-Structure-Evolution Inference in Systems-based approach, such as multiple alignments, three-dimensional structural models, and multidimensional (physicochemical, functional, structural, and evolutionary) characterizations of mutations. SM2PH-db provides a robust infrastructure associated with interactive analysis tools supporting in-depth study and interpretation of the molecular consequences of mutations, with the more long-term goal of elucidating the chain of events leading from a molecular defect to its pathology. The entire content of SM2PH-db is regularly and automatically updated thanks to a computational grid data federation facilities provided in the context of the Decrypthon program.


Subject(s)
Databases, Protein , Genetic Diseases, Inborn/genetics , Mutation, Missense/genetics , Software , Humans , Internet , Phenotype , Proteins , User-Computer Interface
9.
J Appl Crystallogr ; 42(Pt 3): 376-384, 2009 Jun 01.
Article in English | MEDLINE | ID: mdl-22477769

ABSTRACT

Structural biology, homology modelling and rational drug design require accurate three-dimensional macromolecular coordinates. However, the coordinates in the Protein Data Bank (PDB) have not all been obtained using the latest experimental and computational methods. In this study a method is presented for automated re-refinement of existing structure models in the PDB. A large-scale benchmark with 16 807 PDB entries showed that they can be improved in terms of fit to the deposited experimental X-ray data as well as in terms of geometric quality. The re-refinement protocol uses TLS models to describe concerted atom movement. The resulting structure models are made available through the PDB_REDO databank (http://www.cmbi.ru.nl/pdb_redo/). Grid computing techniques were used to overcome the computational requirements of this endeavour.

10.
Bioinformatics ; 24(23): 2782-3, 2008 Dec 01.
Article in English | MEDLINE | ID: mdl-18826958

ABSTRACT

MOTIVATION: The technique of chemical cross-linking followed by mass spectrometry has proven to bring valuable information about the protein structure and interactions between proteic subunits. It is an effective and efficient way to experimentally investigate some aspects of a protein structure when NMR and X-ray crystallography data are lacking. RESULTS: We introduce MSX-3D, a tool specifically geared to validate protein models using mass spectrometry. In addition to classical peptides identifications, it allows an interactive 3D visualization of the distance constraints derived from a cross-linking experiment. AVAILABILITY: Freely available at http://proteomics-pbil.ibcp.fr


Subject(s)
Mass Spectrometry/methods , Protein Conformation , Software , Computer Simulation , Databases, Protein , Models, Molecular , Proteins/chemistry , Proteomics/methods
11.
Bioinformatics ; 24(21): 2427-30, 2008 Nov 01.
Article in English | MEDLINE | ID: mdl-18772155

ABSTRACT

MOTIVATION: Most of the protein tyrosine kinases found in bacteria have been recently classified in a new family, termed BY-kinase. Indeed, they share no sequence homology with their eukaryotic counterparts and have no known eukaryotic homologues. They are involved in several biological functions (e.g. capsule biosynthesis, antibiotic resistance, virulence mechanism). Thus, they can be considered interesting therapeutic targets to develop new drugs to treat infectious diseases. However, their identification is rendered difficult due to slow progress in their structural characterization and comes most often from biochemical experiments. Moreover BY-kinase sequences are related to many other bacterial proteins involved in several biological functions (e.g. ParA family proteins). Accordingly, their annotations in generalist databases, sequence analysis and classification remain partial and inhomogeneous and there is no bioinformatics resource dedicated to these proteins. RESULTS: The combination of similarity search with sequence-profile alignment, pattern matching and sliding window computation to detect the tyrosine cluster was used to identify BY-kinase sequences in UniProt Knowledgebase. Cross-validations with keywords searches, pattern matching with several patterns and checking of motifs conservation in multiple sequence alignments were performed. Our pipeline identified 640 sequences as BY-kinases and allowed the definition of a PROSITE pattern that is the signature of the BY-kinases. The sequences identified by our pipeline as BY-kinases share a good sequence similarity with BY-kinases that have already been biochemically characterized, and they all bear the characteristic motifs of the catalytic domain, including the three Walker-like motifs followed by a tyrosine cluster. AVAILABILITY: http://bykdb.ibcp.fr


Subject(s)
Bacterial Proteins/chemistry , Protein-Tyrosine Kinases/chemistry , Amino Acid Motifs , Amino Acid Sequence , Computational Biology , Databases, Protein , Molecular Sequence Data , Proteins/chemistry , Sequence Alignment
12.
BMC Bioinformatics ; 8: 62, 2007 Feb 23.
Article in English | MEDLINE | ID: mdl-17319945

ABSTRACT

BACKGROUND: The post-genomic era is characterised by a torrent of biological information flooding the public databases. As a direct consequence, similarity searches starting with a single query sequence frequently lead to the identification of hundreds, or even thousands of potential homologues. The huge volume of data renders the subsequent structural, functional and evolutionary analyses very difficult. It is therefore essential to develop new strategies for efficient sampling of this large sequence space, in order to reduce the number of sequences to be processed. At the same time, it is important to retain the most pertinent sequences for structural and functional studies. RESULTS: An exhaustive analysis on a large scale test set (284 protein families) was performed to compare the efficiency of four different sampling methods aimed at selecting the most pertinent sequences. These four methods sample the proteins detected by BlastP searches and can be divided into two categories: two customisable methods where the user defines either the maximal number or the percentage of sequences to be selected; two automatic methods in which the number of sequences selected is determined by the program. We focused our analysis on the potential information content of the sampled sets of sequences using multiple alignment of complete sequences as the main validation tool. The study considered two criteria: the total number of sequences in BlastP and their associated E-values. The subsequent analyses investigated the influence of the sampling methods on the E-value distributions, the sequence coverage, the final multiple alignment quality and the active site characterisation at various residue conservation thresholds as a function of these criteria. CONCLUSION: The comparative analysis of the four sampling methods allows us to propose a suitable sampling strategy that significantly reduces the number of homologous sequences required for alignment, while at the same time maintaining the relevant information concerning the active site residues.


Subject(s)
Algorithms , Databases, Protein , Information Storage and Retrieval/methods , Proteins/chemistry , Proteins/metabolism , Sequence Alignment/methods , Sequence Analysis, Protein/methods , Amino Acid Sequence , Conserved Sequence , Database Management Systems , Molecular Sequence Data , Sequence Homology, Amino Acid , Structure-Activity Relationship
13.
Nucleic Acids Res ; 35(Database issue): D363-6, 2007 Jan.
Article in English | MEDLINE | ID: mdl-17142229

ABSTRACT

The hepatitis C virus (HCV) genome shows remarkable sequence variability, leading to the classification of at least six major genotypes, numerous subtypes and a myriad of quasispecies within a given host. A database allowing researchers to investigate the genetic and structural variability of all available HCV sequences is an essential tool for studies on the molecular virology and pathogenesis of hepatitis C as well as drug design and vaccine development. We describe here the European Hepatitis C Virus Database (euHCVdb, http://euhcvdb.ibcp.fr), a collection of computer-annotated sequences based on reference genomes. The annotations include genome mapping of sequences, use of recommended nomenclature, subtyping as well as three-dimensional (3D) molecular models of proteins. A WWW interface has been developed to facilitate database searches and the export of data for sequence and structure analyses. As part of an international collaborative effort with the US and Japanese databases, the European HCV Database (euHCVdb) is mainly dedicated to HCV protein sequences, 3D structures and functional analyses.


Subject(s)
Databases, Protein , Hepacivirus/genetics , Viral Proteins/chemistry , Viral Proteins/genetics , Genome, Viral , Genomics , Internet , Models, Molecular , Protein Conformation , Sequence Analysis, Protein , User-Computer Interface
14.
J Virol ; 81(6): 2745-57, 2007 Mar.
Article in English | MEDLINE | ID: mdl-17192310

ABSTRACT

Nonstructural protein 5A (NS5A) is a membrane-associated essential component of the hepatitis C virus (HCV) replication complex. An N-terminal amphipathic alpha helix mediates in-plane membrane association of HCV NS5A and at the same time is likely involved in specific protein-protein interactions required for the assembly of a functional replication complex. The aim of this study was to identify the determinants for membrane association of NS5A from the related GB viruses and pestiviruses. Although primary amino acid sequences differed considerably, putative membrane anchor domains with amphipathic features were predicted in the N-terminal domains of NS5A proteins from these viruses. Confocal laser scanning microscopy, as well as membrane flotation analyses, demonstrated that NS5As from GB virus B (GBV-B), GBV-C, and bovine viral diarrhea virus, the prototype pestivirus, display membrane association characteristics very similar to those of HCV NS5A. The N-terminal 27 to 33 amino acid residues of these NS5A proteins were sufficient for membrane association. Circular dichroism analyses confirmed the capacity of these segments to fold into alpha helices upon association with lipid-like molecules. Despite structural conservation, only very limited exchanges with sequences from related viruses were tolerated in the context of functional HCV RNA replication, suggesting virus-specific interactions of these segments. In conclusion, membrane association of NS5A by an N-terminal amphipathic alpha helix is a feature shared by HCV and related members of the family Flaviviridae. This observation points to conserved roles of the N-terminal amphipathic alpha helices of NS5A in replication complex formation.


Subject(s)
Cell Membrane/metabolism , Diarrhea Viruses, Bovine Viral/chemistry , GB virus A/chemistry , GB virus B/chemistry , Viral Nonstructural Proteins/chemistry , Amino Acid Sequence , Amino Acids/chemistry , Animals , Cattle , Cell Line, Tumor , Circular Dichroism , Conserved Sequence , Electroporation , Humans , Molecular Sequence Data , Osteosarcoma/pathology , Peptides/chemistry , Protein Biosynthesis , Protein Structure, Secondary , Protein Structure, Tertiary , Tetracycline/pharmacology , Transfection
16.
Mol Biol Evol ; 23(12): 2288-302, 2006 Dec.
Article in English | MEDLINE | ID: mdl-16945979

ABSTRACT

Collagens are thought to represent one of the most important molecular innovations in the metazoan line. Basement membrane type IV collagen is present in all Eumetazoa and was found in Homoscleromorpha, a sponge group with a well-organized epithelium, which may represent the first stage of tissue differentiation during animal evolution. In contrast, spongin seems to be a demosponge-specific collagenous protein, which can totally substitute an inorganic skeleton, such as in the well-known bath sponge. In the freshwater sponge Ephydatia mülleri, we previously characterized a family of short-chain collagens that are likely to be main components of spongins. Using a combination of sequence- and structure-based methods, we present evidence of remote homology between the carboxyl-terminal noncollagenous NC1 domain of spongin short-chain collagens and type IV collagen. Unexpectedly, spongin short-chain collagen-related proteins were retrieved in nonsponge animals, suggesting that a family related to spongin constitutes an evolutionary sister to the type IV collagen family. Formation of the ancestral NC1 domain and divergence of the spongin short-chain collagen-related and type IV collagen families may have occurred before the parazoan-eumetazoan split, the earliest divergence among extant animal phyla. Molecular phylogenetics based on NC1 domain sequences suggest distinct evolutionary histories for spongin short-chain collagen-related and type IV collagen families that include spongin short-chain collagen-related gene loss in the ancestors of Ecdyzosoa and of vertebrates. The fact that a majority of invertebrates encodes spongin short-chain collagen-related proteins raises the important question to the possible function of its members. Considering the importance of collagens for animal structure and substratum attachment, both families may have played crucial roles in animal diversification.


Subject(s)
Collagen Type IV/genetics , Evolution, Molecular , Extracellular Matrix/genetics , Non-Fibrillar Collagens/genetics , Porifera/genetics , Amino Acid Sequence , Animals , Collagen Type IV/chemistry , Invertebrates/genetics , Membrane Proteins/genetics , Models, Biological , Models, Molecular , Molecular Sequence Data , Multigene Family/genetics , Non-Fibrillar Collagens/chemistry , Phylogeny , Protein Structure, Secondary , Protein Structure, Tertiary , Sequence Homology, Amino Acid
17.
Stud Health Technol Inform ; 120: 142-52, 2006.
Article in English | MEDLINE | ID: mdl-16823131

ABSTRACT

Biological data are most times published and then become public ones. They, then, do not need to be isolated or encrypted. But, in some cases, these data stemed from patients or are analyzed with, for instance, pharmaceutical or agronomics goals. Also in simple ways , these data, before to become public, have to be kept confidential while researchers haven't been able to publish their work or to register them. So they are a lot of cases where the integrity and the confidentiality of biological data have to be protected against unauthorized accesses. But, as these private data are also large datasets, they need high-throughput computing and huge data storage to processed, such as ones produced by complete genome projects. These requirements are enhanced in the context of a Grid such EGEE, where the computing and storage resources are distributed across a large-scale platform. We have developed a secured distributed service to manage biological data on grid: the EncFile encrypted files management system. We have deployed it on the production platform of the EGEE grid project. Thus we provided grid users with a user-friendly component that doesn't require any user privileges. And we have integrated into a bioinformatics grid portal associated to encrypted representative biological resources: world-famous databases and programs.


Subject(s)
Computational Biology , Computer Security , Databases as Topic/organization & administration , Europe , Information Dissemination , Programming Languages , Software
18.
Stud Health Technol Inform ; 120: 187-93, 2006.
Article in English | MEDLINE | ID: mdl-16823137

ABSTRACT

Bioinformatics analysis of data produced by complete genome sequencing projects is one of the major challenges of the current years. Integrating up-to-date databanks and relevant algorithms is a clear requirement of such analysis. Grid computing would be a viable solution to distribute data, algorithms, computing and storage resources for Genomics. Providing bioinformaticians with a good interface to grid infrastructure, such as the one provided by the EGEE European project, is also a challenge to take up. The GPS@ web portal, "Grid Protein Sequence Analysis", aims to provide such a user-friendly interface for these grid genomic resources on the EGEE grid.


Subject(s)
Computational Biology , Internet , Europe , Genomics , User-Computer Interface
19.
BMC Bioinformatics ; 7: 255, 2006 May 16.
Article in English | MEDLINE | ID: mdl-16704727

ABSTRACT

BACKGROUND: Membrane proteins are estimated to represent about 25% of open reading frames in fully sequenced genomes. However, the experimental study of proteins remains difficult. Considerable efforts have thus been made to develop prediction methods. Most of these were conceived to detect transmembrane helices in polytopic proteins. Alternatively, a membrane protein can be monotopic and anchored via an amphipathic helix inserted in a parallel way to the membrane interface, so-called in-plane membrane (IPM) anchors. This type of membrane anchor is still poorly understood and no suitable prediction method is currently available. RESULTS: We report here the "AmphipaSeeK" method developed to predict IPM anchors. It uses a set of 21 reported examples of IPM anchored proteins. The method is based on a pattern recognition Support Vector Machine with a dedicated kernel. CONCLUSION: AmphipaSeeK was shown to be highly specific, in contrast with classically used methods (e.g. hydrophobic moment). Additionally, it has been able to retrieve IPM anchors in naively tested sets of transmembrane proteins (e.g. PagP). AmphipaSeek and the list of the 21 IPM anchored proteins is available on NPS@, our protein sequence analysis server.


Subject(s)
Computational Biology/methods , Membrane Proteins/metabolism , Pattern Recognition, Automated/methods , Algorithms , Databases, Protein , Membrane Proteins/chemistry , Membrane Proteins/genetics , Protein Structure, Secondary , Reproducibility of Results , Sequence Alignment/methods , Sequence Analysis, Protein/methods
20.
Hepatology ; 43(5): 1157-65, 2006 May.
Article in English | MEDLINE | ID: mdl-16628639

ABSTRACT

Part of the effort to develop hepatitis C-specific drugs a nd vaccines is the study of genetic variability of allpublicly available HCV sequences. Three HCV databases are currently available to aid this effort and to provide additional insight into the basic biology, immunology, and evolution of the virus. The Japanese HCV database (http://s2as02.genes.nig.ac.jp) gives access to a genomic mapping of sequences as well as their phylogenetic relationships. The European HCV database (http://euhcvdb.ibcp.fr) offers access to a computer-annotated set of sequences and molecular models of HCV proteins and focuses on protein sequence, structure and function analysis. The HCV database at the Los Alamos National Laboratory in the United States (http://hcv.lanl.gov) provides access to a manually annotated sequence database and a database of immunological epitopes which contains concise descriptions of experimental results. In this paper, we briefly describe each of these databases and their associated websites and tools, and give some examples of their use in furthering HCV research.


Subject(s)
Biomedical Research , Databases, Factual , Hepacivirus , Genomics , Hepacivirus/genetics , Hepacivirus/immunology , Models, Molecular , Phylogeny
SELECTION OF CITATIONS
SEARCH DETAIL
...