Pesquisa | Portal Regional da BVS (teste)

1.

ScalaBLAST 2.0: rapid and robust BLAST calculations on multiprocessor systems.

Oehmen, Christopher S; Baxter, Douglas J.

Bioinformatics ; 29(6): 797-8, 2013 Mar 15.

Artigo em Inglês | MEDLINE | ID: mdl-23361326

RESUMO

MOTIVATION: BLAST remains one of the most widely used tools in computational biology. The rate at which new sequence data is available continues to grow exponentially, driving the emergence of new fields of biological research. At the same time, multicore systems and conventional clusters are more accessible. ScalaBLAST has been designed to run on conventional multiprocessor systems with an eye to extreme parallelism, enabling parallel BLAST calculations using >16 000 processing cores with a portable, robust, fault-resilient design that introduces little to no overhead with respect to serial BLAST.

Assuntos

Alinhamento de Sequência/métodos , Software , Algoritmos , Biologia Computacional/métodos

2.

Inhibition of dengue virus infections in cell cultures and in AG129 mice by a small interfering RNA targeting a highly conserved sequence.

Stein, David A; Perry, Stuart T; Buck, Michael D; Oehmen, Christopher S; Fischer, Matthew A; Poore, Elizabeth; Smith, Jessica L; Lancaster, Alissa M; Hirsch, Alec J; Slifka, Mark K; Nelson, Jay A; Shresta, Sujan; Früh, Klaus.

J Virol ; 85(19): 10154-66, 2011 Oct.

Artigo em Inglês | MEDLINE | ID: mdl-21795337

RESUMO

The dengue viruses (DENVs) exist as numerous genetic strains that are grouped into four antigenically distinct serotypes. DENV strains from each serotype can cause severe disease and threaten public health in tropical and subtropical regions worldwide. No licensed antiviral agent to treat DENV infections is currently available, and there is an acute need for the development of novel therapeutics. We found that a synthetic small interfering RNA (siRNA) (DC-3) targeting the highly conserved 5' cyclization sequence (5'CS) region of the DENV genome reduced, by more than 100-fold, the titers of representative strains from each DENV serotype in vitro. To determine if DC-3 siRNA could inhibit DENV in vivo, an "in vivo-ready" version of DC-3 was synthesized and tested against DENV-2 by using a mouse model of antibody-dependent enhancement of infection (ADE)-induced disease. Compared with the rapid weight loss and 5-day average survival time of the control groups, mice receiving the DC-3 siRNA had an average survival time of 15 days and showed little weight loss for approximately 12 days. DC-3-treated mice also contained significantly less virus than control groups in several tissues at various time points postinfection. These results suggest that exogenously introduced siRNA combined with the endogenous RNA interference processing machinery has the capacity to prevent severe dengue disease. Overall, the data indicate that DC-3 siRNA represents a useful research reagent and has potential as a novel approach to therapeutic intervention against the genetically diverse dengue viruses.

Assuntos

Antivirais/administração & dosagem , Antivirais/farmacologia , Vírus da Dengue/efeitos dos fármacos , Dengue/tratamento farmacológico , RNA Interferente Pequeno/administração & dosagem , RNA Interferente Pequeno/farmacologia , Animais , Anticorpos Facilitadores , Produtos Biológicos/administração & dosagem , Produtos Biológicos/farmacologia , Peso Corporal , Técnicas de Cultura de Células , Chlorocebus aethiops , Sequência Conservada , Dengue/patologia , Dengue/virologia , Vírus da Dengue/genética , Modelos Animais de Doenças , Humanos , Camundongos , RNA Interferente Pequeno/genética , Doenças dos Roedores/tratamento farmacológico , Doenças dos Roedores/patologia , Doenças dos Roedores/virologia , Análise de Sobrevida

3.

A model of cyclic transcriptomic behavior in the cyanobacterium Cyanothece sp. ATCC 51142.

McDermott, Jason E; Oehmen, Christopher S; McCue, Lee Ann; Hill, Eric; Choi, Daniel M; Stöckel, Jana; Liberton, Michelle; Pakrasi, Himadri B; Sherman, Louis A.

Mol Biosyst ; 7(8): 2407-18, 2011 Aug.

Artigo em Inglês | MEDLINE | ID: mdl-21698331

RESUMO

Systems biology attempts to reconcile large amounts of disparate data with existing knowledge to provide models of functioning biological systems. The cyanobacterium Cyanothece sp. ATCC 51142 is an excellent candidate for such systems biology studies because: (i) it displays tight functional regulation between photosynthesis and nitrogen fixation; (ii) it has robust cyclic patterns at the genetic, protein and metabolomic levels; and (iii) it has potential applications for bioenergy production and carbon sequestration. We have represented the transcriptomic data from Cyanothece 51142 under diurnal light/dark cycles as a high-level functional abstraction and describe development of a predictive in silico model of diurnal and circadian behavior in terms of regulatory and metabolic processes in this organism. We show that incorporating network topology into the model improves performance in terms of our ability to explain the behavior of the system under new conditions. The model presented robustly describes transcriptomic behavior of Cyanothece 51142 under different cyclic and non-cyclic growth conditions, and represents a significant advance in the understanding of gene regulation in this important organism.

Assuntos

Cyanothece/genética , Modelos Genéticos , Transcrição Gênica , Linhagem Celular , Análise por Conglomerados , Simulação por Computador , Cyanothece/metabolismo , Bases de Dados Genéticas , Perfilação da Expressão Gênica , Regulação Bacteriana da Expressão Gênica , Redes Reguladoras de Genes , Nitrogenase/genética , Nitrogenase/metabolismo , Reprodutibilidade dos Testes , Ribulose-Bifosfato Carboxilase/genética , Ribulose-Bifosfato Carboxilase/metabolismo , Biologia de Sistemas/métodos

4.

A support vector machine model for the prediction of proteotypic peptides for accurate mass and time proteomics.

Webb-Robertson, Bobbie-Jo M; Cannon, William R; Oehmen, Christopher S; Shah, Anuj R; Gurumoorthi, Vidhya; Lipton, Mary S; Waters, Katrina M.

Bioinformatics ; 26(13): 1677-83, 2010 Jul 01.

Artigo em Inglês | MEDLINE | ID: mdl-20568665

RESUMO

MOTIVATION: The standard approach to identifying peptides based on accurate mass and elution time (AMT) compares profiles obtained from a high resolution mass spectrometer to a database of peptides previously identified from tandem mass spectrometry (MS/MS) studies. It would be advantageous, with respect to both accuracy and cost, to only search for those peptides that are detectable by MS (proteotypic). RESULTS: We present a support vector machine (SVM) model that uses a simple descriptor space based on 35 properties of amino acid content, charge, hydrophilicity and polarity for the quantitative prediction of proteotypic peptides. Using three independently derived AMT databases (Shewanella oneidensis, Salmonella typhimurium, Yersinia pestis) for training and validation within and across species, the SVM resulted in an average accuracy measure of approximately 0.83 with an SD of <0.038. Furthermore, we demonstrate that these results are achievable with a small set of 13 variables and can achieve high proteome coverage. AVAILABILITY: http://omics.pnl.gov/software/STEPP.php CONTACT: bj@pnl.gov SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Assuntos

Peptídeos/isolamento & purificação , Proteômica/métodos , Espectrometria de Massas , Peptídeos/química , Salmonella typhimurium/química , Shewanella/química , Yersinia pestis/química

5.

Physicochemical property distributions for accurate and rapid pairwise protein homology detection.

Webb-Robertson, Bobbie-Jo M; Ratuiste, Kyle G; Oehmen, Christopher S.

BMC Bioinformatics ; 11: 145, 2010 Mar 19.

Artigo em Inglês | MEDLINE | ID: mdl-20302613

RESUMO

BACKGROUND: The challenge of remote homology detection is that many evolutionarily related sequences have very little similarity at the amino acid level. Kernel-based discriminative methods, such as support vector machines (SVMs), that use vector representations of sequences derived from sequence properties have been shown to have superior accuracy when compared to traditional approaches for the task of remote homology detection. RESULTS: We introduce a new method for feature vector representation based on the physicochemical properties of the primary protein sequence. A distribution of physicochemical property scores are assembled from 4-mers of the sequence and normalized based on the null distribution of the property over all possible 4-mers. With this approach there is little computational cost associated with the transformation of the protein into feature space, and overall performance in terms of remote homology detection is comparable with current state-of-the-art methods. We demonstrate that the features can be used for the task of pairwise remote homology detection with improved accuracy versus sequence-based methods such as BLAST and other feature-based methods of similar computational cost. CONCLUSIONS: A protein feature method based on physicochemical properties is a viable approach for extracting features in a computationally inexpensive manner while retaining the sensitivity of SVM protein homology detection. Furthermore, identifying features that can be used for generic pairwise homology detection in lieu of family-based homology detection is important for applications such as large database searches and comparative genomics.

Assuntos

Biologia Computacional/métodos , Proteínas/química , Homologia de Sequência de Aminoácidos , Bases de Dados de Proteínas , Reconhecimento Automatizado de Padrão

6.

A feature vector integration approach for a generalized support vector machine pairwise homology algorithm.

Webb-Robertson, Bobbie-Jo M; Oehmen, Christopher S; Shah, Anuj R.

Comput Biol Chem ; 32(6): 458-61, 2008 Dec.

Artigo em Inglês | MEDLINE | ID: mdl-18722814

RESUMO

Due to the exponential growth of sequenced genomes, the need to quickly provide accurate annotation for existing and new sequences is paramount to facilitate biological research. Current sequence comparison approaches fail to detect homologous relationships when sequence similarity is low. Support vector machine (SVM) algorithms approach this problem by transforming all proteins into a feature space of equal dimension based on protein properties, such as sequence similarity scores against a basis set of proteins or motifs. This multivariate representation of the protein space is then used to build a classifier specific to a pre-defined protein family. However, this approach is not well suited to large-scale annotation. We have developed a SVM approach that formulates remote homology as a single classifier that answers the pairwise comparison problem by integrating the two feature vectors for a pair of sequences into a single vector representation that can be used to build a classifier that separates sequence pairs into homologs and non-homologs. This pairwise SVM approach significantly improves the task of remote homology detection on the benchmark dataset, quantified as the area under the receiver operating characteristic curve; 0.97 versus 0.73 and 0.70 for PSI-BLAST and Basic Local Alignment Search Tool (BLAST), respectively.

Assuntos

Algoritmos , Homologia de Sequência de Aminoácidos , Bases de Dados de Proteínas

7.

A support vector machine model for the prediction of proteotypic peptides for accurate mass and time proteomics.

Webb-Robertson, Bobbie-Jo M; Cannon, William R; Oehmen, Christopher S; Shah, Anuj R; Gurumoorthi, Vidhya; Lipton, Mary S; Waters, Katrina M.

Bioinformatics ; 24(13): 1503-9, 2008 Jul 01.

Artigo em Inglês | MEDLINE | ID: mdl-18453551

RESUMO

MOTIVATION: The standard approach to identifying peptides based on accurate mass and elution time (AMT) compares profiles obtained from a high resolution mass spectrometer to a database of peptides previously identified from tandem mass spectrometry (MS/MS) studies. It would be advantageous, with respect to both accuracy and cost, to only search for those peptides that are detectable by MS (proteotypic). RESULTS: We present a support vector machine (SVM) model that uses a simple descriptor space based on 35 properties of amino acid content, charge, hydrophilicity and polarity for the quantitative prediction of proteotypic peptides. Using three independently derived AMT databases (Shewanella oneidensis, Salmonella typhimurium, Yersinia pestis) for training and validation within and across species, the SVM resulted in an average accuracy measure of 0.8 with a SD of <0.025. Furthermore, we demonstrate that these results are achievable with a small set of 12 variables and can achieve high proteome coverage. AVAILABILITY: http://omics.pnl.gov/software/STEPP.php. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Assuntos

Algoritmos , Inteligência Artificial , Proteínas de Bactérias/química , Reconhecimento Automatizado de Padrão/métodos , Mapeamento de Peptídeos/métodos , Peptídeos/química , Proteoma/química , Simulação por Computador , Modelos Químicos , Proteômica/métodos , Reprodutibilidade dos Testes , Sensibilidade e Especificidade

8.

SVM-HUSTLE--an iterative semi-supervised machine learning approach for pairwise protein remote homology detection.

Shah, Anuj R; Oehmen, Christopher S; Webb-Robertson, Bobbie-Jo.

Bioinformatics ; 24(6): 783-90, 2008 Mar 15.

Artigo em Inglês | MEDLINE | ID: mdl-18245127

RESUMO

MOTIVATION: As the amount of biological sequence data continues to grow exponentially we face the increasing challenge of assigning function to this enormous molecular 'parts list'. The most popular approaches to this challenge make use of the simplifying assumption that similar functional molecules, or proteins, sometimes have similar composition, or sequence. However, these algorithms often fail to identify remote homologs (proteins with similar function but dissimilar sequence) which often are a significant fraction of the total homolog collection for a given sequence. We introduce a Support Vector Machine (SVM)-based tool to detect homology using semi-supervised iterative learning (SVM-HUSTLE) that identifies significantly more remote homologs than current state-of-the-art sequence or cluster-based methods. As opposed to building profiles or position specific scoring matrices, SVM-HUSTLE builds an SVM classifier for a query sequence by training on a collection of representative high-confidence training sets, recruits additional sequences and assigns a statistical measure of homology between a pair of sequences. SVM-HUSTLE combines principles of semi-supervised learning theory with statistical sampling to create many concurrent classifiers to iteratively detect and refine, on-the-fly, patterns indicating homology. RESULTS: When compared against existing methods for identifying protein homologs (BLAST, PSI-BLAST, COMPASS, PROF_SIM, RANKPROP and their variants) on two different benchmark datasets SVM-HUSTLE significantly outperforms each of the above methods using the most stringent ROC(1) statistic with P-values less than 1e-20. SVM-HUSTLE also yields results comparable to HHSearch but at a substantially reduced computational cost since we do not require the construction of HMMs. AVAILABILITY: The software executable to run SVM-HUSTLE can be downloaded from http://www.sysbio.org/sysbio/networkbio/svm_hustle

Assuntos

Algoritmos , Inteligência Artificial , Reconhecimento Automatizado de Padrão/métodos , Proteínas/química , Alinhamento de Sequência/métodos , Análise de Sequência de Proteína/métodos , Homologia de Sequência de Aminoácidos , Sequência de Aminoácidos , Dados de Sequência Molecular , Software

9.

PQuad--a visual analysis platform for proteomic data exploration of microbial organisms.

Webb-Robertson, Bobbie-Jo M; Peterson, Elena S; Singhal, Mudita; Klicker, Kyle R; Oehmen, Christopher S; Adkins, Joshua N; Havre, Susan L.

Bioinformatics ; 23(13): 1705-7, 2007 Jul 01.

Artigo em Inglês | MEDLINE | ID: mdl-17483503

RESUMO

UNLABELLED: The visual Platform for Proteomics Peptide and Protein data exploration (PQuad) is a multi-resolution environment that visually integrates genomic and proteomic data for prokaryotic systems, overlays categorical annotation and compares differential expression experiments. PQuad requires Java 1.5 and has been tested to run across different operating systems. AVAILABILITY: http://ncrr.pnl.gov/software.

Assuntos

Algoritmos , Fenômenos Fisiológicos Bacterianos , Gráficos por Computador , Perfilação da Expressão Gênica/métodos , Proteoma/fisiologia , Software , Interface Usuário-Computador , Integração de Sistemas

10.

Integrating subcellular location for improving machine learning models of remote homology detection in eukaryotic organisms.

Shah, Anuj R; Oehmen, Christopher S; Harper, Jill; Webb-Robertson, Bobbie-Jo M.

Comput Biol Chem ; 31(2): 138-42, 2007 Apr.

Artigo em Inglês | MEDLINE | ID: mdl-17416337

RESUMO

A significant challenge in homology detection is to identify sequences that share a common evolutionary ancestor, despite significant primary sequence divergence. Remote homologs will often have less than 30% sequence identity, yet still retain common structural and functional properties. We demonstrate a novel method for identifying remote homologs using a support vector machine (SVM) classifier trained by fusing sequence similarity scores and subcellular location prediction. SVMs have been shown to perform well in a variety of applications where binary classification of data is the goal. At the same time, data fusion methods have been shown to be highly effective in enhancing discriminative power of data. Combining these two approaches in the application SVM-SimLoc resulted in identification of significantly more remote homologs (p-value<0.006) than using either sequence similarity or subcellular location independently.

Assuntos

Inteligência Artificial , Biologia Computacional/métodos , Espaço Intracelular/metabolismo , Reconhecimento Automatizado de Padrão/métodos , Proteínas/química , Homologia Estrutural de Proteína , Algoritmos , Células Eucarióticas , Modelos Biológicos , Proteínas/metabolismo , Alinhamento de Sequência

11.

Comparison of probability and likelihood models for peptide identification from tandem mass spectrometry data.

Cannon, William R; Jarman, Kristin H; Webb-Robertson, Bobbie-Jo M; Baxter, Douglas J; Oehmen, Christopher S; Jarman, Kenneth D; Heredia-Langner, Alejandro; Auberry, Kenneth J; Anderson, Gordon A.

J Proteome Res ; 4(5): 1687-98, 2005.

Artigo em Inglês | MEDLINE | ID: mdl-16212422

RESUMO

We evaluate statistical models used in two-hypothesis tests for identifying peptides from tandem mass spectrometry data. The null hypothesis H(0), that a peptide matches a spectrum by chance, requires information on the probability of by-chance matches between peptide fragments and peaks in the spectrum. Likewise, the alternate hypothesis H(A), that the spectrum is due to a particular peptide, requires probabilities that the peptide fragments would indeed be observed if it was the causative agent. We compare models for these probabilities by determining the identification rates produced by the models using an independent data set. The initial models use different probabilities depending on fragment ion type, but uniform probabilities for each ion type across all of the labile bonds along the backbone. More sophisticated models for probabilities under both H(A) and H(0) are introduced that do not assume uniform probabilities for each ion type. In addition, the performance of these models using a standard likelihood model is compared to an information theory approach derived from the likelihood model. Also, a simple but effective model for incorporating peak intensities is described. Finally, a support-vector machine is used to discriminate between correct and incorrect identifications based on multiple characteristics of the scoring functions. The results are shown to reduce the misidentification rate significantly when compared to a benchmark cross-correlation based approach.

Assuntos

Proteoma , Proteômica/métodos , Bases de Dados de Proteínas , Deinococcus/metabolismo , Funções Verossimilhança , Espectrometria de Massas , Modelos Estatísticos , Peptídeos/química , Probabilidade , Curva ROC

12.

Mathematical model of the rapidly activating delayed rectifier potassium current I(Kr) in rabbit sinoatrial node.

Oehmen, Christopher S; Giles, Wayne R; Demir, Semahat S.

J Cardiovasc Electrophysiol ; 13(11): 1131-40, 2002 Nov.

Artigo em Inglês | MEDLINE | ID: mdl-12475105

RESUMO

INTRODUCTION: A rapidly activating delayed rectifier potassium current (I(Kr)) is known to have an important role in determining the properties of spontaneous pacing in enzymatically isolated rabbit sinoatrial node (SAN) cells. The functional characteristics of I(Kr) are conferred by its dependence on time, voltage, and external potassium. The aim of this study was to develop a rigorous mathematical representation for I(Kr) based on experimental findings and to investigate the role of I(Kr) in the automaticity and intercellular communication of SAN cells. METHODS AND RESULTS: A Markov model was developed using available experimental data for I(Kr) in rabbit SAN. The dependence of I(Kr) on external potassium, [K+]o, was incorporated using data from both in vitro preparations and results from heterologous expression experiments for this ether-a-go-go related gene product. Our simulation results show the following. (1) I(Kr) is the dominant repolarizing current in rabbit SAN cells. (2) Deactivation of I(Kr) contributes to the net current change during the early diastolic depolarization phase. (3) Inward rectification of I(Kr) results in a decrease in membrane resistance during repolarization relative to plateau. (4) The complex [K+]o dependence of I(Kr) confers [K+]o insensitivity on isolated cells, which may account for the sensitivity of pacing rate to elevated [K+]o at the tissue level. CONCLUSION: Model results show that I(Kr) mediates diastolic depolarization by the kinetics of its decay and by lowering resistance during late repolarization. In elevated [K+]o, increased chord conductance is balanced by the changes in kinetics and voltage dependence of I(Kr) so that the pacing rate of single cells may be more [K+]o insensitive than expected. In addition, elevated [K+]o increases I(Kr) magnitude during repolarization but lowers resistance, so current flow through gap junctions is less able to hyperpolarize pacing cells.

Assuntos

Modelos Cardiovasculares , Canais de Potássio de Abertura Dependente da Tensão da Membrana , Canais de Potássio/fisiologia , Nó Sinoatrial/metabolismo , Potenciais de Ação , Animais , Simulação por Computador , Canais de Potássio de Retificação Tardia , Diástole , Eletrofisiologia , Cadeias de Markov , Técnicas de Patch-Clamp , Coelhos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA