Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 6 de 6
Filter
Add more filters










Database
Language
Publication year range
1.
Genome Biol ; 22(1): 38, 2021 01 19.
Article in English | MEDLINE | ID: mdl-33468205

ABSTRACT

We develop a general computational approach for improving the accuracy of basecalling with Oxford Nanopore's 1D2 and related sequencing protocols. Our software PoreOver ( https://github.com/jordisr/poreover ) finds the consensus of two neural networks by aligning their probability profiles, and is compatible with multiple nanopore basecallers. When applied to the recently-released Bonito basecaller, our method reduces the median sequencing error by more than half.


Subject(s)
Consensus , Nanopore Sequencing , Neural Networks, Computer , Humans , Nanopores , Software
2.
Bioinformatics ; 37(1): 29-35, 2021 04 09.
Article in English | MEDLINE | ID: mdl-32683444

ABSTRACT

MOTIVATION: Many software libraries for using Hidden Markov Models in bioinformatics focus on inference tasks, such as likelihood calculation, parameter-fitting and alignment. However, construction of the state machines can be a laborious task, automation of which would be time-saving and less error-prone. RESULTS: We present Machine Boss, a software tool implementing not just inference and parameter-fitting algorithms, but also a set of operations for manipulating and combining automata. The aim is to make prototyping of bioinformatics HMMs as quick and easy as the construction of regular expressions, with one-line 'recipes' for many common applications. We report data from several illustrative examples involving protein-to-DNA alignment, DNA data storage and nanopore sequence analysis. AVAILABILITY AND IMPLEMENTATION: Machine Boss is released under the BSD-3 open source license and is available from http://machineboss.org/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
Computational Biology , Software , Algorithms , Information Storage and Retrieval , Sequence Analysis
3.
Biophys J ; 105(5): 1192-8, 2013 Sep 03.
Article in English | MEDLINE | ID: mdl-24010662

ABSTRACT

Structural conversion of the presynaptic, intrinsically disordered protein α-synuclein into amyloid fibrils underlies neurotoxicity in Parkinson's disease. The detailed mechanism by which this conversion occurs is largely unknown. Here, we identify a discrete pattern of transient tertiary interactions in monomeric α-synuclein involving amino acid residues that are, in the fibrillar state, part of ß-strands. Importantly, this pattern of pairwise interactions does not correspond to that found in the amyloid state. A redistribution of this network of fibril-like contacts must precede aggregation into the amyloid structure.


Subject(s)
Protein Multimerization , alpha-Synuclein/chemistry , Amino Acid Sequence , Humans , Models, Molecular , Molecular Sequence Data , Protein Structure, Secondary , Solubility , alpha-Synuclein/metabolism
4.
Biophys J ; 104(8): 1740-51, 2013 Apr 16.
Article in English | MEDLINE | ID: mdl-23601321

ABSTRACT

In the last decade it has become evident that disordered states of proteins play important physiological and pathological roles and that the transient tertiary interactions often present in these systems can play a role in their biological activity. The structural characterization of such states has so far largely relied on ensemble representations, which in principle account for both their local and global structural features. However, these approaches are inherently of low resolution due to the large number of degrees of freedom of conformational ensembles and to the sparse nature of the experimental data used to determine them. Here, we overcome these limitations by showing that tertiary interactions in disordered states can be mapped at high resolution by fitting paramagnetic relaxation enhancement data to a small number of conformations, which can be as low as one. This result opens up the possibility of determining the topology of cooperatively collapsed and hidden folded states when these are present in the vast conformational landscape accessible to disordered states of proteins. As a first application, we study the long-range tertiary interactions of acid-unfolded apomyoglobin from experimentally measured paramagnetic relaxation enhancement data.


Subject(s)
Apoproteins/chemistry , Molecular Dynamics Simulation , Myoglobin/chemistry , Protein Folding , Amino Acid Sequence , Humans , Molecular Sequence Data , Protein Conformation
5.
J Am Chem Soc ; 133(41): 16617-24, 2011 Oct 19.
Article in English | MEDLINE | ID: mdl-21877736

ABSTRACT

At phase boundaries, physical activities of enzymes such as substrate complexation play critical roles in driving biocatalysis. A prominent example is the cellulase cocktails secreted by fungi and bacteria for deconstructing crystalline cellulose in biomass into soluble sugars. At interfaces, molecular mechanisms of the physical steps in biocatalysis remain elusive due to the difficulties of characterizing protein action with high temporal and spatial resolution. Here, we focus on endoglucanase I (Cel7B) from the fungus Trichoderma reesei that hydrolyzes glycosidic bonds on cellulose randomly. We employ all-atom molecular dynamics (MD) simulations to elucidate the interactions of the catalytic domain (CD) of Cel7B with a cellulose microfibril before and after complexing a glucan chain in the binding cleft. The calculated mechanical coupling networks in Cel7B-glucan and Cel7B-microfibril complexes reveal a previously unresolved allosteric coupling at the solid-liquid interface: attachment of the Cel7B CD to the cellulose surface affects glucan chain clenching in the binding cleft. Alternative loop segments of the Cel7B CD were found to affix to intact or defective surface structures on the microfibril, depending on the complexation state. From a multiple sequence alignment, residues in surface-affixing segments show strong conservation, highlighting the functional importance of the physical activities that they facilitate. Surface-affixing residues also demonstrate significant sequence correlation with active-site residues, revealing the functional connection between complexation and hydrolysis. Analysis of the Cel7B CD exemplifies that the mechanical coupling networks calculated from atomistic MD simulations can be used to capture the conservation and correlation in sequence alignment.


Subject(s)
Cellulase/chemistry , Cellulose/chemistry , Glucans/chemistry , Binding Sites , Biocatalysis , Cellulase/metabolism , Cellulose/metabolism , Glucans/metabolism , Models, Molecular , Molecular Dynamics Simulation , Trichoderma/enzymology
6.
PLoS Comput Biol ; 7(3): e1002023, 2011 Mar.
Article in English | MEDLINE | ID: mdl-21455286

ABSTRACT

The mechanism of intra-protein communication and allosteric coupling is key to understanding the structure-property relationship of protein function. For subtilisin Carlsberg, the Ca²+-binding loop is distal to substrate-binding and active sites, yet the serine protease function depends on Ca²+ binding. The atomic molecular dynamics (MD) simulations of apo and Ca²+-bound subtilisin show similar structures and there is no direct evidence that subtilisin has alternative conformations. To model the intra-protein communication due to Ca²+ binding, we transform the sequential segments of an atomic MD trajectory into separate elastic network models to represent anharmonicity and nonlinearity effectively as the temporal and spatial variation of the mechanical coupling network. In analogy to the spectrogram of sound waves, this transformation is termed the "fluctuogram" of protein dynamics. We illustrate that the Ca²+-bound and apo states of subtilisin have different fluctuograms and that intra-protein communication proceeds intermittently both in space and in time. We found that residues with large mechanical coupling variation due to Ca²+ binding correlate with the reported mutation sites selected by directed evolution for improving the stability of subtilisin and its activity in a non-aqueous environment. Furthermore, we utilize the fluctuograms calculated from MD to capture the highly correlated residues in a multiple sequence alignment. We show that in addition to the magnitude, the variance of coupling strength is also an indicative property for the sequence correlation observed in a statistical coupling analysis. The results of this work illustrate that the mechanical coupling networks calculated from atomic details can be used to correlate with functionally important mutation sites and co-evolution.


Subject(s)
Subtilisins/chemistry , Subtilisins/metabolism , Binding Sites , Calcium/metabolism , Models, Molecular , Molecular Dynamics Simulation , Molecular Sequence Data , Mutation , Protein Conformation , Sequence Alignment
SELECTION OF CITATIONS
SEARCH DETAIL
...