Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 17 de 17
Filtrar
Mais filtros










Base de dados
Intervalo de ano de publicação
1.
Sensors (Basel) ; 23(10)2023 May 20.
Artigo em Inglês | MEDLINE | ID: mdl-37430842

RESUMO

This paper provides a novel methodology for human-driven decision support for capacity allocation in labour-intensive manufacturing systems. In such systems (where output depends solely on human labour) it is essential that any changes aimed at improving productivity are informed by the workers' actual working practices, rather than attempting to implement strategies based on an idealised representation of a theoretical production process. This paper reports how worker position data (obtained by localisation sensors) can be used as input to process mining algorithms to generate a data-driven process model to understand how manufacturing tasks are actually performed and how this model can then be used to build a discrete event simulation to investigate the performance of capacity allocation adjustments made to the original working practice observed in the data. The proposed methodology is demonstrated using a real-world dataset generated by a manual assembly line involving six workers performing six manufacturing tasks. It is found that, with small capacity adjustments, one can reduce the completion time by 7% (i.e., without requiring any additional workers), and with an additional worker a 16% reduction in completion time can be achieved by increasing the capacity of the bottleneck tasks which take relatively longer time than others.

2.
PLoS One ; 16(11): e0259680, 2021.
Artigo em Inglês | MEDLINE | ID: mdl-34762716

RESUMO

Cities and towns have often developed infrastructure that enabled a variety of socio-economic interactions. Street networks within these urban settings provide key access to resources, neighborhoods, and cultural facilities. Studies on settlement scaling have also demonstrated that a variety of urban infrastructure and resources indicate clear population scaling relationships in both modern and ancient settings. This article presents an approach that investigates past street network centrality and its relationship to population scaling in urban contexts. Centrality results are compared statistically among different urban settings, which are categorized as orthogonal (i.e., planned) or self-organizing (i.e., organic) urban settings, with places having both characteristics classified as hybrid. Results demonstrate that street nodes have a power law relationship to urban area, where the number of nodes increases and node density decreases in a sub-linear manner for larger sites. Most median centrality values decrease in a negative sub-linear manner as sites are larger, with organic and hybrid urban sites' centrality being generally less and diminishing more rapidly than orthogonal settings. Diminishing centrality shows comparability to modern urban systems, where larger urban districts may restrict overall interaction due to increasing transport costs over wider areas. Centrality results indicate that scaling results have multiples of approximately ⅙ or ⅓ that are comparable to other urban and road infrastructure, suggesting a potential relationship between different infrastructure features and population in urban centers. The results have implications for archaeological settlements where urban street plans are incomplete or undetermined, as it allows forecasts to be made on past urban sites' street network centrality. Additionally, a tool to enable analysis of street networks and centrality is provided as part of the contribution.


Assuntos
Planejamento de Cidades/métodos , Cidades , Saúde Ambiental , Geografia , Humanos , Modelos Estatísticos , Saúde Pública , Urbanização
3.
J Comput Biol ; 27(5): 796-814, 2020 05.
Artigo em Inglês | MEDLINE | ID: mdl-31390220

RESUMO

The folding of a protein structure is a process governed by both local and nonlocal interactions. While incorporating local dependencies into a machine learning algorithm for protein structure prediction is simple and has been exploited for some time, the modeling of long-range dependences which result from structurally-neighboring residues has only recently begun to be addressed. Structural properties designed to localize the prediction space from direct tertiary structure prediction, such as secondary structure, contact maps, and intrinsic disorder, among others, have begun to greatly benefit from machine learning models capable of modeling a widened, potentially global protein context. This has led to a direct enhancement of the quality of predicted tertiary structures through both the optimization of structural constraints and improved reliability of alignments to structural templates. These improvements have stemmed from the application of recurrent and convolutional neural network architectures effective not only at innate sequential context propagation but also deep feature extraction due to novel skip connections and normalization techniques allowing for greatly enhanced error back-propagation. The recent results from independent blind testing in Critical Assessment of protein Structure Prediction 13 have signaled the beginning of a new generation of protein structure prediction through the utilization of these contextual techniques. The ripples from advancements in the determination of one-dimensional and two-dimensional structural properties have us moving ever closer to the solution of the protein structure prediction problem.


Assuntos
Envelhecimento/genética , Aprendizado de Máquina , Conformação Proteica , Proteínas/genética , Envelhecimento/patologia , Algoritmos , Redes Neurais de Computação , Proteínas/ultraestrutura
4.
J Comput Chem ; 41(8): 745-750, 2020 03 30.
Artigo em Inglês | MEDLINE | ID: mdl-31845383

RESUMO

Protein structure determination has long been one of the most challenging problems in molecular biology for the past 60 years. Here we present an ab initio protein tertiary-structure prediction method assisted by predicted contact maps from SPOT-Contact and predicted dihedral angles from SPIDER 3. These predicted properties were then fed to the crystallography and NMR system (CNS) for restrained structure modeling. The resulted structures are first evaluated by the potential energy calculated by CNS, followed by dDFIRE energy function for model selections. The method called SPOT-Fold has been tested on 241 CASP targets between 67 and 670 amino acid residues, 60 randomly selected globular proteins under 100 amino acids. The method has a comparable accuracy to other contact-map-based modeling techniques. © 2019 Wiley Periodicals, Inc.


Assuntos
Proteínas/química , Software , Modelos Moleculares , Conformação Proteica
5.
J Mol Biol ; 432(11): 3379-3387, 2020 05 15.
Artigo em Inglês | MEDLINE | ID: mdl-31870849

RESUMO

Computational predictions of the intrinsic disorder and its functions are instrumental to facilitate annotation for the millions of unannotated proteins. However, access to these predictors is fragmented and requires substantial effort to find them and to collect and combine their results. The DEPICTER (DisorderEd PredictIon CenTER) server provides first-of-its-kind centralized access to 10 popular disorder and disorder function predictions that cover protein and nucleic acids binding, linkers, and moonlighting regions. It automates the prediction process, runs user-selected methods on the server side, visualizes the results, and outputs all predictions in a consistent and easy-to-parse format. DEPICTER also includes two accurate consensus predictors of disorder and disordered protein binding. Empirical tests on an independent (low similarity) benchmark dataset reveal that the computational tools included in DEPICTER generate accurate predictions that are significantly better than the results secured using sequence alignment. The DEPICTER server is freely available at http://biomine.cs.vcu.edu/servers/DEPICTER/.


Assuntos
Biologia Computacional , Bases de Dados de Proteínas , Proteínas Intrinsicamente Desordenadas/genética , Software , Sequência de Aminoácidos/genética , Proteínas Intrinsicamente Desordenadas/ultraestrutura , Ligação Proteica/genética , Análise de Sequência de Proteína
6.
Bioinformatics ; 36(4): 1107-1113, 2020 02 15.
Artigo em Inglês | MEDLINE | ID: mdl-31504193

RESUMO

MOTIVATION: Protein intrinsic disorder describes the tendency of sequence residues to not fold into a rigid three-dimensional shape by themselves. However, some of these disordered regions can transition from disorder to order when interacting with another molecule in segments known as molecular recognition features (MoRFs). Previous analysis has shown that these MoRF regions are indirectly encoded within the prediction of residue disorder as low-confidence predictions [i.e. in a semi-disordered state P(D)≈0.5]. Thus, what has been learned for disorder prediction may be transferable to MoRF prediction. Transferring the internal characterization of protein disorder for the prediction of MoRF residues would allow us to take advantage of the large training set available for disorder prediction, enabling the training of larger analytical models than is currently feasible on the small number of currently available annotated MoRF proteins. In this paper, we propose a new method for MoRF prediction by transfer learning from the SPOT-Disorder2 ensemble models built for disorder prediction. RESULTS: We confirm that directly training on the MoRF set with a randomly initialized model yields substantially poorer performance on independent test sets than by using the transfer-learning-based method SPOT-MoRF, for both deep and simple networks. Its comparison to current state-of-the-art techniques reveals its superior performance in identifying MoRF binding regions in proteins across two independent testing sets, including our new dataset of >800 protein chains. These test chains share <30% sequence similarity to all training and validation proteins used in SPOT-Disorder2 and SPOT-MoRF, and provide a much-needed large-scale update on the performance of current MoRF predictors. The method is expected to be useful in locating functional disordered regions in proteins. AVAILABILITY AND IMPLEMENTATION: SPOT-MoRF and its data are available as a web server and as a standalone program at: http://sparks-lab.org/jack/server/SPOT-MoRF/index.php. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Biologia Computacional , Proteínas Intrinsicamente Desordenadas , Aprendizado de Máquina
7.
Nat Commun ; 10(1): 5407, 2019 11 27.
Artigo em Inglês | MEDLINE | ID: mdl-31776342

RESUMO

The majority of our human genome transcribes into noncoding RNAs with unknown structures and functions. Obtaining functional clues for noncoding RNAs requires accurate base-pairing or secondary-structure prediction. However, the performance of such predictions by current folding-based algorithms has been stagnated for more than a decade. Here, we propose the use of deep contextual learning for base-pair prediction including those noncanonical and non-nested (pseudoknot) base pairs stabilized by tertiary interactions. Since only [Formula: see text]250 nonredundant, high-resolution RNA structures are available for model training, we utilize transfer learning from a model initially trained with a recent high-quality bpRNA dataset of [Formula: see text]10,000 nonredundant RNAs made available through comparative analysis. The resulting method achieves large, statistically significant improvement in predicting all base pairs, noncanonical and non-nested base pairs in particular. The proposed method (SPOT-RNA), with a freely available server and standalone software, should be useful for improving RNA structure modeling, sequence alignment, and functional annotations.


Assuntos
Redes Neurais de Computação , RNA/química , Software , Algoritmos , Pareamento de Bases , Biologia Computacional/métodos , Bases de Dados Genéticas , Aprendizado Profundo , Humanos , Estrutura Secundária de Proteína , RNA não Traduzido/química
8.
J Synchrotron Radiat ; 26(Pt 2): 565-570, 2019 Mar 01.
Artigo em Inglês | MEDLINE | ID: mdl-30855269

RESUMO

SyLMAND, the Synchrotron Laboratory for Micro and Nano Devices, is a recently commissioned microfabrication bend magnet beamline with ancillary cleanroom facilities at the Canadian Light Source. The synchrotron radiation is applied to pattern high-aspect-ratio polymer microstructures used in the area of micro-electro-mechanical systems (MEMS). SyLMAND particularly focuses on spectral and beam power adjustability and large exposable area formats in an inert gas atmosphere; a rotating-disk intensity chopper allows for independent beam-power reduction, while continuous spectral tuning between 1-2 keV and >15 keV photon energies is achieved using a double-mirror system and low-atomic-number filters. Homogeneous exposure of samples up to six inches in diameter is performed in the experimental endstation, a vertically scanning precision stage (scanner) with tilt and rotation capabilities under 100 mbar helium. Commissioning was completed in late 2017, and SyLMAND is currently ramping up its user program, mostly in the areas of RF MEMS, micro-fluidics/life sciences and micro-optics.

9.
Genomics Proteomics Bioinformatics ; 17(6): 645-656, 2019 12.
Artigo em Inglês | MEDLINE | ID: mdl-32173600

RESUMO

Intrinsically disordered or unstructured proteins (or regions in proteins) have been found to be important in a wide range of biological functions and implicated in many diseases. Due to the high cost and low efficiency of experimental determination of intrinsic disorder and the exponential increase of unannotated protein sequences, developing complementary computational prediction methods has been an active area of research for several decades. Here, we employed an ensemble of deep Squeeze-and-Excitation residual inception and long short-term memory (LSTM) networks for predicting protein intrinsic disorder with input from evolutionary information and predicted one-dimensional structural properties. The method, called SPOT-Disorder2, offers substantial and consistent improvement not only over our previous technique based on LSTM networks alone, but also over other state-of-the-art techniques in three independent tests with different ratios of disordered to ordered amino acid residues, and for sequences with either rich or limited evolutionary information. More importantly, semi-disordered regions predicted in SPOT-Disorder2 are more accurate in identifying molecular recognition features (MoRFs) than methods directly designed for MoRFs prediction. SPOT-Disorder2 is available as a web server and as a standalone program at https://sparks-lab.org/server/spot-disorder2/.


Assuntos
Biologia Computacional/métodos , Aprendizado Profundo , Proteínas Intrinsicamente Desordenadas/química , Sequência de Aminoácidos , Evolução Molecular , Internet , Proteínas Intrinsicamente Desordenadas/metabolismo
10.
Bioinformatics ; 35(14): 2403-2410, 2019 07 15.
Artigo em Inglês | MEDLINE | ID: mdl-30535134

RESUMO

MOTIVATION: Sequence-based prediction of one dimensional structural properties of proteins has been a long-standing subproblem of protein structure prediction. Recently, prediction accuracy has been significantly improved due to the rapid expansion of protein sequence and structure libraries and advances in deep learning techniques, such as residual convolutional networks (ResNets) and Long-Short-Term Memory Cells in Bidirectional Recurrent Neural Networks (LSTM-BRNNs). Here we leverage an ensemble of LSTM-BRNN and ResNet models, together with predicted residue-residue contact maps, to continue the push towards the attainable limit of prediction for 3- and 8-state secondary structure, backbone angles (θ, τ, ϕ and ψ), half-sphere exposure, contact numbers and solvent accessible surface area (ASA). RESULTS: The new method, named SPOT-1D, achieves similar, high performance on a large validation set and test set (≈1000 proteins in each set), suggesting robust performance for unseen data. For the large test set, it achieves 87% and 77% in 3- and 8-state secondary structure prediction and 0.82 and 0.86 in correlation coefficients between predicted and measured ASA and contact numbers, respectively. Comparison to current state-of-the-art techniques reveals substantial improvement in secondary structure and backbone angle prediction. In particular, 44% of 40-residue fragment structures constructed from predicted backbone Cα-based θ and τ angles are less than 6 Å root-mean-squared-distance from their native conformations, nearly 20% better than the next best. The method is expected to be useful for advancing protein structure and function prediction. AVAILABILITY AND IMPLEMENTATION: SPOT-1D and its data is available at: http://sparks-lab.org/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Redes Neurais de Computação , Sequência de Aminoácidos , Biologia Computacional , Estrutura Secundária de Proteína , Proteínas , Solventes
11.
Rev Sci Instrum ; 89(11): 115001, 2018 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-30501345

RESUMO

X-ray masks are indispensable tools in deep X-ray lithography (XRL). To date, hardly any fabrication technology can provide affordable and readily available masks with good structure quality. The bottleneck of adequate masks to a large extent limits the widespread use of XRL. In this article, an alternative XRL mask fabrication process is described to significantly improve availability and cost efficiency of XRL masks as key instruments in XRL processing: A 355 nm UV-laser is applied to expose SU-8 resist on an antireflective coating and a copper sacrificial substrate. The voids in this resist template are filled by a two-step electroplating process with sacrificial nickel and 3.6 µm thick gold absorbers. A second SU-8 coat embeds the absorbers, forming the 40 µm mask membrane. This configuration allows for XRL into resists of up to about 200 µm thickness at the SyLMAND beamline, Canada. The absorber structure accuracy is about 1 µm, at smallest tested lateral dimensions of 2 µm isolated features and 500 nm details. Upon release from the substrate, the membrane locally deforms by up to 1.79 µm. PMMA microstructures patterned with such a mask have smooth and vertical sidewalls. The SyLMAND chopper allows one to limit thermal deformations during exposure to the micrometer range: At a beam power of 0.42 W, typical thermal deformations are 0.5 µm-1.4 µm, depending on the layout, and position inaccuracies are about 3.3 µm.

12.
J Chem Inf Model ; 58(11): 2369-2376, 2018 11 26.
Artigo em Inglês | MEDLINE | ID: mdl-30395465

RESUMO

Recognizing the widespread existence of intrinsically disordered regions in proteins spurred the development of computational techniques for their detection. All existing techniques can be classified into methods relying on single-sequence information and those relying on evolutionary sequence profiles generated from multiple-sequence alignments. The methods based on sequence profiles are, in general, more accurate because the presence or absence of conserved amino acid residues in a protein sequence provides important information on the structural and functional roles of the residues. However, the wide applicability of profile-based techniques is limited by time-consuming calculation of sequence profiles. Here we demonstrate that the performance gap between profile-based techniques and single-sequence methods can be reduced by using an ensemble of deep recurrent and convolutional neural networks that allow whole-sequence learning. In particular, the single-sequence method (called SPOT-Disorder-Single) is more accurate than SPOT-Disorder (a profile-based method) for proteins with few homologous sequences and comparable for proteins in predicting long-disordered regions. The method performance is robust across four independent test sets with different amounts of short- and long-disordered regions. SPOT-Disorder-Single is available as a Web server and as a standalone program at http://sparks-lab.org/jack/server/SPOT-Disorder-Single .


Assuntos
Proteínas Intrinsicamente Desordenadas/química , Sequência de Aminoácidos , Animais , Bases de Dados de Proteínas , Humanos , Modelos Químicos , Modelos Moleculares , Redes Neurais de Computação , Conformação Proteica , Análise de Sequência de Proteína , Software
13.
J Chem Inf Model ; 58(9): 2033-2042, 2018 09 24.
Artigo em Inglês | MEDLINE | ID: mdl-30118602

RESUMO

It has been long established that cis conformations of amino acid residues play many biologically important roles despite their rare occurrence in protein structure. Because of this rarity, few methods have been developed for predicting cis isomers from protein sequences, most of which are based on outdated datasets and lack the means for independent testing. In this work, using a database of >10000 high-resolution protein structures, we update the statistics of cis isomers and develop a sequence-based prediction technique using an ensemble of residual convolutional and long short-term memory bidirectional recurrent neural networks that allow learning from the whole protein sequence. We show that ensembling eight neural network models yields maximum Matthews correlation coefficient values of approximately 0.35 for cis-Pro isomers and 0.1 for cis-nonPro residues. The method should be useful for prioritizing functionally important residues in cis isomers for experimental validations and improving the sampling of rare protein conformations for ab initio protein structure prediction.


Assuntos
Aprendizado de Máquina , Prolina/química , Proteínas/química , Sequência de Aminoácidos
14.
Bioinformatics ; 34(23): 4039-4045, 2018 12 01.
Artigo em Inglês | MEDLINE | ID: mdl-29931279

RESUMO

Motivation: Accurate prediction of a protein contact map depends greatly on capturing as much contextual information as possible from surrounding residues for a target residue pair. Recently, ultra-deep residual convolutional networks were found to be state-of-the-art in the latest Critical Assessment of Structure Prediction techniques (CASP12) for protein contact map prediction by attempting to provide a protein-wide context at each residue pair. Recurrent neural networks have seen great success in recent protein residue classification problems due to their ability to propagate information through long protein sequences, especially Long Short-Term Memory (LSTM) cells. Here, we propose a novel protein contact map prediction method by stacking residual convolutional networks with two-dimensional residual bidirectional recurrent LSTM networks, and using both one-dimensional sequence-based and two-dimensional evolutionary coupling-based information. Results: We show that the proposed method achieves a robust performance over validation and independent test sets with the Area Under the receiver operating characteristic Curve (AUC) > 0.95 in all tests. When compared to several state-of-the-art methods for independent testing of 228 proteins, the method yields an AUC value of 0.958, whereas the next-best method obtains an AUC of 0.909. More importantly, the improvement is over contacts at all sequence-position separations. Specifically, a 8.95%, 5.65% and 2.84% increase in precision were observed for the top L∕10 predictions over the next best for short, medium and long-range contacts, respectively. This confirms the usefulness of ResNets to congregate the short-range relations and 2D-BRLSTM to propagate the long-range dependencies throughout the entire protein contact map 'image'. Availability and implementation: SPOT-Contact server url: http://sparks-lab.org/jack/server/SPOT-Contact/. Supplementary information: Supplementary data are available at Bioinformatics online.


Assuntos
Biologia Computacional , Redes Neurais de Computação , Mapeamento de Interação de Proteínas/métodos , Proteínas/química , Sequência de Aminoácidos
15.
Proteins ; 86(6): 629-633, 2018 06.
Artigo em Inglês | MEDLINE | ID: mdl-29508448

RESUMO

Designing protein sequences that can fold into a given structure is a well-known inverse protein-folding problem. One important characteristic to attain for a protein design program is the ability to recover wild-type sequences given their native backbone structures. The highest average sequence identity accuracy achieved by current protein-design programs in this problem is around 30%, achieved by our previous system, SPIN. SPIN is a program that predicts sequences compatible with a provided structure using a neural network with fragment-based local and energy-based nonlocal profiles. Our new model, SPIN2, uses a deep neural network and additional structural features to improve on SPIN. SPIN2 achieves over 34% in sequence recovery in 10-fold cross-validation and independent tests, a 4% improvement over the previous version. The sequence profiles generated from SPIN2 are expected to be useful for improving existing fold recognition and protein design techniques. SPIN2 is available at http://sparks-lab.org.


Assuntos
Redes Neurais de Computação , Proteínas/química , Software , Sequência de Aminoácidos , Bases de Dados de Proteínas , Modelos Moleculares , Dobramento de Proteína , Estrutura Secundária de Proteína
16.
Brief Bioinform ; 19(3): 482-494, 2018 05 01.
Artigo em Inglês | MEDLINE | ID: mdl-28040746

RESUMO

Protein secondary structure prediction began in 1951 when Pauling and Corey predicted helical and sheet conformations for protein polypeptide backbone even before the first protein structure was determined. Sixty-five years later, powerful new methods breathe new life into this field. The highest three-state accuracy without relying on structure templates is now at 82-84%, a number unthinkable just a few years ago. These improvements came from increasingly larger databases of protein sequences and structures for training, the use of template secondary structure information and more powerful deep learning techniques. As we are approaching to the theoretical limit of three-state prediction (88-90%), alternative to secondary structure prediction (prediction of backbone torsion angles and Cα-atom-based angles and torsion angles) not only has more room for further improvement but also allows direct prediction of three-dimensional fragment structures with constantly improved accuracy. About 20% of all 40-residue fragments in a database of 1199 non-redundant proteins have <6 Å root-mean-squared distance from the native conformations by SPIDER2. More powerful deep learning methods with improved capability of capturing long-range interactions begin to emerge as the next generation of techniques for secondary structure prediction. The time has come to finish off the final stretch of the long march towards protein secondary structure prediction.


Assuntos
Algoritmos , Biologia Computacional/métodos , Modelos Teóricos , Redes Neurais de Computação , Estrutura Secundária de Proteína , Proteínas/química , Bases de Dados de Proteínas , Humanos
17.
Bioinformatics ; 33(5): 685-692, 2017 03 01.
Artigo em Inglês | MEDLINE | ID: mdl-28011771

RESUMO

Motivation: Capturing long-range interactions between structural but not sequence neighbors of proteins is a long-standing challenging problem in bioinformatics. Recently, long short-term memory (LSTM) networks have significantly improved the accuracy of speech and image classification problems by remembering useful past information in long sequential events. Here, we have implemented deep bidirectional LSTM recurrent neural networks in the problem of protein intrinsic disorder prediction. Results: The new method, named SPOT-Disorder, has steadily improved over a similar method using a traditional, window-based neural network (SPINE-D) in all datasets tested without separate training on short and long disordered regions. Independent tests on four other datasets including the datasets from critical assessment of structure prediction (CASP) techniques and >10 000 annotated proteins from MobiDB, confirmed SPOT-Disorder as one of the best methods in disorder prediction. Moreover, initial studies indicate that the method is more accurate in predicting functional sites in disordered regions. These results highlight the usefulness combining LSTM with deep bidirectional recurrent neural networks in capturing non-local, long-range interactions for bioinformatics applications. Availability and Implementation: SPOT-disorder is available as a web server and as a standalone program at: http://sparks-lab.org/server/SPOT-disorder/index.php . Contact: j.hanson@griffith.edu.au or yuedong.yang@griffith.edu.au or yaoqi.zhou@griffith.edu.au. Supplementary information: Supplementary data is available at Bioinformatics online.


Assuntos
Biologia Computacional/métodos , Aprendizado de Máquina , Redes Neurais de Computação , Proteínas/química , Algoritmos , Caspases/química , Caspases/metabolismo , Doenças Genéticas Inatas/metabolismo , Memória de Curto Prazo , Conformação Proteica , Proteínas/metabolismo
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...