Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 6 de 6
Filtrar
Mais filtros










Base de dados
Intervalo de ano de publicação
1.
PLoS One ; 13(10): e0205968, 2018.
Artigo em Inglês | MEDLINE | ID: mdl-30372459

RESUMO

MOTIVATION: Modern analytical techniques such as LC-MS, GC-MS and NMR are increasingly being used to study the underlying dynamics of biological systems by tracking changes in metabolite levels over time. Such techniques are capable of providing information on large numbers of metabolites simultaneously, a feature that is exploited in non-targeted studies. However, since the dynamics of specific metabolites are unlikely to be known a priori this presents an initial subjective challenge as to where the focus of the investigation should be. Whilst a number of feed-forward software tools are available for manipulation of metabolomic data, no tool centralizes on clustering and focus is typically directed by a workflow that is chosen in advance. RESULTS: We present an interactive approach to time-course analyses and a complementary implementation in a software package, MetaboClust. This is presented through the analysis of two LC-MS time-course case studies on plants (Medicago truncatula and Alopecurus myosuroides). We demonstrate a dynamic, user-centric workflow to clustering with intrinsic visual feedback at all stages of analysis. The software is used to apply data correction, generate the time-profiles, perform exploratory statistical analysis and assign tentative metabolite identifications. Clustering is used to group metabolites in an unbiased manner, allowing pathway analysis to score metabolic pathways, based on their overlap with clusters showing interesting trends.


Assuntos
Redes e Vias Metabólicas , Metabolômica/métodos , Software , Vias Biossintéticas , Brassinosteroides/metabolismo , Análise por Conglomerados , Secas , Medicago/metabolismo , Fenótipo , Doenças das Plantas , Poaceae/metabolismo , Fatores de Tempo
2.
Metabolomics ; 12: 56, 2016.
Artigo em Inglês | MEDLINE | ID: mdl-27069441

RESUMO

The need for reproducible and comparable results is of increasing importance in non-targeted metabolomic studies, especially when differences between experimental groups are small. Liquid chromatography-mass spectrometry spectra are often acquired batch-wise so that necessary calibrations and cleaning of the instrument can take place. However this may introduce further sources of variation, such as differences in the conditions under which the acquisition of individual batches is performed. Quality control (QC) samples are frequently employed as a means of both judging and correcting this variation. Here we show that the use of QC samples can lead to problems. The non-linearity of the response can result in substantial differences between the recorded intensities of the QCs and experimental samples, making the required adjustment difficult to predict. Furthermore, changes in the response profile between one QC interspersion and the next cannot be accounted for and QC based correction can actually exacerbate the problems by introducing artificial differences. "Background correction" methods utilise all experimental samples to estimate the variation over time rather than relying on the QC samples alone. We compare non-QC correction methods with standard QC correction and demonstrate their success in reducing differences between replicate samples and their potential to highlight differences between experimental groups previously hidden by instrumental variation.

3.
Neural Netw ; 78: 24-35, 2016 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-26403824

RESUMO

In this paper, we introduce a theoretical basis for a Hadoop-based neural network for parallel and distributed feature selection in Big Data sets. It is underpinned by an associative memory (binary) neural network which is highly amenable to parallel and distributed processing and fits with the Hadoop paradigm. There are many feature selectors described in the literature which all have various strengths and weaknesses. We present the implementation details of five feature selection algorithms constructed using our artificial neural network framework embedded in Hadoop YARN. Hadoop allows parallel and distributed processing. Each feature selector can be divided into subtasks and the subtasks can then be processed in parallel. Multiple feature selectors can also be processed simultaneously (in parallel) allowing multiple feature selectors to be compared. We identify commonalities among the five features selectors. All can be processed in the framework using a single representation and the overall processing can also be greatly reduced by only processing the common aspects of the feature selectors once and propagating these aspects across all five feature selectors as necessary. This allows the best feature selector and the actual features to select to be identified for large and high dimensional data sets through exploiting the efficiency and flexibility of embedding the binary associative-memory neural network in Hadoop.


Assuntos
Redes Neurais de Computação , Estatística como Assunto/métodos , Algoritmos , Bases de Dados Factuais/estatística & dados numéricos
4.
Acta Crystallogr F Struct Biol Commun ; 71(Pt 10): 1228-34, 2015 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-26457511

RESUMO

The Protein Data Bank (PDB) is the largest available repository of solved protein structures and contains a wealth of information on successful crystallization. Many centres have used their own experimental data to draw conclusions about proteins and the conditions in which they crystallize. Here, data from the PDB were used to reanalyse some of these results. The most successful crystallization reagents were identified, the link between solution pH and the isoelectric point of the protein was investigated and the possibility of predicting whether a protein will crystallize was explored.


Assuntos
Bases de Dados de Proteínas , Estatística como Assunto , Sulfato de Amônio/química , Cristalização , Concentração de Íons de Hidrogênio , Ponto Isoelétrico , Polietilenoglicóis/química
5.
Bioinformatics ; 31(9): 1444-51, 2015 May 01.
Artigo em Inglês | MEDLINE | ID: mdl-25573921

RESUMO

MOTIVATION: The identification of suitable conditions for crystallization is a rate-limiting step in protein structure determination. The pH of an experiment is an important parameter and has the potential to be used in data-mining studies to help reduce the number of crystallization trials required. However, the pH is usually recorded as that of the buffer solution, which can be highly inaccurate. RESULTS: Here, we show that a better estimate of the true pH can be predicted by considering not only the buffer pH but also any other chemicals in the crystallization solution. We use these more accurate pH values to investigate the disputed relationship between the pI of a protein and the pH at which it crystallizes. AVAILABILITY AND IMPLEMENTATION: Data used to generate models are available as Supplementary Material. CONTACT: julie.wilson@york.ac.uk SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Cristalização , Proteínas/química , Concentração de Íons de Hidrogênio , Ponto Isoelétrico
6.
Acta Crystallogr D Biol Crystallogr ; 70(Pt 9): 2367-75, 2014 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-25195750

RESUMO

The crystallization of proteins is dependent on the careful control of numerous parameters, one of these being pH. The pH of crystallization is generally reported as that of the buffer; however, the true pH has been found to be as many as four pH units away. Measurement of pH with a meter is time-consuming and requires the reformatting of the crystallization solution. To overcome this, a high-throughput method for pH determination of buffered solutions has been developed with results comparable to those of a pH meter.


Assuntos
Colorimetria/métodos , Ensaios de Triagem em Larga Escala , Concentração de Íons de Hidrogênio , Cristalização , Espectrofotometria Ultravioleta
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...