Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 10 de 10
Filtrar
Mais filtros










Base de dados
Intervalo de ano de publicação
1.
J Pathol Inform ; 4(Suppl): S7, 2013.
Artigo em Inglês | MEDLINE | ID: mdl-23766943

RESUMO

CONTEXT: Three dimensional (3D) tissue reconstructions from the histology images with different stains allows the spatial alignment of structural and functional elements highlighted by different stains for quantitative study of many physiological and pathological phenomena. This has significant potential to improve the understanding of the growth patterns and the spatial arrangement of diseased cells, and enhance the study of biomechanical behavior of the tissue structures towards better treatments (e.g. tissue-engineering applications). METHODS: This paper evaluates three strategies for 3D reconstruction from sets of two dimensional (2D) histological sections with different stains, by combining methods of 2D multi-stain registration and 3D volumetric reconstruction from same stain sections. SETTING AND DESIGN: The different strategies have been evaluated on two liver specimens (80 sections in total) stained with Hematoxylin and Eosin (H and E), Sirius Red, and Cytokeratin (CK) 7. RESULTS AND CONCLUSION: A strategy of using multi-stain registration to align images of a second stain to a volume reconstructed by same-stain registration results in the lowest overall error, although an interlaced image registration approach may be more robust to poor section quality.

2.
Bioinformatics ; 26(19): 2431-7, 2010 Oct 01.
Artigo em Inglês | MEDLINE | ID: mdl-20693320

RESUMO

MOTIVATION: Functional genomics data provides a rich source of information that can be used in the annotation of the thousands of genes of unknown function found in most sequenced genomes. However, previous gene function prediction programs are mostly produced for relatively well-annotated organisms that often have a large amount of functional genomics data. Here, we present a novel method for predicting gene function that uses clustering of genes by semantic similarity, a naïve Bayes classifier and 'enrichment analysis' to predict gene function for a genome that is less well annotated but does has a severe effect on human health, that of the malaria parasite Plasmodium falciparum. RESULTS: Predictions for the molecular function, biological process and cellular component of P.falciparum genes were created from eight different datasets with a combined prediction also being produced. The high-confidence predictions produced by the combined prediction were compared to those produced by a simple K-nearest neighbour classifier approach and were shown to improve accuracy and coverage. Finally, two case studies are described, which investigate two biological processes in more detail, that of translation initiation and invasion of the host cell. AVAILABILITY: Predictions produced are available at http://www.bioinformatics.leeds.ac.uk/∼bio5pmrt/PAGODA.


Assuntos
Biologia Computacional/métodos , Genoma de Protozoário , Genômica/métodos , Plasmodium falciparum/genética , Análise por Conglomerados , Proteínas de Protozoários/genética
3.
Plant J ; 61(4): 713-21, 2010 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-19947983

RESUMO

Despite recent advances, accurate gene function prediction remains an elusive goal, with very few methods directly applicable to the plant Arabidopsis thaliana. In this study, we present GO-At (gene ontology prediction in A. thaliana), a method that combines five data types (co-expression, sequence, phylogenetic profile, interaction and gene neighbourhood) to predict gene function in Arabidopsis. Using a simple, yet powerful two-step approach, GO-At first generates a list of genes ranked in descending order of probability of functional association with the query gene. Next, a prediction score is automatically assigned to each function in this list based on the assumption that functions appearing most frequently at the top of the list are most likely to represent the function of the query gene. In this way, the second step provides an effective alternative to simply taking the 'best hit' from the first list, and achieves success rates of up to 79%. GO-At is applicable across all three GO categories: molecular function, biological process and cellular component, and can assign functions at multiple levels of annotation detail. Furthermore, we demonstrate GO-At's ability to predict functions of uncharacterized genes by identifying ten putative golgins/Golgi-associated proteins amongst 8219 genes of previously unknown cellular component and present independent evidence to support our predictions. A web-based implementation of GO-At (http://www.bioinformatics.leeds.ac.uk/goat) is available, providing a unique resource for plant researchers to make predictions for uncharacterized genes and predict novel functions in Arabidopsis.


Assuntos
Arabidopsis/genética , Biologia Computacional/métodos , Bases de Dados de Proteínas , Perfilação da Expressão Gênica/métodos , Genes de Plantas , Internet , Filogenia , Mapeamento de Interação de Proteínas/métodos , Interface Usuário-Computador
4.
BMC Syst Biol ; 3: 85, 2009 Sep 03.
Artigo em Inglês | MEDLINE | ID: mdl-19728870

RESUMO

BACKGROUND: The elucidation of networks from a compendium of gene expression data is one of the goals of systems biology and can be a valuable source of new hypotheses for experimental researchers. For Arabidopsis, there exist several thousand microarrays which form a valuable resource from which to learn. RESULTS: A novel Bayesian network-based algorithm to infer gene regulatory networks from gene expression data is introduced and applied to learn parts of the transcriptomic network in Arabidopsis thaliana from a large number (thousands) of separate microarray experiments. Starting from an initial set of genes of interest, a network is grown by iterative addition to the model of the gene, from another defined set of genes, which gives the 'best' learned network structure. The gene set for iterative growth can be as large as the entire genome. A number of networks are inferred and analysed; these show (i) an agreement with the current literature on the circadian clock network, (ii) the ability to model other networks, and (iii) that the learned network hypotheses can suggest new roles for poorly characterized genes, through addition of relevant genes from an unconstrained list of over 15,000 possible genes. To demonstrate the latter point, the method is used to suggest that particular GATA transcription factors are regulators of photosynthetic genes. Additionally, the performance in recovering a known network from different amounts of synthetically generated data is evaluated. CONCLUSION: Our results show that plausible regulatory networks can be learned from such gene expression data alone. This work demonstrates that network hypotheses can be generated from existing gene expression data for use by experimental biologists.


Assuntos
Proteínas de Arabidopsis/metabolismo , Arabidopsis/metabolismo , Perfilação da Expressão Gênica/métodos , Regulação da Expressão Gênica de Plantas/fisiologia , Modelos Biológicos , Reconhecimento Automatizado de Padrão/métodos , Transdução de Sinais/fisiologia , Algoritmos , Simulação por Computador
6.
Bioinformatics ; 23(6): 664-72, 2007 Mar 15.
Artigo em Inglês | MEDLINE | ID: mdl-17234639

RESUMO

MOTIVATION: To predict which of the vast number of human single nucleotide polymorphisms (SNPs) are deleterious to gene function or likely to be disease associated is an important problem, and many methods have been reported in the literature. All methods require data sets of mutations classified as 'deleterious' or 'neutral' for training and/or validation. While different workers have used different data sets there has been no study of which is best. Here, the three most commonly used data sets are analysed. We examine their contents and relate this to classifiers, with the aims of revealing the strengths and pitfalls of each data set, and recommending a best approach for future studies. RESULTS: The data sets examined are shown to be substantially different in content, particularly with regard to amino acid substitutions, reflecting the different ways in which they are derived. This leads to differences in classifiers and reveals some serious pitfalls of some data sets, making them less than ideal for non-synonymous SNP prediction. AVAILABILITY: Software is available on request from the authors.


Assuntos
Algoritmos , Artefatos , Inteligência Artificial , Mapeamento Cromossômico/métodos , Análise Mutacional de DNA/métodos , Reconhecimento Automatizado de Padrão/métodos , Polimorfismo de Nucleotídeo Único/genética , Sequência de Bases , Variação Genética/genética , Genoma Humano/genética , Humanos , Dados de Sequência Molecular , Reprodutibilidade dos Testes , Sensibilidade e Especificidade
7.
BMC Bioinformatics ; 7: 405, 2006 Sep 06.
Artigo em Inglês | MEDLINE | ID: mdl-16956412

RESUMO

BACKGROUND: A number of methods that use both protein structural and evolutionary information are available to predict the functional consequences of missense mutations. However, many of these methods break down if either one of the two types of data are missing. Furthermore, there is a lack of rigorous assessment of how important the different factors are to prediction. RESULTS: Here we use Bayesian networks to predict whether or not a missense mutation will affect the function of the protein. Bayesian networks provide a concise representation for inferring models from data, and are known to generalise well to new data. More importantly, they can handle the noisy, incomplete and uncertain nature of biological data. Our Bayesian network achieved comparable performance with previous machine learning methods. The predictive performance of learned model structures was no better than a naïve Bayes classifier. However, analysis of the posterior distribution of model structures allows biologically meaningful interpretation of relationships between the input variables. CONCLUSION: The ability of the Bayesian network to make predictions when only structural or evolutionary data was observed allowed us to conclude that structural information is a significantly better predictor of the functional consequences of a missense mutation than evolutionary information, for the dataset used. Analysis of the posterior distribution of model structures revealed that the top three strongest connections with the class node all involved structural nodes. With this in mind, we derived a simplified Bayesian network that used just these three structural descriptors, with comparable performance to that of an all node network.


Assuntos
Teorema de Bayes , Modelos Biológicos , Mutação de Sentido Incorreto , Proteínas/química , Proteínas/fisiologia , Algoritmos , Aminoácidos/química , Aminoácidos/genética , Bases de Dados Genéticas , Cadeias de Markov , Modelos Estatísticos , Método de Monte Carlo , Muramidase/química , Muramidase/genética , Probabilidade , Conformação Proteica , Curva ROC , Proteínas Repressoras/química , Proteínas Repressoras/genética , Relação Estrutura-Atividade
8.
J Mol Biol ; 362(2): 365-86, 2006 Sep 15.
Artigo em Inglês | MEDLINE | ID: mdl-16919296

RESUMO

Identifying the interface between two interacting proteins provides important clues to the function of a protein, and is becoming increasing relevant to drug discovery. Here, surface patch analysis was combined with a Bayesian network to predict protein-protein binding sites with a success rate of 82% on a benchmark dataset of 180 proteins, improving by 6% on previous work and well above the 36% that would be achieved by a random method. A comparable success rate was achieved even when evolutionary information was missing, a further improvement on our previous method which was unable to handle incomplete data automatically. In a case study of the Mog1p family, we showed that our Bayesian network method can aid the prediction of previously uncharacterised binding sites and provide important clues to protein function. On Mog1p itself a putative binding site involved in the SLN1-SKN7 signal transduction pathway was detected, as was a Ran binding site, previously characterized solely by conservation studies, even though our automated method operated without using homologous proteins. On the remaining members of the family (two structural genomics targets, and a protein involved in the photosystem II complex in higher plants) we identified novel binding sites with little correspondence to those on Mog1p. These results suggest that members of the Mog1p family bind to different proteins and probably have different functions despite sharing the same overall fold. We also demonstrated the applicability of our method to drug discovery efforts by successfully locating a number of binding sites involved in the protein-protein interaction network of papilloma virus infection. In a separate study, we attempted to distinguish between the two types of binding site, obligate and non-obligate, within our dataset using a second Bayesian network. This proved difficult although some separation was achieved on the basis of patch size, electrostatic potential and conservation. Such was the similarity between the two interacting patch types, we were able to use obligate binding site properties to predict the location of non-obligate binding sites and vice versa.


Assuntos
Teorema de Bayes , Modelos Moleculares , Conformação Proteica , Proteínas , Animais , Sítios de Ligação , Humanos , Ligação Proteica , Proteínas/química , Proteínas/metabolismo , Curva ROC , Reprodutibilidade dos Testes , Proteína ran de Ligação ao GTP/química , Proteína ran de Ligação ao GTP/genética , Proteína ran de Ligação ao GTP/metabolismo
10.
Stud Health Technol Inform ; 94: 124-6, 2003.
Artigo em Inglês | MEDLINE | ID: mdl-15455876

RESUMO

Pressure has grown over the past decade to provide more rigorous and standardized testing of surgical trainees at both higher and lower levels. VR technologies appear to offer a solution but the cost of equipment and realism of the interface present major research challenges. The central requirement of simulation remains assessment, however, and the present study examines this issue within the context of surgical suturing skills. By tracking users during suturing tasks, we argue that errors in technique can be analysed by the examination of standard pattern spaces.


Assuntos
Competência Clínica , Procedimentos Cirúrgicos Operatórios/normas , Interface Usuário-Computador , Humanos
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...