Pesquisa | Portal Regional da BVS (teste)

1.

Maternal diabetes independent of BMI is associated with altered accretion of adipose tissue in large for gestational age fetuses.

Lam, Penny; Mein, Brendan J; Benzie, Ronald J; Ormerod, John T; Robledo, Kristy P; Hibbert, Emily J; Nanan, Ralph K.

PLoS One ; 17(5): e0268972, 2022.

Artigo em Inglês | MEDLINE | ID: mdl-35639703

RESUMO

AIM: To analyse the effects of maternal diabetes mellitus (DM) and body mass Index (BMI) on central and peripheral fat accretion of large for gestational age (LGA) offspring. METHODS: This retrospective study included LGA fetuses (n = 595) with ultrasound scans at early (19.23 ± 0.68 weeks), mid (28.98 ± 1.62 weeks) and late (36.20 ± 1.59 weeks) stages of adipogenesis and measured abdominal (AFT) and mid-thigh (TFT) fat as surrogates for central and peripheral adiposity. Women were categorised according to BMI and DM status [pre-gestational (P-DM; n = 59), insulin managed (I-GDM; n = 132) and diet managed gestational diabetes (D-GDM; n = 29)]. Analysis of variance and linear regressions were applied. RESULTS: AFT and TFT did not differ significantly between BMI categories (normal, overweight and obese). In contrast, AFT was significantly higher in pregnancies affected by D-GDM compared to non-DM pregnancies from mid stage (0.44 mm difference, p = 0.002) and for all DM categories in late stage of adipogenesis (≥ 0.49 mm difference, p < 0.008). Late stage TFT accretion was higher than controls for P-DM and I-GDM but not for D-GDM (0.67 mm difference, p < 0.001; 0.49 mm difference, p = 0.001, 0.56 mm difference, p = 0.22 respectively). In comparison to the early non-DM group with an AFT to TFT ratio of 1.07, the I-GDM group ratio was 1.25 (p < 0.001), which normalised by 28 weeks becoming similar to control ratios. CONCLUSIONS: DM, independent of BMI, was associated with higher abdominal and mid-thigh fat accretion in fetuses. Use of insulin improved central to peripheral fat ratios in fetuses of GDM mothers.

Assuntos

Diabetes Gestacional , Tecido Adiposo/diagnóstico por imagem , Índice de Massa Corporal , Feminino , Feto/diagnóstico por imagem , Idade Gestacional , Humanos , Insulina , Obesidade/complicações , Gravidez , Estudos Retrospectivos , Aumento de Peso

2.

spicyR: spatial analysis of in situ cytometry data in R.

Canete, Nicolas P; Iyengar, Sourish S; Ormerod, John T; Baharlou, Heeva; Harman, Andrew N; Patrick, Ellis.

Bioinformatics ; 38(11): 3099-3105, 2022 05 26.

Artigo em Inglês | MEDLINE | ID: mdl-35438129

RESUMO

MOTIVATION: High parameter histological techniques have allowed for the identification of a variety of distinct cell types within an image, providing a comprehensive overview of the tissue environment. This allows the complex cellular architecture and environment of diseased tissue to be explored. While spatial analysis techniques have revealed how cell-cell interactions are important within the disease pathology, there remains a gap in exploring changes in these interactions within the disease process. Specifically, there are currently few established methods for performing inference on cell-type co-localization changes across images, hindering an understanding of how cellular environments change with a disease pathology. RESULTS: We have developed the spicyR R package to perform inference on changes in the spatial co-localization of types across groups of images. Application to simulated data demonstrates a high sensitivity and specificity. We the utility of spicyR by applying it to a type 1 diabetes imaging mass cytometry dataset, revealing changes in cellular associations that were relevant to the disease progression. Ultimately, spicyR allows changes in cellular environments to be explored under different pathologies or disease states. AVAILABILITY AND IMPLEMENTATION: R package is freely available at http://bioconductor.org/packages/release/bioc/html/spicyR.html and shiny app implementation at http://shiny.maths.usyd.edu.au/spicyR/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Assuntos

Software , Análise Espacial

3.

scREMOTE: Using multimodal single cell data to predict regulatory gene relationships and to build a computational cell reprogramming model.

Tran, Andy; Yang, Pengyi; Yang, Jean Y H; Ormerod, John T.

NAR Genom Bioinform ; 4(1): lqac023, 2022 Mar.

Artigo em Inglês | MEDLINE | ID: mdl-35300460

RESUMO

Cell reprogramming offers a potential treatment to many diseases, by regenerating specialized somatic cells. Despite decades of research, discovering the transcription factors that promote cell reprogramming has largely been accomplished through trial and error, a time-consuming and costly method. A computational model for cell reprogramming, however, could guide the hypothesis formulation and experimental validation, to efficiently utilize time and resources. Current methods often cannot account for the heterogeneity observed in cell reprogramming, or they only make short-term predictions, without modelling the entire reprogramming process. Here, we present scREMOTE, a novel computational model for cell reprogramming that leverages single cell multiomics data, enabling a more holistic view of the regulatory mechanisms at cellular resolution. This is achieved by first identifying the regulatory potential of each transcription factor and gene to uncover regulatory relationships, then a regression model is built to estimate the effect of transcription factor perturbations. We show that scREMOTE successfully predicts the long-term effect of overexpressing two key transcription factors in hair follicle development by capturing higher-order gene regulations. Together, this demonstrates that integrating the multimodal processes governing gene regulation creates a more accurate model for cell reprogramming with significant potential to accelerate research in regenerative medicine.

4.

scDC: single cell differential composition analysis.

Cao, Yue; Lin, Yingxin; Ormerod, John T; Yang, Pengyi; Yang, Jean Y H; Lo, Kitty K.

BMC Bioinformatics ; 20(Suppl 19): 721, 2019 Dec 24.

Artigo em Inglês | MEDLINE | ID: mdl-31870280

RESUMO

BACKGROUND: Differences in cell-type composition across subjects and conditions often carry biological significance. Recent advancements in single cell sequencing technologies enable cell-types to be identified at the single cell level, and as a result, cell-type composition of tissues can now be studied in exquisite detail. However, a number of challenges remain with cell-type composition analysis - none of the existing methods can identify cell-type perfectly and variability related to cell sampling exists in any single cell experiment. This necessitates the development of method for estimating uncertainty in cell-type composition. RESULTS: We developed a novel single cell differential composition (scDC) analysis method that performs differential cell-type composition analysis via bootstrap resampling. scDC captures the uncertainty associated with cell-type proportions of each subject via bias-corrected and accelerated bootstrap confidence intervals. We assessed the performance of our method using a number of simulated datasets and synthetic datasets curated from publicly available single cell datasets. In simulated datasets, scDC correctly recovered the true cell-type proportions. In synthetic datasets, the cell-type compositions returned by scDC were highly concordant with reference cell-type compositions from the original data. Since the majority of datasets tested in this study have only 2 to 5 subjects per condition, the addition of confidence intervals enabled better comparisons of compositional differences between subjects and across conditions. CONCLUSIONS: scDC is a novel statistical method for performing differential cell-type composition analysis for scRNA-seq data. It uses bootstrap resampling to estimate the standard errors associated with cell-type proportion estimates and performs significance testing through GLM and GLMM models. We have made this method available to the scientific community as part of the scdney package (Single Cell Data Integrative Analysis) R package, available from https://github.com/SydneyBioX/scdney.

Assuntos

Análise de Célula Única/métodos , Humanos

5.

scMerge leverages factor analysis, stable expression, and pseudoreplication to merge multiple single-cell RNA-seq datasets.

Lin, Yingxin; Ghazanfar, Shila; Wang, Kevin Y X; Gagnon-Bartsch, Johann A; Lo, Kitty K; Su, Xianbin; Han, Ze-Guang; Ormerod, John T; Speed, Terence P; Yang, Pengyi; Yang, Jean Yee Hwa.

Proc Natl Acad Sci U S A ; 116(20): 9775-9784, 2019 05 14.

Artigo em Inglês | MEDLINE | ID: mdl-31028141

RESUMO

Concerted examination of multiple collections of single-cell RNA sequencing (RNA-seq) data promises further biological insights that cannot be uncovered with individual datasets. Here we present scMerge, an algorithm that integrates multiple single-cell RNA-seq datasets using factor analysis of stably expressed genes and pseudoreplicates across datasets. Using a large collection of public datasets, we benchmark scMerge against published methods and demonstrate that it consistently provides improved cell type separation by removing unwanted factors; scMerge can also enhance biological discovery through robust data integration, which we show through the inference of development trajectory in a liver dataset collection.

Assuntos

Metanálise como Assunto , Análise de Sequência de RNA , Análise de Célula Única , Software , Algoritmos , Animais , Desenvolvimento Embrionário , Análise Fatorial , Expressão Gênica , Humanos , Camundongos

6.

AdaSampling for Positive-Unlabeled and Label Noise Learning With Bioinformatics Applications.

Yang, Pengyi; Ormerod, John T; Liu, Wei; Ma, Chendong; Zomaya, Albert Y; Yang, Jean Y H.

IEEE Trans Cybern ; 49(5): 1932-1943, 2019 May.

Artigo em Inglês | MEDLINE | ID: mdl-29993676

RESUMO

Class labels are required for supervised learning but may be corrupted or missing in various applications. In binary classification, for example, when only a subset of positive instances is labeled whereas the remaining are unlabeled, positive-unlabeled (PU) learning is required to model from both positive and unlabeled data. Similarly, when class labels are corrupted by mislabeled instances, methods are needed for learning in the presence of class label noise (LN). Here we propose adaptive sampling (AdaSampling), a framework for both PU learning and learning with class LN. By iteratively estimating the class mislabeling probability with an adaptive sampling procedure, the proposed method progressively reduces the risk of selecting mislabeled instances for model training and subsequently constructs highly generalizable models even when a large proportion of mislabeled instances is present in the data. We demonstrate the utilities of proposed methods using simulation and benchmark data, and compare them to alternative approaches that are commonly used for PU learning and/or learning with LN. We then introduce two novel bioinformatics applications where AdaSampling is used to: 1) identify kinase-substrates from mass spectrometry-based phosphoproteomics data and 2) predict transcription factor target genes by integrating various next-generation sequencing data.

Assuntos

Biologia Computacional/métodos , Aprendizado de Máquina , Proteínas , Algoritmos , Modelos Estatísticos , Fosfoproteínas/química , Fosfoproteínas/genética , Fosfoproteínas/metabolismo , Fosfotransferases/química , Fosfotransferases/genética , Fosfotransferases/metabolismo , Proteínas/química , Proteínas/genética , Proteínas/metabolismo , Fatores de Transcrição

7.

DCARS: differential correlation across ranked samples.

Ghazanfar, Shila; Strbenac, Dario; Ormerod, John T; Yang, Jean Y H; Patrick, Ellis.

Bioinformatics ; 35(5): 823-829, 2019 03 01.

Artigo em Inglês | MEDLINE | ID: mdl-30102408

RESUMO

MOTIVATION: Genes act as a system and not in isolation. Thus, it is important to consider coordinated changes of gene expression rather than single genes when investigating biological phenomena such as the aetiology of cancer. We have developed an approach for quantifying how changes in the association between pairs of genes may inform the outcome of interest called Differential Correlation across Ranked Samples (DCARS). Modelling gene correlation across a continuous sample ranking does not require the dichotomisation of samples into two distinct classes and can identify differences in gene correlation across early, mid or late stages of the outcome of interest. RESULTS: When we evaluated DCARS against the typical Fisher Z-transformation test for differential correlation, as well as a typical approach testing for interaction within a linear model, on real TCGA data, DCARS significantly ranked gene pairs containing known cancer genes more highly across several cancers. Similar results are found with our simulation study. DCARS was applied to 13 cancers datasets in TCGA, revealing several distinct relationships for which survival ranking was found to be associated with a change in correlation between genes. Furthermore, we demonstrated that DCARS can be used in conjunction with network analysis techniques to extract biological meaning from multi-layered and complex data. AVAILABILITY AND IMPLEMENTATION: DCARS R package and sample data are available at https://github.com/shazanfar/DCARS. Publicly available data from The Cancer Genome Atlas (TCGA) was used using the TCGABiolinks R package. Supplementary Files and DCARS R package is available at https://github.com/shazanfar/DCARS. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Assuntos

Neoplasias , Genoma , Humanos , Software

8.

A multi-step classifier addressing cohort heterogeneity improves performance of prognostic biomarkers in three cancer types.

Patrick, Ellis; Schramm, Sarah-Jane; Ormerod, John T; Scolyer, Richard A; Mann, Graham J; Mueller, Samuel; Yang, Jean Y H.

Oncotarget ; 8(2): 2807-2815, 2017 Jan 10.

Artigo em Inglês | MEDLINE | ID: mdl-27833072

RESUMO

Cancer research continues to highlight the extensive genetic diversity that exists both between and within tumors. This intrinsic heterogeneity poses one of the central challenges to predicting patient clinical outcome and the personalization of treatments. Despite progress in some individual tumor types, it is not yet possible to prospectively, accurately classify patients by expected survival. One hypothesis proposed to explain this is that the prognostic classifiers developed to date are insufficiently sensitive and specific; however it is also possible that patients are not equally easy to classify by any given biomarker. We demonstrate in a cohort of 45 AJCC stage III melanoma patients that clinico-pathologic biomarkers can identify those patients that are most likely to be misclassified by a molecular biomarker. The process of modelling the classifiability of patients was then replicated in a cohort of 49 stage II breast cancer patients and 53 stage III colon cancer patients. A multi-step procedure incorporating this information not only improved classification accuracy but also indicated the specific clinical attributes that had made classification problematic in each cohort. These findings show that, even when cohorts are of moderate size, including features that explain the patient-specific performance of a prognostic biomarker in a classification framework can improve the modelling and estimation of survival.

Assuntos

Biomarcadores Tumorais , Neoplasias/diagnóstico , Neoplasias/mortalidade , Biologia Computacional/métodos , Bases de Dados de Ácidos Nucleicos , Feminino , Perfilação da Expressão Gênica/métodos , Humanos , Melanoma/diagnóstico , Melanoma/genética , Melanoma/mortalidade , Metástase Neoplásica , Estadiamento de Neoplasias/métodos , Prognóstico

9.

Differential distribution improves gene selection stability and has competitive classification performance for patient survival.

Strbenac, Dario; Mann, Graham J; Yang, Jean Y H; Ormerod, John T.

Nucleic Acids Res ; 44(13): e119, 2016 07 27.

Artigo em Inglês | MEDLINE | ID: mdl-27190235

RESUMO

A consistent difference in average expression level, often referred to as differential expression (DE), has long been used to identify genes useful for classification. However, recent cancer studies have shown that when transcription factors or epigenetic signals become deregulated, a change in expression variability (DV) of target genes is frequently observed. This suggests that assessing the importance of genes by either differential expression or variability alone potentially misses sets of important biomarkers that could lead to improved predictions and treatments. Here, we describe a new approach for assessing the importance of genes based on differential distribution (DD), which combines information from differential expression and differential variability into a unified metric. We show that feature ranking and selection stability based on DD can perform two to three times better than DE or DV alone, and that DD yields equivalent error rates to DE and DV. Finally, assessing genes via differential distribution produces a complementary set of selected genes to DE and DV, potentially opening up new categories of biomarkers.

Assuntos

Biomarcadores Tumorais/genética , Regulação Neoplásica da Expressão Gênica/genética , Melanoma/genética , Análise de Sequência com Séries de Oligonucleotídeos/métodos , Adenocarcinoma/genética , Adenocarcinoma/patologia , Adenocarcinoma de Pulmão , Algoritmos , Biomarcadores Tumorais/biossíntese , Feminino , Perfilação da Expressão Gênica/métodos , Humanos , Neoplasias Pulmonares/genética , Neoplasias Pulmonares/patologia , Melanoma/patologia , Neoplasias Ovarianas/genética , Neoplasias Ovarianas/patologia

10.

A Variational Bayes Approach to the Analysis of Occupancy Models.

Clark, Allan E; Altwegg, Res; Ormerod, John T.

PLoS One ; 11(2): e0148966, 2016.

Artigo em Inglês | MEDLINE | ID: mdl-26928878

RESUMO

Detection-nondetection data are often used to investigate species range dynamics using Bayesian occupancy models which rely on the use of Markov chain Monte Carlo (MCMC) methods to sample from the posterior distribution of the parameters of the model. In this article we develop two Variational Bayes (VB) approximations to the posterior distribution of the parameters of a single-season site occupancy model which uses logistic link functions to model the probability of species occurrence at sites and of species detection probabilities. This task is accomplished through the development of iterative algorithms that do not use MCMC methods. Simulations and small practical examples demonstrate the effectiveness of the proposed technique. We specifically show that (under certain circumstances) the variational distributions can provide accurate approximations to the true posterior distributions of the parameters of the model when the number of visits per site (K) are as low as three and that the accuracy of the approximations improves as K increases. We also show that the methodology can be used to obtain the posterior distribution of the predictive distribution of the proportion of sites occupied (PAO).

Assuntos

Algoritmos , Teorema de Bayes

11.

Integrated single cell data analysis reveals cell specific networks and novel coactivation markers.

Ghazanfar, Shila; Bisogni, Adam J; Ormerod, John T; Lin, David M; Yang, Jean Y H.

BMC Syst Biol ; 10(Suppl 5): 127, 2016 Dec 05.

Artigo em Inglês | MEDLINE | ID: mdl-28105940

RESUMO

BACKGROUND: Large scale single cell transcriptome profiling has exploded in recent years and has enabled unprecedented insight into the behavior of individual cells. Identifying genes with high levels of expression using data from single cell RNA sequencing can be useful to characterize very active genes and cells in which this occurs. In particular single cell RNA-Seq allows for cell-specific characterization of high gene expression, as well as gene coexpression. RESULTS: We offer a versatile modeling framework to identify transcriptional states as well as structures of coactivation for different neuronal cell types across multiple datasets. We employed a gamma-normal mixture model to identify active gene expression across cells, and used these to characterize markers for olfactory sensory neuron cell maturity, and to build cell-specific coactivation networks. We found that combined analysis of multiple datasets results in more known maturity markers being identified, as well as pointing towards some novel genes that may be involved in neuronal maturation. We also observed that the cell-specific coactivation networks of mature neurons tended to have a higher centralization network measure than immature neurons. CONCLUSION: Integration of multiple datasets promises to bring about more statistical power to identify genes and patterns of interest. We found that transforming the data into active and inactive gene states allowed for more direct comparison of datasets, leading to identification of maturity marker genes and cell-specific network observations, taking into account the unique characteristics of single cell transcriptomics data.

Assuntos

Biologia Computacional/métodos , Perfilação da Expressão Gênica , Marcadores Genéticos/genética , Neurônios/citologia , Neurônios/metabolismo , Análise de Célula Única , Ativação Transcricional , Redes Reguladoras de Genes , Neurônios Receptores Olfatórios/citologia , Neurônios Receptores Olfatórios/metabolismo

12.

ClassifyR: an R package for performance assessment of classification with applications to transcriptomics.

Strbenac, Dario; Mann, Graham J; Ormerod, John T; Yang, Jean Y H.

Bioinformatics ; 31(11): 1851-3, 2015 Jun 01.

Artigo em Inglês | MEDLINE | ID: mdl-25644269

RESUMO

UNLABELLED: Although a large collection of classification software packages exist in R, a new generic framework for linking custom classification functions with classification performance measures is needed. A generic classification framework has been designed and implemented as an R package in an object oriented style. Its design places emphasis on parallel processing, reproducibility and extensibility. Finally, a comprehensive set of performance measures are available to ease post-processing. Taken together, these important characteristics enable rapid and reproducible benchmarking of alternative classifiers. AVAILABILITY AND IMPLEMENTATION: ClassifyR is implemented in R and can be obtained from the Bioconductor project: http://bioconductor.org/packages/release/bioc/html/ClassifyR.html.

Assuntos

Perfilação da Expressão Gênica , Software , Classificação/métodos , Humanos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA