Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 9 de 9
Filtrar
1.
bioRxiv ; 2024 Jan 30.
Artigo em Inglês | MEDLINE | ID: mdl-38352454

RESUMO

Bacterial genome dynamics are vital for understanding the mechanisms underlying microbial adaptation, growth, and their broader impact on host phenotype. Structural variants (SVs), genomic alterations of 10 base pairs or more, play a pivotal role in driving evolutionary processes and maintaining genomic heterogeneity within bacterial populations. While SV detection in isolate genomes is relatively straightforward, metagenomes present broader challenges due to absence of clear reference genomes and presence of mixed strains. In response, our proposed method rhea, forgoes reference genomes and metagenome-assembled genomes (MAGs) by encompassing a single metagenome coassembly graph constructed from all samples in a series. The log fold change in graph coverage between subsequent samples is then calculated to call SVs that are thriving or declining throughout the series. We show rhea to outperform existing methods for SV and horizontal gene transfer (HGT) detection in two simulated mock metagenomes, which is particularly noticeable as the simulated reads diverge from reference genomes and an increase in strain diversity is incorporated. We additionally demonstrate use cases for rhea on series metagenomic data of environmental and fermented food microbiomes to detect specific sequence alterations between subsequent time and temperature samples, suggesting host advantage. Our innovative approach leverages raw read patterns rather than references or MAGs to include all sequencing reads in analysis, and thus provide versatility in studying SVs across diverse and poorly characterized microbial communities for more comprehensive insights into microbial genome dynamics.

2.
Bioinformatics ; 39(9)2023 09 02.
Artigo em Inglês | MEDLINE | ID: mdl-37632792

RESUMO

MOTIVATION: Model organisms are widely used to better understand the molecular causes of human disease. While sequence similarity greatly aids this cross-species transfer, sequence similarity does not imply functional similarity, and thus, several current approaches incorporate protein-protein interactions to help map findings between species. Existing transfer methods either formulate the alignment problem as a matching problem which pits network features against known orthology, or more recently, as a joint embedding problem. RESULTS: We propose a novel state-of-the-art joint embedding solution: Embeddings to Network Alignment (ETNA). ETNA generates individual network embeddings based on network topological structure and then uses a Natural Language Processing-inspired cross-training approach to align the two embeddings using sequence-based orthologs. The final embedding preserves both within and between species gene functional relationships, and we demonstrate that it captures both pairwise and group functional relevance. In addition, ETNA's embeddings can be used to transfer genetic interactions across species and identify phenotypic alignments, laying the groundwork for potential opportunities for drug repurposing and translational studies. AVAILABILITY AND IMPLEMENTATION: https://github.com/ylaboratory/ETNA.


Assuntos
Reposicionamento de Medicamentos , Mapas de Interação de Proteínas , Humanos , Processamento de Linguagem Natural
3.
Front Big Data ; 6: 1020173, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-36896444

RESUMO

We study p-Laplacians and spectral clustering for a recently proposed hypergraph model that incorporates edge-dependent vertex weights (EDVW). These weights can reflect different importance of vertices within a hyperedge, thus conferring the hypergraph model higher expressivity and flexibility. By constructing submodular EDVW-based splitting functions, we convert hypergraphs with EDVW into submodular hypergraphs for which the spectral theory is better developed. In this way, existing concepts and theorems such as p-Laplacians and Cheeger inequalities proposed under the submodular hypergraph setting can be directly extended to hypergraphs with EDVW. For submodular hypergraphs with EDVW-based splitting functions, we propose an efficient algorithm to compute the eigenvector associated with the second smallest eigenvalue of the hypergraph 1-Laplacian. We then utilize this eigenvector to cluster the vertices, achieving higher clustering accuracy than traditional spectral clustering based on the 2-Laplacian. More broadly, the proposed algorithm works for all submodular hypergraphs that are graph reducible. Numerical experiments using real-world data demonstrate the effectiveness of combining spectral clustering based on the 1-Laplacian and EDVW.

4.
Hum Factors ; : 187208221147341, 2022 Dec 22.
Artigo em Inglês | MEDLINE | ID: mdl-36562114

RESUMO

OBJECTIVE: We explore the relationships between objective communication patterns displayed during virtual team meetings and established, qualitative measures of team member effectiveness. BACKGROUND: A key component of teamwork is communication. Automated measures of objective communication patterns are becoming more feasible and offer the ability to measure and monitor communication in a scalable, consistent and continuous manner. However, their validity in reflecting meaningful measures of teamwork processes are not well established, especially in real-world settings. METHOD: We studied real-world virtual student teams working on semester-long projects. We captured virtual team meetings using the Zoom video conferencing platform throughout the semester and periodic surveys comprising peer ratings of team member effectiveness. Leveraging audio transcripts, we examined relationships between objective measures of speaking time, silence gap duration and vocal turn-taking and peer ratings of team member effectiveness. RESULTS: Speaking time, speaking turn count, degree centrality and (marginally) speaking turn duration, but not silence gap duration, were positively related to individual-level team member effectiveness. Time in dyadic interactions and interaction count, but not interaction length, were positively related to dyad-level team member effectiveness. CONCLUSION: Our study highlights the relevance of objective measures of speaking time and vocal turn-taking to team member effectiveness in virtual project-based teams, supporting the validity of these objective measures and their use in future research. APPLICATION: Our approach offers a scalable, easy-to-use method for measuring communication patterns and team member effectiveness in virtual teams and opens the opportunity to study these patterns in a more continuous and dynamic manner.

5.
Comput Struct Biotechnol J ; 20: 3208-3222, 2022.
Artigo em Inglês | MEDLINE | ID: mdl-35832621

RESUMO

Characterizing metagenomes via kmer-based, database-dependent taxonomic classification has yielded key insights into underlying microbiome dynamics. However, novel approaches are needed to track community dynamics and genomic flux within metagenomes, particularly in response to perturbations. We describe KOMB, a novel method for tracking genome level dynamics within microbiomes. KOMB utilizes K-core decomposition to identify Structural variations (SVs), specifically, population-level Copy Number Variation (CNV) within microbiomes. K-core decomposition partitions the graph into shells containing nodes of induced degree at least K, yielding reduced computational complexity compared to prior approaches. Through validation on a synthetic community, we show that KOMB recovers and profiles repetitive genomic regions in the sample. KOMB is shown to identify functionally-important regions in Human Microbiome Project datasets, and was used to analyze longitudinal data and identify keystone taxa in Fecal Microbiota Transplantation (FMT) samples. In summary, KOMB represents a novel graph-based, taxonomy-oblivious, and reference-free approach for tracking CNV within microbiomes. KOMB is open source and available for download at https://gitlab.com/treangenlab/komb.

6.
Genome Biol ; 23(1): 133, 2022 06 20.
Artigo em Inglês | MEDLINE | ID: mdl-35725628

RESUMO

The COVID-19 pandemic has emphasized the importance of accurate detection of known and emerging pathogens. However, robust characterization of pathogenic sequences remains an open challenge. To address this need we developed SeqScreen, which accurately characterizes short nucleotide sequences using taxonomic and functional labels and a customized set of curated Functions of Sequences of Concern (FunSoCs) specific to microbial pathogenesis. We show our ensemble machine learning model can label protein-coding sequences with FunSoCs with high recall and precision. SeqScreen is a step towards a novel paradigm of functionally informed synthetic DNA screening and pathogen characterization, available for download at www.gitlab.com/treangenlab/seqscreen .


Assuntos
Aprendizado de Máquina , Bactérias/genética , Bactérias/patogenicidade , COVID-19 , Humanos , Leucócitos Mononucleares/virologia , Fases de Leitura Aberta
7.
BMC Med Inform Decis Mak ; 20(Suppl 12): 327, 2020 12 24.
Artigo em Inglês | MEDLINE | ID: mdl-33357222

RESUMO

BACKGROUND: Sudden unexpected death in epilepsy (SUDEP) is a leading cause of premature death in patients with epilepsy. If timely assessment of SUDEP risk can be made, early interventions for optimized treatments might be provided. One of the biomarkers being investigated for SUDEP risk assessment is postictal generalized EEG suppression [postictal generalized EEG suppression (PGES)]. For example, prolonged PGES has been found to be associated with a higher risk for SUDEP. Accurate characterization of PGES requires correct identification of the end of PGES, which is often complicated due to signal noise and artifacts, and has been reported to be a difficult task even for trained clinical professionals. In this work we present a method for automatic detection of the end of PGES using multi-channel EEG recordings, thus enabling the downstream task of SUDEP risk assessment by PGES characterization. METHODS: We address the detection of the end of PGES as a classification problem. Given a short EEG snippet, a trained model classifies whether it consists of the end of PGES or not. Scalp EEG recordings from a total of 134 patients with epilepsy are used for training a random forest based classification model. Various time-series based features are used to characterize the EEG signal for the classification task. The features that we have used are computationally inexpensive, making it suitable for real-time implementations and low-power solutions. The reference labels for classification are based on annotations by trained clinicians identifying the end of PGES in an EEG recording. RESULTS: We evaluated our classification model on an independent test dataset from 34 epileptic patients and obtained an AUreceiver operating characteristic (ROC) (area under the curve) of 0.84. We found that inclusion of multiple EEG channels is important for better classification results, possibly owing to the generalized nature of PGES. Of among the channels included in our analysis, the central EEG channels were found to provide the best discriminative representation for the detection of the end of PGES. CONCLUSION: Accurate detection of the end of PGES is important for PGES characterization and SUDEP risk assessment. In this work, we showed that it is feasible to automatically detect the end of PGES-otherwise difficult to detect due to EEG noise and artifacts-using time-series features derived from multi-channel EEG recordings. In future work, we will explore deep learning based models for improved detection and investigate the downstream task of PGES characterization for SUDEP risk assessment.


Assuntos
Epilepsia , Convulsões , Eletroencefalografia , Humanos , Convulsões/diagnóstico
8.
Neurology ; 89(13): 1373-1381, 2017 Sep 26.
Artigo em Inglês | MEDLINE | ID: mdl-28779011

RESUMO

OBJECTIVE: To apply network-based statistics to diffusion-weighted imaging tractography data and detect Alzheimer disease vs non-Alzheimer degeneration in the context of corticobasal syndrome. METHODS: In a cross-sectional design, pathology was confirmed by autopsy or a pathologically validated CSF total tau-to-ß-amyloid ratio (T-tau/Aß). Using structural MRI data, we identify association areas in fronto-temporo-parietal cortex with reduced gray matter density in corticobasal syndrome (n = 40) relative to age-matched controls (n = 40). Using these fronto-temporo-parietal regions of interest, we construct structural brain networks in clinically similar subgroups of individuals with Alzheimer disease (n = 21) or non-Alzheimer pathology (n = 19) by linking these regions by the number of white matter streamlines identified in a deterministic tractography analysis of diffusion tensor imaging data. We characterize these structural networks using 5 graph-based statistics, and assess their relative utility in classifying underlying pathology with leave-one-out cross-validation using a supervised support vector machine. RESULTS: Gray matter density poorly discriminates between Alzheimer disease and non-Alzheimer pathology subgroups with low sensitivity (57%) and specificity (52%). In contrast, a statistic of local network efficiency demonstrates very good discriminatory power, with 85% sensitivity and 84% specificity. CONCLUSIONS: Our results indicate that the underlying pathologic sources of corticobasal syndrome can be classified more accurately using graph theoretical statistics derived from patterns of white matter network organization in association cortex than by regional gray matter density alone. These results highlight the importance of a multimodal neuroimaging approach to diagnostic analyses of corticobasal syndrome.


Assuntos
Encéfalo/diagnóstico por imagem , Imagem de Tensor de Difusão , Interpretação de Imagem Assistida por Computador , Imageamento por Ressonância Magnética , Doenças Neurodegenerativas/classificação , Doenças Neurodegenerativas/diagnóstico por imagem , Idoso , Doença de Alzheimer/líquido cefalorraquidiano , Doença de Alzheimer/classificação , Doença de Alzheimer/diagnóstico por imagem , Doença de Alzheimer/patologia , Peptídeos beta-Amiloides/líquido cefalorraquidiano , Estudos de Coortes , Estudos Transversais , Diagnóstico Diferencial , Feminino , Substância Cinzenta/diagnóstico por imagem , Humanos , Masculino , Pessoa de Meia-Idade , Doenças Neurodegenerativas/líquido cefalorraquidiano , Doenças Neurodegenerativas/patologia , Sensibilidade e Especificidade , Máquina de Vetores de Suporte , Síndrome , Substância Branca/diagnóstico por imagem , Proteínas tau/líquido cefalorraquidiano
9.
Artigo em Inglês | MEDLINE | ID: mdl-18722805

RESUMO

Infrared spectroscopic studies of 1:1 and 1:2 complexes of lead(II) bromide and lead(II) iodide with 1,10-phenanthroline were reported. Vibrational assignments are made by comparison to reported spectra of the uncomplexed 1,10-phenanthroline molecule. Small shifts of the ligand vibrational bands are characteristic of the complexes.


Assuntos
Chumbo/química , Fenantrolinas/química , Brometos/química , Brometos/efeitos da radiação , Halogênios/química , Halogênios/efeitos da radiação , Iodetos/química , Iodetos/efeitos da radiação , Chumbo/efeitos da radiação , Luz , Modelos Biológicos , Fenantrolinas/efeitos da radiação , Fotoquímica , Espectrofotometria Infravermelho , Difração de Raios X
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...