Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 16 de 16
Filter
Add more filters










Publication year range
1.
HGG Adv ; 5(3): 100302, 2024 May 03.
Article in English | MEDLINE | ID: mdl-38704641

ABSTRACT

Polygenic scores (PGSs) summarize the combined effect of common risk variants and are associated with breast cancer risk in patients without identifiable monogenic risk factors. One of the most well-validated PGSs in breast cancer to date is PGS313, which was developed from a Northern European biobank but has shown attenuated performance in non-European ancestries. We further investigate the generalizability of the PGS313 for American women of European (EA), African (AFR), Asian (EAA), and Latinx (HL) ancestry within one institution with a singular electronic health record (EHR) system, genotyping platform, and quality control process. We found that the PGS313 achieved overlapping areas under the receiver operator characteristic (ROC) curve (AUCs) in females of HL (AUC = 0.68, 95% confidence interval [CI] = 0.65-0.71) and EA ancestry (AUC = 0.70, 95% CI = 0.69-0.71) but lower AUCs for the AFR and EAA populations (AFR: AUC = 0.61, 95% CI = 0.56-0.65; EAA: AUC = 0.64, 95% CI = 0.60-0.680). While PGS313 is associated with hormone-receptor-positive (HR+) disease in EA Americans (odds ratio [OR] = 1.42, 95% CI = 1.16-1.64), this association is lost in African, Latinx, and Asian Americans. In summary, we found that PGS313 was significantly associated with breast cancer but with attenuated accuracy in women of AFR and EAA descent within a singular health system in Los Angeles. Our work further highlights the need for additional validation in diverse cohorts prior to the clinical implementation of PGSs.

2.
Genome Biol Evol ; 16(3)2024 03 02.
Article in English | MEDLINE | ID: mdl-38513111

ABSTRACT

Spermatogenesis is critical to sexual reproduction yet evolves rapidly in many organisms. High-throughput single-cell transcriptomics promises unparalleled insight into this important process but understanding can be impeded in nonmodel systems by a lack of known genes that can reliably demarcate biologically meaningful cell populations. Tribolium castaneum, the red flour beetle, lacks known markers for spermatogenesis found in insect species like Drosophila melanogaster. Using single-cell sequencing data collected from adult beetle testes, we implement a strategy for elucidating biologically meaningful cell populations by using transient expression stage identification markers, weighted principal component clustering, and SNP-based haploid/diploid phasing. We identify populations that correspond to observable points in sperm differentiation and find species specific markers for each stage. Our results indicate that molecular pathways underlying spermatogenesis in Coleoptera are substantially diverged from those in Diptera. We also show that most genes on the X chromosome experience meiotic sex chromosome inactivation. Temporal expression of Drosophila MSL complex homologs coupled with spatial analysis of potential chromatin entry sites further suggests that the dosage compensation machinery may mediate escape from meiotic sex chromosome inactivation and postmeiotic reactivation of the X chromosome.


Subject(s)
Coleoptera , Tribolium , Animals , Male , Tribolium/genetics , Drosophila melanogaster/genetics , Single-Cell Gene Expression Analysis , Semen , Sex Chromosomes , Spermatogenesis/genetics , Drosophila/genetics , Coleoptera/genetics
3.
PLoS One ; 19(3): e0299549, 2024.
Article in English | MEDLINE | ID: mdl-38489336

ABSTRACT

Insulin secretion from pancreatic ß-cells is integral in maintaining the delicate equilibrium of blood glucose levels. Calcium is known to be a key regulator and triggers the release of insulin. This sub-cellular process can be monitored and tracked through live-cell imaging and subsequent cell segmentation, registration, tracking, and analysis of the calcium level in each cell. Current methods of analysis typically require the manual outlining of ß-cells, involve multiple software packages, and necessitate multiple researchers-all of which tend to introduce biases. Utilizing deep learning algorithms, we have therefore created a pipeline to automatically segment and track thousands of cells, which greatly reduces the time required to gather and analyze a large number of sub-cellular images and improve accuracy. Tracking cells over a time-series image stack also allows researchers to isolate specific calcium spiking patterns and spatially identify those of interest, creating an efficient and user-friendly analysis tool. Using our automated pipeline, a previous dataset used to evaluate changes in calcium spiking activity in ß-cells post-electric field stimulation was reanalyzed. Changes in spiking activity were found to be underestimated previously with manual segmentation. Moreover, the machine learning pipeline provides a powerful and rapid computational approach to examine, for example, how calcium signaling is regulated by intracellular interactions.


Subject(s)
Calcium , Image Processing, Computer-Assisted , Image Processing, Computer-Assisted/methods , Algorithms , Computers , Microscopy, Fluorescence/methods
4.
J Comput Biol ; 30(7): 766-782, 2023 07.
Article in English | MEDLINE | ID: mdl-37437088

ABSTRACT

The development of tools for the annotation of genes from newly sequenced species has not evolved much from homologous alignment to prior annotated species. While the quality of gene annotations continues to decline as we sequence and assemble more evolutionary distant gut microbiome species, machine learning presents a high quality alternative to traditional techniques. In this study, we investigate the relative performance of common classical and nonclassical machine learning algorithms in the problem of gene annotation using human microbiome-associated species genes from the KEGG database. The majority of the ensemble, clustering, and deep learning algorithms that we investigated showed higher prediction accuracy than CD-Hit in predicting partial KEGG function. Motif-based, machine-learning methods of annotation in new species were faster and had higher precision-recall than methods of homologous alignment or orthologous gene clustering. Gradient boosted ensemble methods and neural networks also predicted higher connectivity in reconstructed KEGG pathways, finding twice as many new pathway interactions than blast alignment. The use of motif-based, machine-learning algorithms in annotation software will allow researchers to develop powerful tools to interact with bacterial microbiomes in ways previously unachievable through homologous sequence alignment alone.


Subject(s)
Algorithms , Genes, Microbial , Humans , Molecular Sequence Annotation , Neural Networks, Computer , Machine Learning
5.
bioRxiv ; 2023 Apr 06.
Article in English | MEDLINE | ID: mdl-37066375

ABSTRACT

Insulin secretion from pancreatic ß-cells is integral in maintaining the delicate equilibrium of blood glucose levels. Calcium is known to be a key regulator and triggers the release of insulin. This sub-cellular process can be monitored and tracked through live-cell imaging and subsequent cell segmentation, registration, tracking, and analysis of the calcium level in each cell. Current methods of analysis typically require the manual outlining of ß-cells, involve multiple software packages, and necessitate multiple researchers - all of which tend to introduce biases. Utilizing deep learning algorithms, we have therefore created a pipeline to automatically segment and track thousands of cells, which greatly reduces the time required to gather and analyze a large number of sub-cellular images and improve accuracy. Tracking cells over a time-series image stack also allows researchers to isolate specific calcium spiking patterns and spatially identify those of interest, creating an efficient and user-friendly analysis tool. Using our automated pipeline, a previous dataset used to evaluate changes in calcium spiking activity in ß-cells post-electric field stimulation was reanalyzed. Changes in spiking activity were found to be underestimated previously with manual segmentation. Moreover, the machine learning pipeline provides a powerful and rapid computational approach to examine, for example, how calcium signaling is regulated by intracellular interactions in a cluster of ß-cells.

6.
Nature ; 594(7862): 234-239, 2021 06.
Article in English | MEDLINE | ID: mdl-33981035

ABSTRACT

Loss of gut microbial diversity1-6 in industrial populations is associated with chronic diseases7, underscoring the importance of studying our ancestral gut microbiome. However, relatively little is known about the composition of pre-industrial gut microbiomes. Here we performed a large-scale de novo assembly of microbial genomes from palaeofaeces. From eight authenticated human palaeofaeces samples (1,000-2,000 years old) with well-preserved DNA from southwestern USA and Mexico, we reconstructed 498 medium- and high-quality microbial genomes. Among the 181 genomes with the strongest evidence of being ancient and of human gut origin, 39% represent previously undescribed species-level genome bins. Tip dating suggests an approximate diversification timeline for the key human symbiont Methanobrevibacter smithii. In comparison to 789 present-day human gut microbiome samples from eight countries, the palaeofaeces samples are more similar to non-industrialized than industrialized human gut microbiomes. Functional profiling of the palaeofaeces samples reveals a markedly lower abundance of antibiotic-resistance and mucin-degrading genes, as well as enrichment of mobile genetic elements relative to industrial gut microbiomes. This study facilitates the discovery and characterization of previously undescribed gut microorganisms from ancient microbiomes and the investigation of the evolutionary history of the human gut microbiota through genome reconstruction from palaeofaeces.


Subject(s)
Bacteria/isolation & purification , Biodiversity , Biological Evolution , Feces/microbiology , Gastrointestinal Microbiome , Genome, Bacterial/genetics , Host Microbial Interactions , Anti-Bacterial Agents/administration & dosage , Bacteria/classification , Bacteria/genetics , Chronic Disease , Developed Countries , Developing Countries , Diet, Western , History, Ancient , Humans , Industrial Development/trends , Methanobrevibacter/classification , Methanobrevibacter/genetics , Methanobrevibacter/isolation & purification , Mexico , Sedentary Behavior , Southwestern United States , Species Specificity , Symbiosis
7.
J Exp Med ; 218(4)2021 04 05.
Article in English | MEDLINE | ID: mdl-33651880

ABSTRACT

The ability to monitor anti-tumor CD8+ T cell responses in the blood has tremendous therapeutic potential. Here, we used paired single-cell RNA and TCR sequencing to detect and characterize "tumor-matching" (TM) CD8+ T cells in the blood of mice with MC38 tumors or melanoma patients using the TCR as a molecular barcode. TM cells showed increased activation compared with nonmatching T cells in blood and were less exhausted than matching cells in tumors. Importantly, PD-1, which has been used to identify putative circulating anti-tumor CD8+ T cells, showed poor sensitivity for identifying TM cells. By leveraging the transcriptome, we identified candidate cell surface markers for TM cells in mice and patients and validated NKG2D, CD39, and CX3CR1 in mice. These data show that the TCR can be used to identify tumor-relevant cells for characterization, reveal unique transcriptional properties of TM cells, and develop marker panels for tracking and analysis of these cells.


Subject(s)
Adenocarcinoma/immunology , CD8-Positive T-Lymphocytes/immunology , Colonic Neoplasms/immunology , Melanoma/blood , Melanoma/immunology , Single-Cell Analysis/methods , Skin Neoplasms/blood , Skin Neoplasms/immunology , Adenocarcinoma/pathology , Animals , Biomarkers, Tumor/genetics , Biomarkers, Tumor/metabolism , Cell Line, Tumor , Colonic Neoplasms/pathology , Female , Humans , Mice , Mice, Inbred C57BL , Programmed Cell Death 1 Receptor/genetics , Programmed Cell Death 1 Receptor/metabolism , Receptors, Antigen, T-Cell/genetics , Receptors, Antigen, T-Cell/metabolism , Transcriptome
8.
J Exp Med ; 217(9)2020 09 07.
Article in English | MEDLINE | ID: mdl-32539073

ABSTRACT

Tumor-infiltrating CD8+ T cells mediate antitumor immune responses. However, the mechanisms by which T cells remain poised to kill cancer cells despite expressing high levels of inhibitory receptors are unknown. Here, we report that layilin, a C-type lectin domain-containing membrane glycoprotein, is selectively expressed on highly activated, clonally expanded, but phenotypically exhausted CD8+ T cells in human melanoma. Lineage-specific deletion of layilin on murine CD8+ T cells reduced their accumulation in tumors and increased tumor growth in vivo. Congruently, gene editing of LAYN in human CD8+ T cells reduced direct tumor cell killing ex vivo. On a molecular level, layilin colocalized with integrin αLß2 (LFA-1) on T cells, and cross-linking layilin promoted the activated state of this integrin. Accordingly, LAYN deletion resulted in attenuated LFA-1-dependent cellular adhesion. Collectively, our results identify layilin as part of a molecular pathway in which exhausted or "dysfunctional" CD8+ T cells enhance cellular adhesiveness to maintain their cytotoxic potential.


Subject(s)
Carrier Proteins/metabolism , Immunity , Integrins/metabolism , Lectins, C-Type/metabolism , Membrane Glycoproteins/metabolism , Neoplasms/immunology , Animals , CD8-Positive T-Lymphocytes/immunology , Cell Adhesion , Cell Proliferation , Clone Cells , Cytokines/biosynthesis , Cytotoxicity, Immunologic , Gene Editing , Humans , Lymphocyte Activation/immunology , Lymphocyte Function-Associated Antigen-1/metabolism , Lymphocytes, Tumor-Infiltrating/immunology , Melanoma/immunology , Melanoma/pathology , Mice, Inbred C57BL , Neoplasm Metastasis , Neoplasms/pathology , Protein Binding , Talin/metabolism
9.
PLoS Comput Biol ; 16(5): e1007895, 2020 05.
Article in English | MEDLINE | ID: mdl-32392251

ABSTRACT

The microbiome is a new frontier for building predictors of human phenotypes. However, machine learning in the microbiome is fraught with issues of reproducibility, driven in large part by the wide range of analytic models and metagenomic data types available. We aimed to build robust metagenomic predictors of host phenotype by comparing prediction performances and biological interpretation across 8 machine learning methods and 4 different types of metagenomic data. Using 1,570 samples from 300 infants, we fit 7,865 models for 6 host phenotypes. We demonstrate the dependence of accuracy on algorithm choice and feature definition in microbiome data and propose a framework for building microbiome-derived indicators of host phenotype. We additionally identify biological features predictive of age, sex, breastfeeding status, historical antibiotic usage, country of origin, and delivery type. Our complete results can be viewed at http://apps.chiragjpgroup.org/ubiome_predictions/.


Subject(s)
Anti-Bacterial Agents/administration & dosage , Breast Feeding , Machine Learning , Metagenomics , Algorithms , Female , Geography , Humans , Infant , Male , Models, Theoretical
10.
Cell Host Microbe ; 26(2): 283-295.e8, 2019 08 14.
Article in English | MEDLINE | ID: mdl-31415755

ABSTRACT

Despite substantial interest in the species diversity of the human microbiome and its role in disease, the scale of its genetic diversity, which is fundamental to deciphering human-microbe interactions, has not been quantified. Here, we conducted a cross-study meta-analysis of metagenomes from two human body niches, the mouth and gut, covering 3,655 samples from 13 studies. We found staggering genetic heterogeneity in the dataset, identifying a total of 45,666,334 non-redundant genes (23,961,508 oral and 22,254,436 gut) at the 95% identity level. Fifty percent of all genes were "singletons," or unique to a single metagenomic sample. Singletons were enriched for different functions (compared with non-singletons) and arose from sub-population-specific microbial strains. Overall, these results provide potential bases for the unexplained heterogeneity observed in microbiome-derived human phenotypes. One the basis of these data, we built a resource, which can be accessed at https://microbial-genes.bio.


Subject(s)
Metagenome/genetics , Microbiota/genetics , Microbiota/physiology , Bacteria/classification , Bacteria/genetics , Biodiversity , Cluster Analysis , DNA Fingerprinting , Databases, Factual , Gastrointestinal Tract/microbiology , Genetic Heterogeneity , Host Microbial Interactions , Humans , Metagenomics , Mouth/microbiology , Multigene Family , Phenotype
11.
Nat Med ; 25(7): 1104-1109, 2019 07.
Article in English | MEDLINE | ID: mdl-31235964

ABSTRACT

The human gut microbiome is linked to many states of human health and disease1. The metabolic repertoire of the gut microbiome is vast, but the health implications of these bacterial pathways are poorly understood. In this study, we identify a link between members of the genus Veillonella and exercise performance. We observed an increase in Veillonella relative abundance in marathon runners postmarathon and isolated a strain of Veillonella atypica from stool samples. Inoculation of this strain into mice significantly increased exhaustive treadmill run time. Veillonella utilize lactate as their sole carbon source, which prompted us to perform a shotgun metagenomic analysis in a cohort of elite athletes, finding that every gene in a major pathway metabolizing lactate to propionate is at higher relative abundance postexercise. Using 13C3-labeled lactate in mice, we demonstrate that serum lactate crosses the epithelial barrier into the lumen of the gut. We also show that intrarectal instillation of propionate is sufficient to reproduce the increased treadmill run time performance observed with V. atypica gavage. Taken together, these studies reveal that V. atypica improves run time via its metabolic conversion of exercise-induced lactate into propionate, thereby identifying a natural, microbiome-encoded enzymatic process that enhances athletic performance.


Subject(s)
Athletes , Gastrointestinal Microbiome , Lactic Acid/metabolism , Metagenomics , Running , Veillonella/metabolism , Animals , Exercise , Humans , Mice , Mice, Inbred C57BL , Propionates/metabolism
12.
Sci Immunol ; 4(32)2019 02 01.
Article in English | MEDLINE | ID: mdl-30709844

ABSTRACT

HLA haplotypes in conjunction with serum anticommensal antibody responses are predictive of type 1 diabetes progression. See related Research Article by Paun et al.


Subject(s)
Diabetes Mellitus, Type 1 , Antibodies , Antibody Formation , Autoimmunity , Bacteria , Child , Humans
13.
Genome Biol ; 19(1): 125, 2018 08 24.
Article in English | MEDLINE | ID: mdl-30143029

ABSTRACT

We present HiGlass, an open source visualization tool built on web technologies that provides a rich interface for rapid, multiplex, and multiscale navigation of 2D genomic maps alongside 1D genomic tracks, allowing users to combine various data types, synchronize multiple visualization modalities, and share fully customizable views with others. We demonstrate its utility in exploring different experimental conditions, comparing the results of analyses, and creating interactive snapshots to share with collaborators and the broader public. HiGlass is accessible online at http://higlass.io and is also available as a containerized application that can be run on any platform.


Subject(s)
Chromosome Mapping , Genome , Internet , User-Computer Interface
14.
Gigascience ; 7(3): 1-8, 2018 03 01.
Article in English | MEDLINE | ID: mdl-29293960

ABSTRACT

Background: More extensive use of metagenomic shotgun sequencing in microbiome research relies on the development of high-throughput, cost-effective sequencing. Here we present a comprehensive evaluation of the performance of the new high-throughput sequencing platform BGISEQ-500 for metagenomic shotgun sequencing and compare its performance with that of 2 Illumina platforms. Findings: Using fecal samples from 20 healthy individuals, we evaluated the intra-platform reproducibility for metagenomic sequencing on the BGISEQ-500 platform in a setup comprising 8 library replicates and 8 sequencing replicates. Cross-platform consistency was evaluated by comparing 20 pairwise replicates on the BGISEQ-500 platform vs the Illumina HiSeq 2000 platform and the Illumina HiSeq 4000 platform. In addition, we compared the performance of the 2 Illumina platforms against each other. By a newly developed overall accuracy quality control method, an average of 82.45 million high-quality reads (96.06% of raw reads) per sample, with 90.56% of bases scoring Q30 and above, was obtained using the BGISEQ-500 platform. Quantitative analyses revealed extremely high reproducibility between BGISEQ-500 intra-platform replicates. Cross-platform replicates differed slightly more than intra-platform replicates, yet a high consistency was observed. Only a low percentage (2.02%-3.25%) of genes exhibited significant differences in relative abundance comparing the BGISEQ-500 and HiSeq platforms, with a bias toward genes with higher GC content being enriched on the HiSeq platforms. Conclusions: Our study provides the first set of performance metrics for human gut metagenomic sequencing data using BGISEQ-500. The high accuracy and technical reproducibility confirm the applicability of the new platform for metagenomic studies, though caution is still warranted when combining metagenomic data from different platforms.


Subject(s)
Bacteria/genetics , Gastrointestinal Microbiome/genetics , Metagenomics/methods , Sequence Analysis, DNA/methods , Bacteria/classification , Computational Biology , Feces/microbiology , High-Throughput Nucleotide Sequencing/methods , Humans
15.
Bioinformatics ; 34(9): 1565-1567, 2018 05 01.
Article in English | MEDLINE | ID: mdl-29228186

ABSTRACT

Motivation: Across biology, we are seeing rapid developments in scale of data production without a corresponding increase in data analysis capabilities. Results: Here, we present Aether (http://aether.kosticlab.org), an intuitive, easy-to-use, cost-effective and scalable framework that uses linear programming to optimally bid on and deploy combinations of underutilized cloud computing resources. Our approach simultaneously minimizes the cost of data analysis and provides an easy transition from users' existing HPC pipelines. Availability and implementation: Data utilized are available at https://pubs.broadinstitute.org/diabimmune and with EBI SRA accession ERP005989. Source code is available at (https://github.com/kosticlab/aether). Examples, documentation and a tutorial are available at http://aether.kosticlab.org. Contact: chirag_patel@hms.harvard.edu or aleksandar.kostic@joslin.harvard.edu. Supplementary information: Supplementary data are available at Bioinformatics online.


Subject(s)
Cloud Computing , Genomics/methods , Programming, Linear , Software
16.
Curr Biol ; 27(8): R307-R310, 2017 04 24.
Article in English | MEDLINE | ID: mdl-28441565

ABSTRACT

The human gut metagenome was recently discovered to encode vast collections of biosynthetic gene clusters with diverse chemical potential, almost none of which are yet functionally validated. Recent work elucidates common microbiome-derived biosynthetic gene clusters encoding peptide aldehydes that inhibit human proteases.


Subject(s)
Gastrointestinal Microbiome , Microbiota , Humans , Metagenome , Multigene Family , Peptide Hydrolases
SELECTION OF CITATIONS
SEARCH DETAIL
...