Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 36
Filter
1.
Am J Hum Genet ; 111(5): 863-876, 2024 May 02.
Article in English | MEDLINE | ID: mdl-38565148

ABSTRACT

Copy number variants (CNVs) are significant contributors to the pathogenicity of rare genetic diseases and, with new innovative methods, can now reliably be identified from exome sequencing. Challenges still remain in accurate classification of CNV pathogenicity. CNV calling using GATK-gCNV was performed on exomes from a cohort of 6,633 families (15,759 individuals) with heterogeneous phenotypes and variable prior genetic testing collected at the Broad Institute Center for Mendelian Genomics of the Genomics Research to Elucidate the Genetics of Rare Diseases consortium and analyzed using the seqr platform. The addition of CNV detection to exome analysis identified causal CNVs for 171 families (2.6%). The estimated sizes of CNVs ranged from 293 bp to 80 Mb. The causal CNVs consisted of 140 deletions, 15 duplications, 3 suspected complex structural variants (SVs), 3 insertions, and 10 complex SVs, the latter two groups being identified by orthogonal confirmation methods. To classify CNV variant pathogenicity, we used the 2020 American College of Medical Genetics and Genomics/ClinGen CNV interpretation standards and developed additional criteria to evaluate allelic and functional data as well as variants on the X chromosome to further advance the framework. We interpreted 151 CNVs as likely pathogenic/pathogenic and 20 CNVs as high-interest variants of uncertain significance. Calling CNVs from existing exome data increases the diagnostic yield for individuals undiagnosed after standard testing approaches, providing a higher-resolution alternative to arrays at a fraction of the cost of genome sequencing. Our improvements to the classification approach advances the systematic framework to assess the pathogenicity of CNVs.


Subject(s)
DNA Copy Number Variations , Exome Sequencing , Exome , Rare Diseases , Humans , DNA Copy Number Variations/genetics , Rare Diseases/genetics , Rare Diseases/diagnosis , Exome/genetics , Male , Female , Cohort Studies , Genetic Testing/methods
2.
medRxiv ; 2024 Feb 07.
Article in English | MEDLINE | ID: mdl-38496558

ABSTRACT

Genes encoding long non-coding RNAs (lncRNAs) comprise a large fraction of the human genome, yet haploinsufficiency of a lncRNA has not been shown to cause a Mendelian disease. CHASERR is a highly conserved human lncRNA adjacent to CHD2-a coding gene in which de novo loss-of-function variants cause developmental and epileptic encephalopathy. Here we report three unrelated individuals each harboring an ultra-rare heterozygous de novo deletion in the CHASERR locus. We report similarities in severe developmental delay, facial dysmorphisms, and cerebral dysmyelination in these individuals, distinguishing them from the phenotypic spectrum of CHD2 haploinsufficiency. We demonstrate reduced CHASERR mRNA expression and corresponding increased CHD2 mRNA and protein in whole blood and patient-derived cell lines-specifically increased expression of the CHD2 allele in cis with the CHASERR deletion, as predicted from a prior mouse model of Chaserr haploinsufficiency. We show for the first time that de novo structural variants facilitated by Alu-mediated non-allelic homologous recombination led to deletion of a non-coding element (the lncRNA CHASERR) to cause a rare syndromic neurodevelopmental disorder. We also demonstrate that CHD2 has bidirectional dosage sensitivity in human disease. This work highlights the need to carefully evaluate other lncRNAs, particularly those upstream of genes associated with Mendelian disorders.

3.
Nat Commun ; 15(1): 32, 2024 01 02.
Article in English | MEDLINE | ID: mdl-38167262

ABSTRACT

Single-cell transcriptomics has become the definitive method for classifying cell types and states, and can be augmented with genotype information to improve cell lineage identification. Due to constraints of short-read sequencing, current methods to detect natural genetic barcodes often require cumbersome primer panels and early commitment to targets. Here we devise a flexible long-read sequencing workflow and analysis pipeline, termed nanoranger, that starts from intermediate single-cell cDNA libraries to detect cell lineage-defining features, including single-nucleotide variants, fusion genes, isoforms, sequences of chimeric antigen and TCRs. Through systematic analysis of these classes of natural 'barcodes', we define the optimal targets for nanoranger, namely those loci close to the 5' end of highly expressed genes with transcript lengths shorter than 4 kB. As proof-of-concept, we apply nanoranger to longitudinal tracking of subclones of acute myeloid leukemia (AML) and describe the heterogeneous isoform landscape of thousands of marrow-infiltrating immune cells. We propose that enhanced cellular genotyping using nanoranger can improve the tracking of single-cell tumor and immune cell co-evolution.


Subject(s)
High-Throughput Nucleotide Sequencing , Leukemia, Myeloid, Acute , Humans , Genotype , High-Throughput Nucleotide Sequencing/methods , Leukemia, Myeloid, Acute/genetics , Leukemia, Myeloid, Acute/pathology , Phenotype , Gene Expression Profiling/methods
4.
Nat Biotechnol ; 42(4): 582-586, 2024 Apr.
Article in English | MEDLINE | ID: mdl-37291427

ABSTRACT

Full-length RNA-sequencing methods using long-read technologies can capture complete transcript isoforms, but their throughput is limited. We introduce multiplexed arrays isoform sequencing (MAS-ISO-seq), a technique for programmably concatenating complementary DNAs (cDNAs) into molecules optimal for long-read sequencing, increasing the throughput >15-fold to nearly 40 million cDNA reads per run on the Sequel IIe sequencer. When applied to single-cell RNA sequencing of tumor-infiltrating T cells, MAS-ISO-seq demonstrated a 12- to 32-fold increase in the discovery of differentially spliced genes.


Subject(s)
High-Throughput Nucleotide Sequencing , RNA Isoforms , DNA, Complementary/genetics , RNA Isoforms/genetics , High-Throughput Nucleotide Sequencing/methods , Protein Isoforms/genetics , Sequence Analysis, RNA/methods , Transcriptome , Gene Expression Profiling/methods , RNA/genetics
5.
medRxiv ; 2023 Oct 05.
Article in English | MEDLINE | ID: mdl-37873196

ABSTRACT

Copy number variants (CNVs) are significant contributors to the pathogenicity of rare genetic diseases and with new innovative methods can now reliably be identified from exome sequencing. Challenges still remain in accurate classification of CNV pathogenicity. CNV calling using GATK-gCNV was performed on exomes from a cohort of 6,633 families (15,759 individuals) with heterogeneous phenotypes and variable prior genetic testing collected at the Broad Institute Center for Mendelian Genomics of the GREGoR consortium. Each family's CNV data was analyzed using the seqr platform and candidate CNVs classified using the 2020 ACMG/ClinGen CNV interpretation standards. We developed additional evidence criteria to address situations not covered by the current standards. The addition of CNV calling to exome analysis identified causal CNVs for 173 families (2.6%). The estimated sizes of CNVs ranged from 293 bp to 80 Mb with estimates that 44% would not have been detected by standard chromosomal microarrays. The causal CNVs consisted of 141 deletions, 15 duplications, 4 suspected complex structural variants (SVs), 3 insertions and 10 complex SVs, the latter two groups being identified by orthogonal validation methods. We interpreted 153 CNVs as likely pathogenic/pathogenic and 20 CNVs as high interest variants of uncertain significance. Calling CNVs from existing exome data increases the diagnostic yield for individuals undiagnosed after standard testing approaches, providing a higher resolution alternative to arrays at a fraction of the cost of genome sequencing. Our improvements to the classification approach advances the systematic framework to assess the pathogenicity of CNVs.

7.
Nat Hum Behav ; 7(9): 1502-1513, 2023 09.
Article in English | MEDLINE | ID: mdl-37386111

ABSTRACT

The spread of misinformation online is a global problem that requires global solutions. To that end, we conducted an experiment in 16 countries across 6 continents (N = 34,286; 676,605 observations) to investigate predictors of susceptibility to misinformation about COVID-19, and interventions to combat the spread of this misinformation. In every country, participants with a more analytic cognitive style and stronger accuracy-related motivations were better at discerning truth from falsehood; valuing democracy was also associated with greater truth discernment, whereas endorsement of individual responsibility over government support was negatively associated with truth discernment in most countries. Subtly prompting people to think about accuracy had a generally positive effect on the veracity of news that people were willing to share across countries, as did minimal digital literacy tips. Finally, aggregating the ratings of our non-expert participants was able to differentiate true from false headlines with high accuracy in all countries via the 'wisdom of crowds'. The consistent patterns we observe suggest that the psychological factors underlying the misinformation challenge are similar across different regional settings, and that similar solutions may be broadly effective.


Subject(s)
COVID-19 , Humans , Communication , Thinking , Motivation , Government
8.
bioRxiv ; 2023 May 04.
Article in English | MEDLINE | ID: mdl-37205567

ABSTRACT

Advances in long-read sequencing (LRS) technology continue to make whole-genome sequencing more complete, affordable, and accurate. LRS provides significant advantages over short-read sequencing approaches, including phased de novo genome assembly, access to previously excluded genomic regions, and discovery of more complex structural variants (SVs) associated with disease. Limitations remain with respect to cost, scalability, and platform-dependent read accuracy and the tradeoffs between sequence coverage and sensitivity of variant discovery are important experimental considerations for the application of LRS. We compare the genetic variant calling precision and recall of Oxford Nanopore Technologies (ONT) and PacBio HiFi platforms over a range of sequence coverages. For read-based applications, LRS sensitivity begins to plateau around 12-fold coverage with a majority of variants called with reasonable accuracy (F1 score above 0.5), and both platforms perform well for SV detection. Genome assembly increases variant calling precision and recall of SVs and indels in HiFi datasets with HiFi outperforming ONT in quality as measured by the F1 score of assembly-based variant callsets. While both technologies continue to evolve, our work offers guidance to design cost-effective experimental strategies that do not compromise on discovering novel biology.

9.
Proc Natl Acad Sci U S A ; 120(11): e2212270120, 2023 03 14.
Article in English | MEDLINE | ID: mdl-36877833

ABSTRACT

Recently, social media platforms are heavily moderated to prevent the spread of online hate speech, which is usually fertile in toxic words and is directed toward an individual or a community. Owing to such heavy moderation, newer and more subtle techniques are being deployed. One of the most striking among these is fear speech. Fear speech, as the name suggests, attempts to incite fear about a target community. Although subtle, it might be highly effective, often pushing communities toward a physical conflict. Therefore, understanding their prevalence in social media is of paramount importance. This article presents a large-scale study to understand the prevalence of 400K fear speech and over 700K hate speech posts collected from Gab.com. Remarkably, users posting a large number of fear speech accrue more followers and occupy more central positions in social networks than users posting a large number of hate speech. They can also reach out to benign users more effectively than hate speech users through replies, reposts, and mentions. This connects to the fact that, unlike hate speech, fear speech has almost zero toxic content, making it look plausible. Moreover, while fear speech topics mostly portray a community as a perpetrator using a (fake) chain of argumentation, hate speech topics hurl direct multitarget insults, thus pointing to why general users could be more gullible to fear speech. Our findings transcend even to other platforms (Twitter and Facebook) and thus necessitate using sophisticated moderation policies and mass awareness to combat fear speech.


Subject(s)
Social Media , Humans , Speech , Fear , Fertility , Hate
10.
Nat Methods ; 20(4): 559-568, 2023 04.
Article in English | MEDLINE | ID: mdl-36959322

ABSTRACT

Structural variants (SVs) are a major driver of genetic diversity and disease in the human genome and their discovery is imperative to advances in precision medicine. Existing SV callers rely on hand-engineered features and heuristics to model SVs, which cannot scale to the vast diversity of SVs nor fully harness the information available in sequencing datasets. Here we propose an extensible deep-learning framework, Cue, to call and genotype SVs that can learn complex SV abstractions directly from the data. At a high level, Cue converts alignments to images that encode SV-informative signals and uses a stacked hourglass convolutional neural network to predict the type, genotype and genomic locus of the SVs captured in each image. We show that Cue outperforms the state of the art in the detection of several classes of SVs on synthetic and real short-read data and that it can be easily extended to other sequencing platforms, while achieving competitive performance.


Subject(s)
Deep Learning , Software , Humans , Genotype , Cues , Genomic Structural Variation , Genome, Human
11.
Nat Commun ; 14(1): 126, 2023 01 09.
Article in English | MEDLINE | ID: mdl-36624092

ABSTRACT

Despite the availability of multiple safe vaccines, vaccine hesitancy may present a challenge to successful control of the COVID-19 pandemic. As with many human behaviors, people's vaccine acceptance may be affected by their beliefs about whether others will accept a vaccine (i.e., descriptive norms). However, information about these descriptive norms may have different effects depending on the actual descriptive norm, people's baseline beliefs, and the relative importance of conformity, social learning, and free-riding. Here, using a pre-registered, randomized experiment (N = 484,239) embedded in an international survey (23 countries), we show that accurate information about descriptive norms can increase intentions to accept a vaccine for COVID-19. We find mixed evidence that information on descriptive norms impacts mask wearing intentions and no statistically significant evidence that it impacts intentions to physically distance. The effects on vaccination intentions are largely consistent across the 23 included countries, but are concentrated among people who were otherwise uncertain about accepting a vaccine. Providing normative information in vaccine communications partially corrects individuals' underestimation of how many other people will accept a vaccine. These results suggest that presenting people with information about the widespread and growing acceptance of COVID-19 vaccines helps to increase vaccination intentions.


Subject(s)
COVID-19 Vaccines , COVID-19 , Humans , Intention , Pandemics , COVID-19/prevention & control , Vaccination
12.
Genome Res ; 33(12): 2029-2040, 2023 Dec 27.
Article in English | MEDLINE | ID: mdl-38190646

ABSTRACT

Advances in long-read sequencing (LRS) technologies continue to make whole-genome sequencing more complete, affordable, and accurate. LRS provides significant advantages over short-read sequencing approaches, including phased de novo genome assembly, access to previously excluded genomic regions, and discovery of more complex structural variants (SVs) associated with disease. Limitations remain with respect to cost, scalability, and platform-dependent read accuracy and the tradeoffs between sequence coverage and sensitivity of variant discovery are important experimental considerations for the application of LRS. We compare the genetic variant-calling precision and recall of Oxford Nanopore Technologies (ONT) and Pacific Biosciences (PacBio) HiFi platforms over a range of sequence coverages. For read-based applications, LRS sensitivity begins to plateau around 12-fold coverage with a majority of variants called with reasonable accuracy (F1 score above 0.5), and both platforms perform well for SV detection. Genome assembly increases variant-calling precision and recall of SVs and indels in HiFi data sets with HiFi outperforming ONT in quality as measured by the F1 score of assembly-based variant call sets. While both technologies continue to evolve, our work offers guidance to design cost-effective experimental strategies that do not compromise on discovering novel biology.


Subject(s)
Genomics , Nanopores , INDEL Mutation , Whole Genome Sequencing
13.
Proc ACM Web Sci Conf ; 2022: 359-363, 2022 Jun.
Article in English | MEDLINE | ID: mdl-36112977

ABSTRACT

Homeopathy is a medical system originating in Germany more than 200 years ago. Based on prior investigations, mainstream health agencies and medical research communities indicate that there is little evidence that homeopathy can be an effective treatment for any specific health condition. However, it continues to be practiced as a popular form of alternative medicine in many countries, even during the ongoing COVID-19 pandemic. In this paper, we mine opinions on homeopathy for COVID-19 expressed in Twitter data. Our experiments are conducted with a dataset of nearly 60K tweets collected during a seven month period ending in July 2020. We first built text classifiers (linear and neural models) to mine opinions on homeopathy (positive, negative, neutral) from tweets using a dataset of 2400 hand-labeled tweets obtaining an average macro F-score of 81.5% for the positive and negative classes. We applied this model to identify opinions from the full dataset. Our results show that the number of unique positive tweets is twice that of the number of unique negative tweets; but when including retweets, there are 23% more negative tweets overall indicating that negative tweets are getting more retweets and better traction on Twitter. Using a word shift graph analysis on the Twitter bios of authors of positive and negative tweets, we observe that opinions on homeopathy appear to be correlated with political/religious ideologies of the authors (e.g., liberal vs nationalist, atheist vs Hindu). To our knowledge, this is the first study to analyze public opinions on homeopathy on any social media platform. Our results surface a tricky landscape for public health agencies as they promote evidence-based therapies and preventative measures for COVID-19.

14.
Nature ; 608(7922): 353-359, 2022 08.
Article in English | MEDLINE | ID: mdl-35922509

ABSTRACT

Regulation of transcript structure generates transcript diversity and plays an important role in human disease1-7. The advent of long-read sequencing technologies offers the opportunity to study the role of genetic variation in transcript structure8-16. In this Article, we present a large human long-read RNA-seq dataset using the Oxford Nanopore Technologies platform from 88 samples from Genotype-Tissue Expression (GTEx) tissues and cell lines, complementing the GTEx resource. We identified just over 70,000 novel transcripts for annotated genes, and validated the protein expression of 10% of novel transcripts. We developed a new computational package, LORALS, to analyse the genetic effects of rare and common variants on the transcriptome by allele-specific analysis of long reads. We characterized allele-specific expression and transcript structure events, providing new insights into the specific transcript alterations caused by common and rare genetic variants and highlighting the resolution gained from long-read data. We were able to perturb the transcript structure upon knockdown of PTBP1, an RNA binding protein that mediates splicing, thereby finding genetic regulatory effects that are modified by the cellular environment. Finally, we used this dataset to enhance variant interpretation and study rare variants leading to aberrant splicing patterns.


Subject(s)
Alleles , Gene Expression Profiling , Organ Specificity , RNA-Seq , Transcriptome , Alternative Splicing/genetics , Cell Line , Datasets as Topic , Genotype , Heterogeneous-Nuclear Ribonucleoproteins/deficiency , Heterogeneous-Nuclear Ribonucleoproteins/genetics , Humans , Organ Specificity/genetics , Polypyrimidine Tract-Binding Protein/deficiency , Polypyrimidine Tract-Binding Protein/genetics , Reproducibility of Results , Transcriptome/genetics
15.
Genome Biol Evol ; 14(9)2022 09 06.
Article in English | MEDLINE | ID: mdl-35929770

ABSTRACT

The brown bear (Ursus arctos) is the second largest and most widespread extant terrestrial carnivore on Earth and has recently emerged as a medical model for human metabolic diseases. Here, we report a fully phased chromosome-level assembly of a male North American brown bear built by combining Pacific Biosciences (PacBio) HiFi data and publicly available Hi-C data. The final genome size is 2.47 Gigabases (Gb) with a scaffold and contig N50 length of 70.08 and 43.94 Megabases (Mb), respectively. Benchmarking Universal Single-Copy Ortholog (BUSCO) analysis revealed that 94.5% of single copy orthologs from Mammalia were present in the genome (the highest of any ursid genome to date). Repetitive elements accounted for 44.48% of the genome and a total of 20,480 protein coding genes were identified. Based on whole genome alignment to the polar bear, the brown bear is highly syntenic with the polar bear, and our phylogenetic analysis of 7,246 single-copy orthologs supports the currently proposed species tree for Ursidae. This highly contiguous genome assembly will support future research on both the evolutionary history of the bear family and the physiological mechanisms behind hibernation, the latter of which has broad medical implications.


Subject(s)
Ursidae , Animals , Chromosomes , Genome , Haplotypes , Humans , Phylogeny , Ursidae/genetics
16.
Nat Hum Behav ; 6(9): 1310-1317, 2022 09.
Article in English | MEDLINE | ID: mdl-35606513

ABSTRACT

Policy and communication responses to COVID-19 can benefit from better understanding of people's baseline and resulting beliefs, behaviours and norms. From July 2020 to March 2021, we fielded a global survey on these topics in 67 countries yielding over 2 million responses. This paper provides an overview of the motivation behind the survey design, details the sampling and weighting designed to make the results representative of populations of interest and presents some insights learned from the survey. Several studies have already used the survey data to analyse risk perception, attitudes towards mask wearing and other preventive behaviours, as well as trust in information sources across communities worldwide. This resource can open new areas of enquiry in public health, communication and economic policy by leveraging large-scale, rich survey datasets on beliefs, behaviours and norms during a global pandemic.


Subject(s)
COVID-19 , COVID-19/prevention & control , Humans , Pandemics/prevention & control , SARS-CoV-2 , Surveys and Questionnaires , Trust
17.
Genome Res ; 32(3): 569-582, 2022 03.
Article in English | MEDLINE | ID: mdl-35074858

ABSTRACT

Genomic databases of allele frequency are extremely helpful for evaluating clinical variants of unknown significance; however, until now, databases such as the Genome Aggregation Database (gnomAD) have focused on nuclear DNA and have ignored the mitochondrial genome (mtDNA). Here, we present a pipeline to call mtDNA variants that addresses three technical challenges: (1) detecting homoplasmic and heteroplasmic variants, present, respectively, in all or a fraction of mtDNA molecules; (2) circular mtDNA genome; and (3) misalignment of nuclear sequences of mitochondrial origin (NUMTs). We observed that mtDNA copy number per cell varied across gnomAD cohorts and influenced the fraction of NUMT-derived false-positive variant calls, which can account for the majority of putative heteroplasmies. To avoid false positives, we excluded contaminated samples, cell lines, and samples prone to NUMT misalignment due to few mtDNA copies. Furthermore, we report variants with heteroplasmy ≥10%. We applied this pipeline to 56,434 whole-genome sequences in the gnomAD v3.1 database that includes individuals of European (58%), African (25%), Latino (10%), and Asian (5%) ancestry. Our gnomAD v3.1 release contains population frequencies for 10,850 unique mtDNA variants at more than half of all mtDNA bases. Importantly, we report frequencies within each nuclear ancestral population and mitochondrial haplogroup. Homoplasmic variants account for most variant calls (98%) and unique variants (85%). We observed that 1/250 individuals carry a pathogenic mtDNA variant with heteroplasmy above 10%. These mtDNA population allele frequencies are freely accessible and will aid in diagnostic interpretation and research studies.


Subject(s)
DNA, Mitochondrial , Genome, Mitochondrial , Cell Nucleus/genetics , DNA, Mitochondrial/genetics , Gene Frequency , Genome , Humans , Mitochondria/genetics , Sequence Analysis, DNA
18.
Soc Netw Anal Min ; 12(1): 5, 2022.
Article in English | MEDLINE | ID: mdl-34804253

ABSTRACT

The spread of COVID-19 and the lockdowns that followed led to an increase in activity on online social networks. This has resulted in users sharing unfiltered and unreliable information on social networks like WhatsApp, Twitter, Facebook, etc. In this work, we give an extended overview of how Pakistan's population used public WhatsApp groups for sharing information related to the pandemic. Our work is based on a major effort to annotate thousands of text and image-based messages. We explore how information propagates across WhatsApp and the user behavior around it. Specifically, we look at political polarization and its impact on how users from different political parties shared COVID-19-related content. We also try to understand information dissemination across different social networks-Twitter and WhatsApp-in Pakistan and find that there is no significant bot involvement in spreading misinformation about the pandemic.

19.
J Exp Med ; 218(6)2021 06 07.
Article in English | MEDLINE | ID: mdl-33857290

ABSTRACT

Advances in genome sequencing have resulted in the identification of the causes for numerous rare diseases. However, many cases remain unsolved with standard molecular analyses. We describe a family presenting with a phenotype resembling inherited thrombocytopenia 2 (THC2). THC2 is generally caused by single nucleotide variants that prevent silencing of ANKRD26 expression during hematopoietic differentiation. Short-read whole-exome and genome sequencing approaches were unable to identify a causal variant in this family. Using long-read whole-genome sequencing, a large complex structural variant involving a paired-duplication inversion was identified. Through functional studies, we show that this structural variant results in a pathogenic gain-of-function WAC-ANKRD26 fusion transcript. Our findings illustrate how complex structural variants that may be missed by conventional genome sequencing approaches can cause human disease.


Subject(s)
Adaptor Proteins, Signal Transducing/genetics , Intercellular Signaling Peptides and Proteins/genetics , Polymorphism, Single Nucleotide/genetics , Thrombocytopenia/genetics , Adolescent , Adult , Aged , Cell Line , Cell Line, Tumor , Child , Chromosome Breakage , Chromosome Disorders/genetics , Exome/genetics , Female , HEK293 Cells , HeLa Cells , Humans , Male , Middle Aged , Mutation/genetics , Pedigree , Thrombocytopenia/congenital
20.
Nature ; 586(7828): 292-298, 2020 10.
Article in English | MEDLINE | ID: mdl-32999459

ABSTRACT

The RecQ DNA helicase WRN is a synthetic lethal target for cancer cells with microsatellite instability (MSI), a form of genetic hypermutability that arises from impaired mismatch repair1-4. Depletion of WRN induces widespread DNA double-strand breaks in MSI cells, leading to cell cycle arrest and/or apoptosis. However, the mechanism by which WRN protects MSI-associated cancers from double-strand breaks remains unclear. Here we show that TA-dinucleotide repeats are highly unstable in MSI cells and undergo large-scale expansions, distinct from previously described insertion or deletion mutations of a few nucleotides5. Expanded TA repeats form non-B DNA secondary structures that stall replication forks, activate the ATR checkpoint kinase, and require unwinding by the WRN helicase. In the absence of WRN, the expanded TA-dinucleotide repeats are susceptible to cleavage by the MUS81 nuclease, leading to massive chromosome shattering. These findings identify a distinct biomarker that underlies the synthetic lethal dependence on WRN, and support the development of therapeutic agents that target WRN for MSI-associated cancers.


Subject(s)
DNA Breaks, Double-Stranded , DNA Repeat Expansion/genetics , Dinucleotide Repeats/genetics , Neoplasms/genetics , Werner Syndrome Helicase/metabolism , Ataxia Telangiectasia Mutated Proteins/metabolism , Cell Line, Tumor , Chromosomes, Human/genetics , Chromosomes, Human/metabolism , Chromothripsis , DNA Cleavage , DNA Replication , DNA-Binding Proteins/metabolism , Endodeoxyribonucleases/metabolism , Endonucleases/metabolism , Genomic Instability , Humans , Recombinases/metabolism
SELECTION OF CITATIONS
SEARCH DETAIL
...