Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 13 de 13
Filter
1.
Genome Biol ; 20(1): 285, 2019 12 18.
Article in English | MEDLINE | ID: mdl-31849330

ABSTRACT

BACKGROUND: The lifelong accumulation of somatic mutations underlies age-related phenotypes and cancer. Mutagenic forces are thought to shape the genome of aging cells in a tissue-specific way. Whole genome analyses of somatic mutation patterns, based on both types and genomic distribution of variants, can shed light on specific processes active in different human tissues and their effect on the transition to cancer. RESULTS: To analyze somatic mutation patterns, we compile a comprehensive genetic atlas of somatic mutations in healthy human cells. High-confidence variants are obtained from newly generated and publicly available whole genome DNA sequencing data from single non-cancer cells, clonally expanded in vitro. To enable a well-controlled comparison of different cell types, we obtain single genome data (92% mean coverage) from multi-organ biopsies from the same donors. These data show multiple cell types that are protected from mutagens and display a stereotyped mutation profile, despite their origin from different tissues. Conversely, the same tissue harbors cells with distinct mutation profiles associated to different differentiation states. Analyses of mutation rate in the coding and non-coding portions of the genome identify a cell type bearing a unique mutation pattern characterized by mutation enrichment in active chromatin, regulatory, and transcribed regions. CONCLUSIONS: Our analysis of normal cells from healthy donors identifies a somatic mutation landscape that enhances the risk of tumor transformation in a specific cell population from the kidney proximal tubule. This unique pattern is characterized by high rate of mutation accumulation during adult life and specific targeting of expressed genes and regulatory regions.


Subject(s)
Mutagenesis , Neoplasms/etiology , Whole Genome Sequencing , Aged , Female , Humans
2.
Hum Mol Genet ; 28(16): 2675-2685, 2019 08 15.
Article in English | MEDLINE | ID: mdl-31216356

ABSTRACT

Alzheimer's disease (AD) is the most common neurodegenerative disease worldwide. Familial cases suggest genetic components; however, monogenetic causes are few, and the vast majority of incidences have unknown cause. Sequencing efforts have focused on germline mutations, but improved technology has opened up for studies on somatic mutations in affected brain tissue samples. Here we use ultra-deep sequencing on brain and blood from early-onset AD (EOAD) and late-onset AD (LOAD) patients and non-AD individuals (n = 16). In total, 2.86 Mb of genomic regions, previously associated with AD, were targeted included 28 genes and upstream and downstream regulatory regions. Tailored downstream bioinformatics filtering identified 11 somatic single nucleotide variants in the temporal cortex in AD patients and none in the controls. One variant was validated to be present at 0.4% allele frequency in temporal cortex of a LOAD patient. This variant was predicted to affect transcription factor binding sites upstream of the CD55 gene, contributing to AD pathogenesis by affecting the complement system. Our results suggest that future studies targeting larger portions of the genome for somatic mutation analysis are important to obtain an increased understanding for the molecular basis of both EOAD and LOAD.


Subject(s)
5' Untranslated Regions , Alzheimer Disease/genetics , Alzheimer Disease/metabolism , CD55 Antigens/genetics , CD55 Antigens/metabolism , Mutation , Temporal Lobe/metabolism , Transcription Factors/metabolism , Age of Onset , Aged , Alleles , Allelic Imbalance , Alzheimer Disease/pathology , Binding Sites , Case-Control Studies , DNA Mutational Analysis , Databases, Genetic , Disease Susceptibility , Female , High-Throughput Nucleotide Sequencing , Humans , Male , Middle Aged , Protein Binding , Reproducibility of Results
3.
Nat Commun ; 9(1): 800, 2018 02 23.
Article in English | MEDLINE | ID: mdl-29476074

ABSTRACT

Human aging is associated with a decline in skeletal muscle (SkM) function and a reduction in the number and activity of satellite cells (SCs), the resident stem cells. To study the connection between SC aging and muscle impairment, we analyze the whole genome of single SC clones of the leg muscle vastus lateralis from healthy individuals of different ages (21-78 years). We find an accumulation rate of 13 somatic mutations per genome per year, consistent with proliferation of SCs in the healthy adult muscle. SkM-expressed genes are protected from mutations, but aging results in an increase in mutations in exons and promoters, targeting genes involved in SC activity and muscle function. In agreement with SC mutations affecting the whole tissue, we detect a missense mutation in a SC propagating to the muscle. Our results suggest somatic mutagenesis in SCs as a driving force in the age-related decline of SkM function.


Subject(s)
Aging/genetics , Muscle, Skeletal/growth & development , Mutation , Satellite Cells, Skeletal Muscle/cytology , Adult , Aged , Aging/metabolism , Cell Differentiation , Cell Proliferation , Connectin/genetics , Connectin/metabolism , Cytokines/genetics , Cytokines/metabolism , Exons , Female , Fibronectins , Heparan Sulfate Proteoglycans/genetics , Heparan Sulfate Proteoglycans/metabolism , Humans , Male , Middle Aged , Muscle, Skeletal/cytology , Muscle, Skeletal/metabolism , Mutagenesis , Neoplasm Proteins/genetics , Neoplasm Proteins/metabolism , Promoter Regions, Genetic , Satellite Cells, Skeletal Muscle/metabolism , Young Adult
4.
Eur J Hum Genet ; 25(11): 1253-1260, 2017 11.
Article in English | MEDLINE | ID: mdl-28832569

ABSTRACT

Here we describe the SweGen data set, a comprehensive map of genetic variation in the Swedish population. These data represent a basic resource for clinical genetics laboratories as well as for sequencing-based association studies by providing information on genetic variant frequencies in a cohort that is well matched to national patient cohorts. To select samples for this study, we first examined the genetic structure of the Swedish population using high-density SNP-array data from a nation-wide cohort of over 10 000 Swedish-born individuals included in the Swedish Twin Registry. A total of 1000 individuals, reflecting a cross-section of the population and capturing the main genetic structure, were selected for whole-genome sequencing. Analysis pipelines were developed for automated alignment, variant calling and quality control of the sequencing data. This resulted in a genome-wide collection of aggregated variant frequencies in the Swedish population that we have made available to the scientific community through the website https://swefreq.nbis.se. A total of 29.2 million single-nucleotide variants and 3.8 million indels were detected in the 1000 samples, with 9.9 million of these variants not present in current databases. Each sample contributed with an average of 7199 individual-specific variants. In addition, an average of 8645 larger structural variants (SVs) were detected per individual, and we demonstrate that the population frequencies of these SVs can be used for efficient filtering analyses. Finally, our results show that the genetic diversity within Sweden is substantial compared with the diversity among continental European populations, underscoring the relevance of establishing a local reference data set.


Subject(s)
Genome, Human , Polymorphism, Single Nucleotide , Registries , Datasets as Topic , Genome-Wide Association Study , Humans , Sweden , Twins/genetics
5.
PLoS One ; 12(2): e0171435, 2017.
Article in English | MEDLINE | ID: mdl-28192450

ABSTRACT

We have developed a novel analysis method that can interrogate the authenticity of biological samples used for generation of transcriptome profiles in public data repositories. The method uses RNA sequencing information to reveal mutations in expressed transcripts and subsequently confirms the identity of analysed cells by comparison with publicly available cell-specific mutational profiles. Cell lines constitute key model systems widely used within cancer research, but their identity needs to be confirmed in order to minimise the influence of cell contaminations and genetic drift on the analysis. Using both public and novel data, we demonstrate the use of RNA-sequencing data analysis for cell line authentication by examining the validity of COLO205, DLD1, HCT15, HCT116, HKE3, HT29 and RKO colorectal cancer cell lines. We successfully authenticate the studied cell lines and validate previous reports indicating that DLD1 and HCT15 are synonymous. We also show that the analysed HKE3 cells harbour an unexpected KRAS-G13D mutation and confirm that this cell line is a genuine KRAS dosage mutant, rather than a true isogenic derivative of HCT116 expressing only the wild type KRAS. This authentication method could be used to revisit the numerous cell line based RNA sequencing experiments available in public data repositories, analyse new experiments where whole genome sequencing is not available, as well as facilitate comparisons of data from different experiments, platforms and laboratories.


Subject(s)
DNA Mutational Analysis/methods , Mutation , Sequence Analysis, RNA/methods , Transcriptome/genetics , Cell Line, Tumor , Colorectal Neoplasms/genetics , Colorectal Neoplasms/pathology , Gene Expression Profiling/methods , Gene Expression Regulation, Neoplastic , Genetic Predisposition to Disease/genetics , HCT116 Cells , HT29 Cells , Humans , Proto-Oncogene Proteins p21(ras)/genetics , Reproducibility of Results
6.
Schizophr Res ; 169(1-3): 441-446, 2015 Dec.
Article in English | MEDLINE | ID: mdl-26421691

ABSTRACT

Formal thought disorder is a major feature of schizophrenia and other psychotic disorders. It is heritable, found in healthy relatives of patients with schizophrenia and other mental disorders but knowledge of specific genetic factors is lacking. The aim of this study was to search for biologically relevant high-risk variants. Formal thought disorder was assessed in participants in the Copenhagen Schizophrenia Linkage Study (N=236), a unique high-risk family study comprised of six large pedigrees. Microsatellite linkage analysis of formal thought disorder was performed and subsequent haplotype analysis of the implicated region using phased microsatellite and SNP genotypes. Whole genome sequencing (N=3) was used in the attempt to identify causative variants in the linkage region. Linkage analysis of formal thought disorder resulted in a single peak at chromosome 6(q26-q27) centred on marker D6S1277, with a maximum LOD score of 4.0. Phasing and fine mapping of the linkage peak identified a 5.5Mb haplotype (chr6:162242322-167753547, hg18) in 31 individuals, all belonging to the same pedigree sharing the haplotype from a common ancestor. The haplotype segregated with increased total thought disorder index score (P=4.9 × 10(-5)) and qualitatively severe forms of thought disturbances. Whole genome sequencing identified a novel nucleotide deletion (chr6:164377205 AG>A, hg18) predicted to disrupt the potential binding of the transcription factor MEF2A. The MEF2A binding site is located between two genes previously reported to associate with schizophrenia, QKI (HGNC:21100) and PDE10A (HGNC:8772). The findings are consistent with MEF2A deregulation conferring risk of formal thought disorder.


Subject(s)
Chromosomes, Human, Pair 6 , Genetic Predisposition to Disease/genetics , Genetic Variation/genetics , Schizophrenia/genetics , Adolescent , Adult , Aged , Aged, 80 and over , Denmark , Female , Genetic Linkage , Genome, Human/genetics , Genotype , Humans , Lod Score , Longitudinal Studies , MEF2 Transcription Factors/genetics , Male , Middle Aged , Phosphoric Diester Hydrolases/genetics , RNA-Binding Proteins/genetics , Sequence Analysis, DNA , Statistics, Nonparametric , Young Adult
7.
Genes Chromosomes Cancer ; 54(4): 235-48, 2015 Apr.
Article in English | MEDLINE | ID: mdl-25546585

ABSTRACT

In situ detection of genomic alterations in cancer provides information at the single cell level, making it possible to investigate genomic changes in cells in a tissue context. Such topological information is important when studying intratumor heterogeneity as well as alterations related to different steps in tumor progression. We developed a quantitative multigene fluorescence in situ hybridization (QM FISH) method to detect multiple genomic regions in single cells in complex tissues. As a "proof of principle" we applied the method to breast cancer samples to identify partners in whole arm (WA) translocations. WA gain of chromosome arm 1q and loss of chromosome arm 16q are among the most frequent genomic events in breast cancer. By designing five specific FISH probes based on breakpoint information from comparative genomic hybridization array (aCGH) profiles, we visualized chromosomal translocations in clinical samples at the single cell level. By analyzing aCGH data from 295 patients with breast carcinoma with known molecular subtype, we found concurrent WA gain of 1q and loss of 16q to be more frequent in luminal A tumors compared to other molecular subtypes. QM FISH applied to a subset of samples (n = 26) identified a derivative chromosome der(1;16)(q10;p10), a result of a centromere-close translocation between chromosome arms 1q and 16p. In addition, we observed that the distribution of cells with the translocation varied from sample to sample, some had a homogenous cell population while others displayed intratumor heterogeneity with cell-to-cell variation. Finally, for one tumor with both preinvasive and invasive components, the fraction of cells with translocation was lower and more heterogeneous in the preinvasive tumor cells compared to the cells in the invasive component.


Subject(s)
Breast Neoplasms/genetics , In Situ Hybridization, Fluorescence/methods , Translocation, Genetic , Chromosome Breakage , Chromosomes, Human, Pair 1 , Chromosomes, Human, Pair 16 , Female , Humans
8.
Nat Genet ; 43(11): 1104-7, 2011 Oct 02.
Article in English | MEDLINE | ID: mdl-21964575

ABSTRACT

Ovarian cancer causes more deaths than any other gynecologic malignancy in developed countries. Sixteen million sequence variants, identified through whole-genome sequencing of 457 Icelanders, were imputed to 41,675 Icelanders genotyped using SNP chips, as well as to their relatives. Sequence variants were tested for association with ovarian cancer (N of affected individuals = 656). We discovered a rare (0.41% allelic frequency) frameshift mutation, c.2040_2041insTT, in the BRIP1 (FANCJ) gene that confers an increase in ovarian cancer risk (odds ratio (OR) = 8.13, P = 2.8 × 10(-14)). The mutation was also associated with increased risk of cancer in general and reduced lifespan by 3.6 years. In a Spanish population, another frameshift mutation in BRIP1, c.1702_1703del, was seen in 2 out of 144 subjects with ovarian cancer and 1 out of 1,780 control subjects (P = 0.016). This allele was also associated with breast cancer (seen in 6/927 cases; P = 0.0079). Ovarian tumors from heterozygous carriers of the Icelandic mutation show loss of the wild-type allele, indicating that BRIP1 behaves like a classical tumor suppressor gene in ovarian cancer.


Subject(s)
DNA-Binding Proteins/genetics , Mutation , Ovarian Neoplasms/genetics , RNA Helicases/genetics , Fanconi Anemia Complementation Group Proteins , Female , Humans , Polymorphism, Single Nucleotide
9.
Sci Transl Med ; 2(38): 38ra47, 2010 Jun 30.
Article in English | MEDLINE | ID: mdl-20592421

ABSTRACT

Distinct molecular subtypes of breast carcinomas have been identified, but translation into clinical use has been limited. We have developed two platform-independent algorithms to explore genomic architectural distortion using array comparative genomic hybridization data to measure (i) whole-arm gains and losses [whole-arm aberration index (WAAI)] and (ii) complex rearrangements [complex arm aberration index (CAAI)]. By applying CAAI and WAAI to data from 595 breast cancer patients, we were able to separate the cases into eight subgroups with different distributions of genomic distortion. Within each subgroup data from expression analyses, sequencing and ploidy indicated that progression occurs along separate paths into more complex genotypes. Histological grade had prognostic impact only in the luminal-related groups, whereas the complexity identified by CAAI had an overall independent prognostic power. This study emphasizes the relation among structural genomic alterations, molecular subtype, and clinical behavior and shows that objective score of genomic complexity (CAAI) is an independent prognostic marker in breast cancer.


Subject(s)
Breast Neoplasms/genetics , Disease Progression , Genome, Human/genetics , Chromosome Aberrations , Chromosomes, Human/genetics , Comparative Genomic Hybridization , Female , Gene Expression Profiling , Gene Expression Regulation, Neoplastic , Humans , Prognosis , Proportional Hazards Models
10.
Genome Res ; 20(1): 68-80, 2010 Jan.
Article in English | MEDLINE | ID: mdl-19903760

ABSTRACT

Cancer progression in humans is difficult to infer because we do not routinely sample patients at multiple stages of their disease. However, heterogeneous breast tumors provide a unique opportunity to study human tumor progression because they still contain evidence of early and intermediate subpopulations in the form of the phylogenetic relationships. We have developed a method we call Sector-Ploidy-Profiling (SPP) to study the clonal composition of breast tumors. SPP involves macro-dissecting tumors, flow-sorting genomic subpopulations by DNA content, and profiling genomes using comparative genomic hybridization (CGH). Breast carcinomas display two classes of genomic structural variation: (1) monogenomic and (2) polygenomic. Monogenomic tumors appear to contain a single major clonal subpopulation with a highly stable chromosome structure. Polygenomic tumors contain multiple clonal tumor subpopulations, which may occupy the same sectors, or separate anatomic locations. In polygenomic tumors, we show that heterogeneity can be ascribed to a few clonal subpopulations, rather than a series of gradual intermediates. By comparing multiple subpopulations from different anatomic locations, we have inferred pathways of cancer progression and the organization of tumor growth.


Subject(s)
Breast Neoplasms , Carcinoma, Ductal, Breast , Comparative Genomic Hybridization/methods , Disease Progression , Flow Cytometry/methods , Genetic Heterogeneity , Breast Neoplasms/genetics , Breast Neoplasms/pathology , Carcinoma, Ductal, Breast/genetics , Carcinoma, Ductal, Breast/pathology , Chromosome Breakpoints , Female , Gene Dosage , Humans , In Situ Hybridization, Fluorescence , Informatics , Molecular Sequence Data , Oligonucleotide Array Sequence Analysis , Ploidies , Sequence Analysis, DNA
11.
Genome Res ; 16(12): 1465-79, 2006 Dec.
Article in English | MEDLINE | ID: mdl-17142309

ABSTRACT

Representational Oligonucleotide Microarray Analysis (ROMA) detects genomic amplifications and deletions with boundaries defined at a resolution of approximately 50 kb. We have used this technique to examine 243 breast tumors from two separate studies for which detailed clinical data were available. The very high resolution of this technology has enabled us to identify three characteristic patterns of genomic copy number variation in diploid tumors and to measure correlations with patient survival. One of these patterns is characterized by multiple closely spaced amplicons, or "firestorms," limited to single chromosome arms. These multiple amplifications are highly correlated with aggressive disease and poor survival even when the rest of the genome is relatively quiet. Analysis of a selected subset of clinical material suggests that a simple genomic calculation, based on the number and proximity of genomic alterations, correlates with life-table estimates of the probability of overall survival in patients with primary breast cancer. Based on this sample, we generate the working hypothesis that copy number profiling might provide information useful in making clinical decisions, especially regarding the use or not of systemic therapies (hormonal therapy, chemotherapy), in the management of operable primary breast cancer with ostensibly good prognosis, for example, small, node-negative, hormone-receptor-positive diploid cases.


Subject(s)
Breast Neoplasms/genetics , Gene Rearrangement , Genome, Human , Breast Neoplasms/mortality , Breast Neoplasms/pathology , DNA, Neoplasm/genetics , Diploidy , Female , Gene Amplification , Gene Dosage , Gene Expression Profiling , Genomics , Humans , In Situ Hybridization, Fluorescence , Oligonucleotide Array Sequence Analysis , Prognosis , Retrospective Studies , Survival Analysis
12.
Bioinformatics ; 22(19): 2437-8, 2006 Oct 01.
Article in English | MEDLINE | ID: mdl-16740623

ABSTRACT

UNLABELLED: PROBER is an oligonucleotide primer design software application that designs multiple primer pairs for generating PCR probes useful for fluorescence in situ hybridization (FISH). PROBER generates Tiling Oligonucleotide Probes (TOPs) by masking repetitive genomic sequences and delineating essentially unique regions that can be amplified to yield small (100-2000 bp) DNA probes that in aggregate will generate a single, strong fluorescent signal for regions as small as a single gene. TOPs are an alternative to bacterial artificial chromosomes (BACs) that are commonly used for FISH but may be unstable, unavailable, chimeric, or non-specific to small (10-100 kb) genomic regions. PROBER can be applied to any genomic locus, with the limitation that the locus must contain at least 10 kb of essentially unique blocks. To test the software, we designed a number of probes for genomic amplifications and hemizygous deletions that were initially detected by Representational Oligonucleotide Microarray Analysis of breast cancer tumors. AVAILABILITY: http://prober.cshl.edu


Subject(s)
Algorithms , Chromosome Mapping/methods , DNA Probes/genetics , In Situ Hybridization, Fluorescence/methods , Sequence Analysis, DNA/methods , Software , Base Sequence , Computer-Aided Design , Equipment Design , In Situ Hybridization, Fluorescence/instrumentation , Molecular Sequence Data , Sequence Alignment/methods
13.
Science ; 305(5683): 525-8, 2004 Jul 23.
Article in English | MEDLINE | ID: mdl-15273396

ABSTRACT

The extent to which large duplications and deletions contribute to human genetic variation and diversity is unknown. Here, we show that large-scale copy number polymorphisms (CNPs) (about 100 kilobases and greater) contribute substantially to genomic variation between normal humans. Representational oligonucleotide microarray analysis of 20 individuals revealed a total of 221 copy number differences representing 76 unique CNPs. On average, individuals differed by 11 CNPs, and the average length of a CNP interval was 465 kilobases. We observed copy number variation of 70 different genes within CNP intervals, including genes involved in neurological function, regulation of cell growth, regulation of metabolism, and several genes known to be associated with disease.


Subject(s)
Gene Dosage , Genetic Variation , Genome, Human , Polymorphism, Genetic , Alleles , Bacterial Proteins/metabolism , Cell Line, Transformed , Chromosome Aberrations , Chromosome Mapping , Chromosomes, Human/genetics , Deoxyribonuclease HindIII/metabolism , Deoxyribonucleases, Type II Site-Specific/metabolism , Female , Gene Deletion , Gene Duplication , Gene Frequency , Humans , Male , Markov Chains , Oligonucleotide Array Sequence Analysis
SELECTION OF CITATIONS
SEARCH DETAIL
...