Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 677
Filtrar
1.
Curr Protoc ; 4(6): e1055, 2024 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-38837690

RESUMO

Data harmonization involves combining data from multiple independent sources and processing the data to produce one uniform dataset. Merging separate genotypes or whole-genome sequencing datasets has been proposed as a strategy to increase the statistical power of association tests by increasing the effective sample size. However, data harmonization is not a widely adopted strategy due to the difficulties with merging data (including confounding produced by batch effects and population stratification). Detailed data harmonization protocols are scarce and are often conflicting. Moreover, data harmonization protocols that accommodate samples of admixed ancestry are practically non-existent. Existing data harmonization procedures must be modified to ensure the heterogeneous ancestry of admixed individuals is incorporated into additional downstream analyses without confounding results. Here, we propose a set of guidelines for merging multi-platform genetic data from admixed samples that can be adopted by any investigator with elementary bioinformatics experience. We have applied these guidelines to aggregate 1544 tuberculosis (TB) case-control samples from six separate in-house datasets and conducted a genome-wide association study (GWAS) of TB susceptibility. The GWAS performed on the merged dataset had improved power over analyzing the datasets individually and produced summary statistics free from bias introduced by batch effects and population stratification. © 2024 Wiley Periodicals LLC. Basic Protocol 1: Processing separate datasets comprising array genotype data Alternate Protocol 1: Processing separate datasets comprising array genotype and whole-genome sequencing data Alternate Protocol 2: Performing imputation using a local reference panel Basic Protocol 2: Merging separate datasets Basic Protocol 3: Ancestry inference using ADMIXTURE and RFMix Basic Protocol 4: Batch effect correction using pseudo-case-control comparisons.


Assuntos
Estudo de Associação Genômica Ampla , Humanos , Estudo de Associação Genômica Ampla/métodos , Estudo de Associação Genômica Ampla/normas , Genômica/métodos , Genômica/normas , Tuberculose/genética , Estudos de Casos e Controles , Guias como Assunto , Predisposição Genética para Doença
2.
Methods Mol Biol ; 2802: 587-609, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-38819573

RESUMO

Comparative analysis of (meta)genomes necessitates aggregation, integration, and synthesis of well-annotated data using standards. The Genomic Standards Consortium (GSC) collaborates with the research community to develop and maintain the Minimum Information about any (x) Sequence (MIxS) reporting standard for genomic data. To facilitate the use of the GSC's MIxS reporting standard, we provide a description of the structure and terminology, how to navigate ontologies for required terms in MIxS, and demonstrate practical usage through a soil metagenome example.


Assuntos
Genômica , Metagenoma , Metagenômica , Metagenômica/métodos , Metagenômica/normas , Genômica/métodos , Genômica/normas , Metagenoma/genética , Bases de Dados Genéticas , Microbiologia do Solo
4.
Genet Med ; 26(7): 101137, 2024 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-38814327

RESUMO

Carrier screening has historically assessed a relatively small number of autosomal recessive and X-linked conditions selected based on frequency in a specific subpopulation and association with severe morbidity or mortality. Advances in genomic technologies enable simultaneous screening of individuals for several conditions. The American College of Medical Genetics and Genomics recently published a clinical practice resource that presents a framework when offering screening for autosomal recessive and X-linked conditions during pregnancy and preconception and recommends a tier-based approach when considering the number of conditions to screen for and their frequency within the US population in general. This laboratory technical standard aims to complement the practice resource and to put forth considerations for clinical laboratories and clinicians who offer preconception/prenatal carrier screening.


Assuntos
Triagem de Portadores Genéticos , Testes Genéticos , Genética Médica , Genômica , Diagnóstico Pré-Natal , Humanos , Triagem de Portadores Genéticos/métodos , Triagem de Portadores Genéticos/normas , Gravidez , Feminino , Genômica/métodos , Genômica/normas , Diagnóstico Pré-Natal/métodos , Diagnóstico Pré-Natal/normas , Testes Genéticos/normas , Testes Genéticos/métodos , Genética Médica/normas , Estados Unidos , Cuidado Pré-Concepcional/métodos , Cuidado Pré-Concepcional/normas , Aconselhamento Genético/normas , Aconselhamento Genético/métodos
5.
Eur J Hum Genet ; 32(6): 665-672, 2024 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-38565640

RESUMO

Currently, there are no widely accepted recommendations in the genomics field guiding the return of incidental findings (IFs), defined here as unexpected results that are unrelated to the indication for testing. Consequently, reporting policies for IFs among laboratories offering genomic testing are variable and may lack transparency. Herein we describe a framework developed to guide the evaluation and return of IFs encountered in probands undergoing clinical genome sequencing (cGS). The framework prioritizes clinical significance and actionability of IFs and follows a stepwise approach with stopping points at which IFs may be recommended for return or not. Over 18 months, implementation of the framework in a clinical laboratory facilitated the return of actionable IFs in 37 of 720 (5.1%) individuals referred for cGS, which is reduced to 3.1% if glucose-6-phosphate dehydrogenase (G6PD) deficiency is excluded. This framework can serve as a model to standardize reporting of IFs identified during genomic testing.


Assuntos
Testes Genéticos , Achados Incidentais , Humanos , Testes Genéticos/normas , Testes Genéticos/métodos , Genômica/normas , Genômica/métodos
6.
Genet Med ; 26(6): 101116, 2024 06.
Artigo em Inglês | MEDLINE | ID: mdl-38459833

RESUMO

PURPOSE: Determining the value of genomic tests in rare disease necessitates a broader conceptualization of genomic utility beyond diagnostic yield. Despite widespread discussion, consensus toward which aspects of value to consider is lacking. This study aimed to use expert opinion to identify and refine priority indicators of utility in rare disease genomic testing. METHODS: We used 2 survey rounds following Delphi methodology to obtain consensus on indicators of utility among experts involved in policy, clinical, research, and consumer advocacy leadership in Australia. We analyzed quantitative and qualitative data to identify, define, and determine priority indicators. RESULTS: Twenty-five experts completed round 1 and 18 completed both rounds. Twenty indicators reached consensus as a priority in value assessment, including those relating to prognostic information, timeliness of results, practical and health care outcomes, clinical accreditation, and diagnostic yield. Whereas indicators pertaining to discovery research, disutility, and factors secondary to primary reason for testing were considered less of a priority and were removed. CONCLUSION: This study obtained expert consensus on different utility indicators that are considered a priority in determining the value of genomic testing in rare disease in Australia. Indicators may inform a standardized approach to evidence generation and assessment to guide future research, decision making, and implementation efforts.


Assuntos
Técnica Delphi , Testes Genéticos , Genômica , Doenças Raras , Humanos , Doenças Raras/genética , Doenças Raras/diagnóstico , Testes Genéticos/normas , Testes Genéticos/métodos , Genômica/métodos , Genômica/normas , Austrália , Consenso , Inquéritos e Questionários
8.
Eur J Hum Genet ; 32(5): 521-528, 2024 May.
Artigo em Inglês | MEDLINE | ID: mdl-38212661

RESUMO

Automating reanalysis of genomic data for undiagnosed rare disease patients presents a paradigm shift in how clinical genomics is delivered. We aimed to map the current manual and proposed automated approach to reanalysis and identify possible implementation strategies to address clinical and laboratory staff's perceived challenges to automation. Fourteen semi-structured interviews guided by a simplified process map were conducted with clinical and laboratory staff across Australia. Individual process maps were integrated into an overview of the current process, noting variation in service delivery. Participants then mapped an automated approach and were invited to discuss perceived challenges and possible supports to automation. Responses were analysed using the Consolidated Framework for Implementation Research, linking to the Expert Recommendations for Implementing Change framework to identify theory-informed implementation strategies. Process mapping demonstrates how automation streamlines processes with eleven steps reduced to seven. Although participants welcomed automation, challenges were raised at six of the steps. Strategies to overcome challenges include embedding project champions, developing education materials, facilitating clinical innovation and quality monitoring tools, and altering reimbursement structures. Future work can build on these findings to develop context specific implementation strategies to guide translation of an automated approach to reanalysis to improve clinical care and patient outcomes.


Assuntos
Genômica , Humanos , Genômica/métodos , Genômica/normas , Pesquisa Qualitativa , Testes Genéticos/normas , Testes Genéticos/métodos , Austrália , Automação
9.
Nature ; 621(7978): 344-354, 2023 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-37612512

RESUMO

The human Y chromosome has been notoriously difficult to sequence and assemble because of its complex repeat structure that includes long palindromes, tandem repeats and segmental duplications1-3. As a result, more than half of the Y chromosome is missing from the GRCh38 reference sequence and it remains the last human chromosome to be finished4,5. Here, the Telomere-to-Telomere (T2T) consortium presents the complete 62,460,029-base-pair sequence of a human Y chromosome from the HG002 genome (T2T-Y) that corrects multiple errors in GRCh38-Y and adds over 30 million base pairs of sequence to the reference, showing the complete ampliconic structures of gene families TSPY, DAZ and RBMY; 41 additional protein-coding genes, mostly from the TSPY family; and an alternating pattern of human satellite 1 and 3 blocks in the heterochromatic Yq12 region. We have combined T2T-Y with a previous assembly of the CHM13 genome4 and mapped available population variation, clinical variants and functional genomics data to produce a complete and comprehensive reference sequence for all 24 human chromosomes.


Assuntos
Cromossomos Humanos Y , Genômica , Análise de Sequência de DNA , Humanos , Sequência de Bases , Cromossomos Humanos Y/genética , DNA Satélite/genética , Variação Genética/genética , Genética Populacional , Genômica/métodos , Genômica/normas , Heterocromatina/genética , Família Multigênica/genética , Padrões de Referência , Duplicações Segmentares Genômicas/genética , Análise de Sequência de DNA/normas , Sequências de Repetição em Tandem/genética , Telômero/genética
11.
JAMA ; 330(3): 205-206, 2023 07 18.
Artigo em Inglês | MEDLINE | ID: mdl-37379037

RESUMO

This Medical News article discusses the Human Pangenome Project.


Assuntos
Genoma Humano , Genômica , Medicina , Humanos , Genoma Humano/genética , Genômica/normas , Medicina/tendências
12.
Nature ; 617(7960): 312-324, 2023 05.
Artigo em Inglês | MEDLINE | ID: mdl-37165242

RESUMO

Here the Human Pangenome Reference Consortium presents a first draft of the human pangenome reference. The pangenome contains 47 phased, diploid assemblies from a cohort of genetically diverse individuals1. These assemblies cover more than 99% of the expected sequence in each genome and are more than 99% accurate at the structural and base pair levels. Based on alignments of the assemblies, we generate a draft pangenome that captures known variants and haplotypes and reveals new alleles at structurally complex loci. We also add 119 million base pairs of euchromatic polymorphic sequences and 1,115 gene duplications relative to the existing reference GRCh38. Roughly 90 million of the additional base pairs are derived from structural variation. Using our draft pangenome to analyse short-read data reduced small variant discovery errors by 34% and increased the number of structural variants detected per haplotype by 104% compared with GRCh38-based workflows, which enabled the typing of the vast majority of structural variant alleles per sample.


Assuntos
Genoma Humano , Genômica , Humanos , Diploide , Genoma Humano/genética , Haplótipos/genética , Análise de Sequência de DNA , Genômica/normas , Padrões de Referência , Estudos de Coortes , Alelos , Variação Genética
14.
BMC Genomics ; 24(1): 117, 2023 Mar 16.
Artigo em Inglês | MEDLINE | ID: mdl-36927511

RESUMO

BACKGROUND: Generating the most contiguous, accurate genome assemblies given available sequencing technologies is a long-standing challenge in genome science. With the rise of long-read sequencing, assembly challenges have shifted from merely increasing contiguity to correctly assembling complex, repetitive regions of interest, ideally in a phased manner. At present, researchers largely choose between two types of long read data: longer, but less accurate sequences, or highly accurate, but shorter reads (i.e., >Q20 or 99% accurate). To better understand how these types of long-read data as well as scale of data (i.e., mean length and sequencing depth) influence genome assembly outcomes, we compared genome assemblies for a caddisfly, Hesperophylax magnus, generated with longer, but less accurate, Oxford Nanopore (ONT) R9.4.1 and highly accurate PacBio HiFi (HiFi) data. Next, we expanded this comparison to consider the influence of highly accurate long-read sequence data on genome assemblies across 6750 plant and animal genomes. For this broader comparison, we used HiFi data as a surrogate for highly accurate long-reads broadly as we could identify when they were used from GenBank metadata. RESULTS: HiFi reads outperformed ONT reads in all assembly metrics tested for the caddisfly data set and allowed for accurate assembly of the repetitive ~ 20 Kb H-fibroin gene. Across plants and animals, genome assemblies that incorporated HiFi reads were also more contiguous. For plants, the average HiFi assembly was 501% more contiguous (mean contig N50 = 20.5 Mb) than those generated with any other long-read data (mean contig N50 = 4.1 Mb). For animals, HiFi assemblies were 226% more contiguous (mean contig N50 = 20.9 Mb) versus other long-read assemblies (mean contig N50 = 9.3 Mb). In plants, we also found limited evidence that HiFi may offer a unique solution for overcoming genomic complexity that scales with assembly size. CONCLUSIONS: Highly accurate long-reads generated with HiFi or analogous technologies represent a key tool for maximizing genome assembly quality for a wide swath of plants and animals. This finding is particularly important when resources only allow for one type of sequencing data to be generated. Ultimately, to realize the promise of biodiversity genomics, we call for greater uptake of highly accurate long-reads in future studies.


Assuntos
Biodiversidade , Genômica , Sequenciamento de Nucleotídeos em Larga Escala , Análise de Sequência de DNA , Genômica/métodos , Genômica/normas , Genômica/tendências , Insetos/classificação , Insetos/genética , Fibroínas/genética , Mapeamento de Sequências Contíguas , Genoma de Inseto/genética , Animais , Bases de Dados de Ácidos Nucleicos , Reprodutibilidade dos Testes , Metanálise como Assunto , Conjuntos de Dados como Assunto , Análise de Sequência de DNA/métodos , Análise de Sequência de DNA/normas , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Sequenciamento de Nucleotídeos em Larga Escala/normas , Sequenciamento de Nucleotídeos em Larga Escala/tendências , Plantas/genética , Genoma de Planta/genética
15.
Trends Genet ; 39(3): 175-186, 2023 03.
Artigo em Inglês | MEDLINE | ID: mdl-36402623

RESUMO

Quality control is essential for genome assemblies; however, a consensus has yet to be reached on what metrics should be adopted for the evaluation of assembly quality. N50 is widely used for contiguity measurement, but its effectiveness is constantly in question. Prevailing metrics for the completeness evaluation focus on gene space, yet challenging areas such as tandem repeats are commonly overlooked. Achieving correctness has become an indispensable dimension for quality control, while prevailing assembly releases lack scores reflecting this aspect. We propose a metric set with a set of statistic indexes for effective, comprehensive evaluation of assemblies and provide a score of a finished assembly for each metric, which can be utilized as a benchmark for achieving high-quality genome assemblies.


Assuntos
Genômica , Análise de Sequência de DNA , Análise de Sequência de DNA/métodos , Genômica/normas
16.
Nature ; 611(7936): 519-531, 2022 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-36261518

RESUMO

The current human reference genome, GRCh38, represents over 20 years of effort to generate a high-quality assembly, which has benefitted society1,2. However, it still has many gaps and errors, and does not represent a biological genome as it is a blend of multiple individuals3,4. Recently, a high-quality telomere-to-telomere reference, CHM13, was generated with the latest long-read technologies, but it was derived from a hydatidiform mole cell line with a nearly homozygous genome5. To address these limitations, the Human Pangenome Reference Consortium formed with the goal of creating high-quality, cost-effective, diploid genome assemblies for a pangenome reference that represents human genetic diversity6. Here, in our first scientific report, we determined which combination of current genome sequencing and assembly approaches yield the most complete and accurate diploid genome assembly with minimal manual curation. Approaches that used highly accurate long reads and parent-child data with graph-based haplotype phasing during assembly outperformed those that did not. Developing a combination of the top-performing methods, we generated our first high-quality diploid reference assembly, containing only approximately four gaps per chromosome on average, with most chromosomes within ±1% of the length of CHM13. Nearly 48% of protein-coding genes have non-synonymous amino acid changes between haplotypes, and centromeric regions showed the highest diversity. Our findings serve as a foundation for assembling near-complete diploid human genomes at scale for a pangenome reference to capture global genetic variation from single nucleotides to structural rearrangements.


Assuntos
Mapeamento Cromossômico , Diploide , Genoma Humano , Genômica , Humanos , Mapeamento Cromossômico/normas , Genoma Humano/genética , Haplótipos/genética , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Sequenciamento de Nucleotídeos em Larga Escala/normas , Análise de Sequência de DNA/métodos , Análise de Sequência de DNA/normas , Padrões de Referência , Genômica/métodos , Genômica/normas , Cromossomos Humanos/genética , Variação Genética/genética
18.
Science ; 376(6588): eabj5089, 2022 04.
Artigo em Inglês | MEDLINE | ID: mdl-35357915

RESUMO

The completion of a telomere-to-telomere human reference genome, T2T-CHM13, has resolved complex regions of the genome, including repetitive and homologous regions. Here, we present a high-resolution epigenetic study of previously unresolved sequences, representing entire acrocentric chromosome short arms, gene family expansions, and a diverse collection of repeat classes. This resource precisely maps CpG methylation (32.28 million CpGs), DNA accessibility, and short-read datasets (166,058 previously unresolved chromatin immunoprecipitation sequencing peaks) to provide evidence of activity across previously unidentified or corrected genes and reveals clinically relevant paralog-specific regulation. Probing CpG methylation across human centromeres from six diverse individuals generated an estimate of variability in kinetochore localization. This analysis provides a framework with which to investigate the most elusive regions of the human genome, granting insights into epigenetic regulation.


Assuntos
Ilhas de CpG , Metilação de DNA , Epigênese Genética , Genoma Humano , Centrômero/genética , Centrômero/metabolismo , Doença/genética , Loci Gênicos , Genômica/normas , Humanos , Padrões de Referência , Análise de Sequência de DNA
19.
Science ; 376(6588): eabl3533, 2022 04.
Artigo em Inglês | MEDLINE | ID: mdl-35357935

RESUMO

Compared to its predecessors, the Telomere-to-Telomere CHM13 genome adds nearly 200 million base pairs of sequence, corrects thousands of structural errors, and unlocks the most complex regions of the human genome for clinical and functional study. We show how this reference universally improves read mapping and variant calling for 3202 and 17 globally diverse samples sequenced with short and long reads, respectively. We identify hundreds of thousands of variants per sample in previously unresolved regions, showcasing the promise of the T2T-CHM13 reference for evolutionary and biomedical discovery. Simultaneously, this reference eliminates tens of thousands of spurious variants per sample, including reduction of false positives in 269 medically relevant genes by up to a factor of 12. Because of these improvements in variant discovery coupled with population and functional genomic resources, T2T-CHM13 is positioned to replace GRCh38 as the prevailing reference for human genetics.


Assuntos
Variação Genética , Genoma Humano , Genômica/normas , Análise de Sequência de DNA/normas , Humanos , Padrões de Referência
20.
Cancer Cell ; 40(2): 109-113, 2022 02 14.
Artigo em Inglês | MEDLINE | ID: mdl-35120599

RESUMO

Cancers other than breast, colorectal, cervical, and lung do not have guideline-recommended screening. New multi-cancer early detection (MCED) tests-using a single blood sample-have been developed based on circulating cell-free DNA (cfDNA) or other analytes. In this commentary, we review the current evidence on these tests, provide several major considerations for new MCED tests, and outline how their evaluation will need to differ from that established for traditional single-cancer screening tests.


Assuntos
Biomarcadores Tumorais , Detecção Precoce de Câncer , Genômica/métodos , Neoplasias/diagnóstico , Neoplasias/genética , Tomada de Decisão Clínica , Gerenciamento Clínico , Suscetibilidade a Doenças , Detecção Precoce de Câncer/métodos , Detecção Precoce de Câncer/normas , Genômica/normas , Humanos , Especificidade de Órgãos
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...