Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 806
Filtrar
1.
Int J Mol Sci ; 25(16)2024 Aug 22.
Artigo em Inglês | MEDLINE | ID: mdl-39201813

RESUMO

Genetic variation in the FCGR3B gene is responsible for different variants of human neutrophil antigen 1 (HNA-1). Laboratory techniques currently utilized for routine HNA-1 genotyping, predominantly PCR-sequence-specific primer (PCR-SSP) and PCR-sequence-based typing (PCR-SBT), lack specificity for FCGR3B. This study compares the capabilities and limitations of existing technologies including an in-house TaqMan PCR, a commercial PCR-SSP test, PCR-SBT and multiplex ligation-dependent probe amplification (MLPA) with those of a long-read nanopore sequencing assay. Testing was performed with both related and unrelated Danish samples with different copy numbers and/or rare alleles. Long-read nanopore sequencing was validated by blind testing of ten English samples. The results showed that FCGR3B copy numbers correlate with a dose-dependent distribution of alleles that complicates genotyping by TaqMan PCR, PCR-SSP and PCR-SBT, due to co-amplification of the homologous FCGR3A gene. MLPA can correctly quantify the dose-dependent distribution but not detect novel variants. Long-read nanopore sequencing showed high specificity for FCGR3B and was able to detect dosage-dependent distribution, and rare and novel variants that were previously not described. Current HNA-1 genotyping methods cannot produce unambiguous allele-level results, whereas long-read nanopore sequencing has shown the potential to resolve observed ambiguities, identify new HNA-1 variants and allow definitive allele assignment.


Assuntos
Alelos , Proteínas Ligadas por GPI , Genótipo , Receptores de IgG , Humanos , Técnicas de Genotipagem/métodos , Proteínas Ligadas por GPI/genética , Isoantígenos/genética , Reação em Cadeia da Polimerase Multiplex/métodos , Sequenciamento por Nanoporos/métodos , Reação em Cadeia da Polimerase/métodos , Receptores de IgG/genética
2.
Genes (Basel) ; 15(8)2024 Aug 05.
Artigo em Inglês | MEDLINE | ID: mdl-39202389

RESUMO

Lichens have developed numerous adaptations to optimize their survival in various environmental conditions, largely by producing secondary compounds by the fungal partner. They often have antibiotic properties and are involved in protection against intensive UV radiation, pathogens, and herbivores. To contribute to the knowledge of the arsenal of secondary compounds in a crustose lichen species, we sequenced and assembled the genome of Toniniopsis dissimilis, an indicator of old-growth forests, using Oxford Nanopore Technologies (ONT, Oxford, UK) long reads. Our analyses focused on biosynthetic gene clusters (BGCs) and specifically on Type I Polyketide (T1PKS) genes involved in the biosynthesis of polyketides. We used the comparative genomic approach to compare the genome of T. dissimilis with six other members of the family Ramalinaceae and twenty additional lichen genomes from the database. With only six T1PKS genes, a comparatively low number of biosynthetic genes are present in the T. dissimilis genome; from those, two-thirds are putatively involved in melanin biosynthesis. The comparative analyses showed at least three potential pathways of melanin biosynthesis in T. dissimilis, namely via the formation of 1,3,6,8-tetrahydroxynaphthalene, naphthopyrone, or YWA1 putative precursors, which highlights its importance in T. dissimilis. In addition, we report the occurrence of genes encoding ribosomally synthesized and posttranslationally modified peptides (RiPPs) in lichens, with their highest number in T. dissimilis compared to other Ramalinaceae genomes. So far, no function has been assigned to RiPP-like proteins in lichens, which leaves potential for future research on this topic.


Assuntos
Genoma Fúngico , Líquens , Melaninas , Melaninas/biossíntese , Melaninas/genética , Líquens/genética , Líquens/metabolismo , Família Multigênica , Filogenia , Vias Biossintéticas/genética , Ascomicetos/genética , Ascomicetos/metabolismo , Policetídeos/metabolismo , Proteínas Fúngicas/genética , Proteínas Fúngicas/metabolismo
3.
mSystems ; : e0024224, 2024 Aug 19.
Artigo em Inglês | MEDLINE | ID: mdl-39158287

RESUMO

Although long-read sequencing has enabled obtaining high-quality and complete genomes from metagenomes, many challenges still remain to completely decompose a metagenome into its constituent prokaryotic and viral genomes. This study focuses on decomposing an estuarine metagenome to obtain a more accurate estimate of microbial diversity. To achieve this, we developed a new bead-based DNA extraction method, a novel bin refinement method, and obtained 150 Gbp of Nanopore sequencing. We estimate that there are ~500 bacterial and archaeal species in our sample and obtained 68 high-quality bins (>90% complete, <5% contamination, ≤5 contigs, contig length of >100 kbp, and all ribosomal and tRNA genes). We also obtained many contigs of picoeukaryotes, environmental DNA of larger eukaryotes such as mammals, and complete mitochondrial and chloroplast genomes and detected ~40,000 viral populations. Our analysis indicates that there are only a few strains that comprise most of the species abundances. IMPORTANCE: Ocean and estuarine microbiomes play critical roles in global element cycling and ecosystem function. Despite the importance of these microbial communities, many species still have not been cultured in the lab. Environmental sequencing is the primary way the function and population dynamics of these communities can be studied. Long-read sequencing provides an avenue to overcome limitations of short-read technologies to obtain complete microbial genomes but comes with its own technical challenges, such as needed sequencing depth and obtaining high-quality DNA. We present here new sampling and bioinformatics methods to attempt decomposing an estuarine microbiome into its constituent genomes. Our results suggest there are only a few strains that comprise most of the species abundances from viruses to picoeukaryotes, and to fully decompose a metagenome of this diversity requires 1 Tbp of long-read sequencing. We anticipate that as long-read sequencing technologies continue to improve, less sequencing will be needed.

4.
Brief Funct Genomics ; 2024 Aug 19.
Artigo em Inglês | MEDLINE | ID: mdl-39158328

RESUMO

Long-read sequencing technologies can capture entire RNA transcripts in a single sequencing read, reducing the ambiguity in constructing and quantifying transcript models in comparison to more common and earlier methods, such as short-read sequencing. Recent improvements in the accuracy of long-read sequencing technologies have expanded the scope for novel splice isoform detection and have also enabled a far more accurate reconstruction of complex splicing patterns and transcriptomes. Additionally, the incorporation and advancements of machine learning and deep learning algorithms in bioinformatic software have significantly improved the reliability of long-read sequencing transcriptomic studies. However, there is a lack of consensus on what bioinformatic tools and pipelines produce the most precise and consistent results. Thus, this review aims to discuss and compare the performance of available methods for novel isoform discovery with long-read sequencing technologies, with 25 tools being presented. Furthermore, this review intends to demonstrate the need for developing standard analytical pipelines, tools, and transcript model conventions for novel isoform discovery and transcriptomic studies.

5.
J Clin Microbiol ; : e0074124, 2024 Aug 13.
Artigo em Inglês | MEDLINE | ID: mdl-39136450

RESUMO

The transition from MIRU-VNTR-based epidemiology studies in tuberculosis (TB) to genomic epidemiology has transformed how we track transmission. However, short-read sequencing is poor at analyzing repetitive regions such as the MIRU-VNTR loci. This causes a gap between the new genomic data and the large amount of information stored in historical databases. Long-read sequencing could bridge this knowledge gap by allowing analysis of repetitive regions. However, the feasibility of extracting MIRU-VNTRs from long reads and linking them to historical data has not been evaluated. In our study, an in silico arm, consisting of inference of MIRU patterns from long-read sequences (using MIRUReader program), was compared with an experimental arm, involving standard amplification and fragment sizing. We analyzed overall performance on 39 isolates from South Africa and confirmed reproducibility in a sample enriched with 62 clustered cases from Spain. Finally, we ran 25 consecutive incident cases, demonstrating the feasibility of correctly assigning new clustered/orphan cases by linking data inferred from genomic analysis to MIRU-VNTR databases. Of the 3,024 loci analyzed, only 11 discrepancies (0.36%) were found between the two arms: three attributed to experimental error and eight to misassigned alleles from long-read sequencing. A second round of analysis of these discrepancies resulted in agreement between the experimental and in silico arms in all but one locus. Adjusting the MIRUReader program code allowed us to flag potential in silico misassignments due to suboptimal coverage or unfixed double alleles. Our study indicates that long-read sequencing could help address potential chronological and geographical gaps arising from the transition from molecular to genomic epidemiology of tuberculosis. IMPORTANCE: The transition from molecular epidemiology in tuberculosis (TB), based on the analysis of repetitive regions (VNTR-based genotyping), to genomic epidemiology transforms in the precision with which we track transmission. However, short-read sequencing, the most common method for performing genomic analysis, is poor at analyzing repetitive regions. This means that we face a gap between the new genomic data and the large amount of information stored in historical databases, which is also an obstacle to cross-national surveillance involving settings where only molecular data are available. Long-read sequencing could help bridge this knowledge gap by allowing analysis of repetitive regions. Our study demonstrates that MIRU-VNTR patterns can be successfully inferred from long-read sequences, allowing the correct assignment of new cases as clustered/orphan by linking new data extracted from genomic analysis to historical MIRU-VNTR databases. Our data may provide a starting point for bridging the knowledge gap between the molecular and genomic eras in tuberculosis epidemiology.

6.
Genome Biol Evol ; 16(8)2024 Aug 05.
Artigo em Inglês | MEDLINE | ID: mdl-39101619

RESUMO

The plant Arabidopsis thaliana is a model system used by researchers through much of plant research. Recent efforts have focused on discovering the genomic variation found in naturally occurring ecotypes isolated from around the world. These ecotypes have come from diverse climates and therefore have faced and adapted to a variety of abiotic and biotic stressors. The sequencing and comparative analysis of these genomes can offer insight into the adaptive strategies of plants. While there are a large number of ecotype genome sequences available, the majority were created using short-read technology. Mapping of short-reads containing structural variation to a reference genome bereft of that variation leads to incorrect mapping of those reads, resulting in a loss of genetic information and introduction of false heterozygosity. For this reason, long-read de novo sequencing of genomes is required to resolve structural variation events. In this article, we sequenced the genomes of eight natural variants of A. thaliana using nanopore sequencing. This resulted in highly contiguous assemblies with >95% of the genome contained within five contigs. The sequencing results from this study include five ecotypes from relict and African populations, an area of untapped genetic diversity. With this study, we increase the knowledge of diversity we have across A. thaliana ecotypes and contribute to ongoing production of an A. thaliana pan-genome.


Assuntos
Arabidopsis , Ecótipo , Genoma de Planta , Arabidopsis/genética , Cromossomos de Plantas/genética , Anotação de Sequência Molecular , Variação Genética
7.
Front Vet Sci ; 11: 1443855, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-39144078

RESUMO

Introduction: Spillover events of Mycoplasma ovipneumoniae have devastating effects on the wild sheep populations. Multilocus sequence typing (MLST) is used to monitor spillover events and the spread of M. ovipneumoniae between the sheep populations. Most studies involving the typing of M. ovipneumoniae have used Sanger sequencing. However, this technology is time-consuming, expensive, and is not well suited to efficient batch sample processing. Methods: Our study aimed to develop and validate an MLST workflow for typing of M. ovipneumoniae using Nanopore Rapid Barcoding sequencing and multiplex polymerase chain reaction (PCR). We compare the workflow with Nanopore Native Barcoding library preparation and Illumina MiSeq amplicon protocols to determine the most accurate and cost-effective method for sequencing multiplex amplicons. A multiplex PCR was optimized for four housekeeping genes of M. ovipneumoniae using archived DNA samples (N = 68) from nasal swabs. Results: Sequences recovered from Nanopore Rapid Barcoding correctly identified all MLST types with the shortest total workflow time and lowest cost per sample when compared with Nanopore Native Barcoding and Illumina MiSeq methods. Discussion: Our proposed workflow is a convenient and effective method for strain typing of M. ovipneumoniae and can be applied to other bacterial MLST schemes. The workflow is suitable for diagnostic settings, where reduced hands-on time, cost, and multiplexing capabilities are important.

8.
G3 (Bethesda) ; 2024 Aug 16.
Artigo em Inglês | MEDLINE | ID: mdl-39148415

RESUMO

The recent acceleration in genome sequencing targeting previously unexplored parts of the tree of life presents computational challenges. Samples collected from the wild often contain sequences from several organisms, including the target, its cobionts, and contaminants. Effective methods are therefore needed to separate sequences. Though advances in sequencing technology make this task easier, it remains difficult to taxonomically assign sequences from eukaryotic taxa that are not well-represented in databases. Therefore, reference-based methods alone are insufficient. Here, I examine how we can take advantage of differences in sequence composition between organisms to identify symbionts, parasites and contaminants in samples, with minimal reliance on reference data. To this end, I explore data from the Darwin Tree of Life project, including hundreds of high-quality HiFi read sets from insects. Visualising two-dimensional representations of read tetranucleotide composition learned by a Variational Autoencoder can reveal distinct components of a sample. Annotating the embeddings with additional information, such as coding density, estimated coverage, or taxonomic labels allows rapid assessment of the contents of a dataset. The approach scales to millions of sequences, making it possible to explore unassembled read sets, even for large genomes. Combined with interactive visualisation tools, it allows a large fraction of cobionts reported by reference-based screening to be identified. Crucially, it also facilitates retrieving genomes for which suitable reference data are absent.

9.
Expert Rev Mol Diagn ; : 1-13, 2024 Aug 28.
Artigo em Inglês | MEDLINE | ID: mdl-39194060

RESUMO

INTRODUCTION: Sensorineural hearing impairment (SNHI), a common childhood disorder with heterogeneous genetic causes, can lead to delayed language development and psychosocial problems. Next-generation sequencing (NGS) offers high-throughput screening and high-sensitivity detection of genetic etiologies of SNHI, enabling clinicians to make informed medical decisions, provide tailored treatments, and improve prognostic outcomes. AREAS COVERED: This review covers the diverse etiologies of HHI and the utility of different NGS modalities (targeted sequencing and whole exome/genome sequencing), and includes HHI-related studies on newborn screening, genetic counseling, prognostic prediction, and personalized treatment. Challenges such as the trade-off between cost and diagnostic yield, detection of structural variants, and exploration of the non-coding genome are also highlighted. EXPERT OPINION: In the current landscape of NGS-based diagnostics for HHI, there are both challenges (e.g. detection of structural variants and non-coding genome variants) and opportunities (e.g. the emergence of medical artificial intelligence tools). The authors advocate the use of technological advances such as long-read sequencing for structural variant detection, multi-omics analysis for non-coding variant exploration, and medical artificial intelligence for pathogenicity assessment and outcome prediction. By integrating these innovations into clinical practice, precision medicine in the diagnosis and management of HHI can be further improved.

10.
Cells ; 13(15)2024 Jul 26.
Artigo em Inglês | MEDLINE | ID: mdl-39120292

RESUMO

Biallelic variants in USH2A are associated with retinitis pigmentosa (RP) and Type 2 Usher Syndrome (USH2), leading to impaired vision and, additionally, hearing loss in the latter. Although the introduction of next-generation sequencing into clinical diagnostics has led to a significant uplift in molecular diagnostic rates, many patients remain molecularly unsolved. It is thought that non-coding variants or variants of uncertain significance contribute significantly to this diagnostic gap. This study aims to demonstrate the clinical utility of the reverse transcription-polymerase chain reaction (RT-PCR)-Oxford Nanopore Technology (ONT) sequencing of USH2A mRNA transcripts from nasal epithelial cells to determine the splice-altering effect of candidate variants. Five affected individuals with USH2 or non-syndromic RP who had undergone whole genome sequencing were recruited for further investigation. All individuals had uncertain genotypes in USH2A, including deep intronic rare variants, c.8682-654C>G, c.9055+389G>A, and c.9959-2971C>T; a synonymous variant of uncertain significance, c.2139C>T; p.(Gly713=); and a predicted loss of function duplication spanning an intron/exon boundary, c.3812-3_3837dup p.(Met1280Ter). In silico assessment using SpliceAI provided splice-altering predictions for all candidate variants which were investigated using ONT sequencing. All predictions were found to be accurate; however, in the case of c.3812-3_3837dup, the outcome was a complex cryptic splicing pattern with predominant in-frame exon 18 skipping and a low level of exon 18 inclusion leading to the predicted stop gain. This study detected and functionally characterised simple and complex mis-splicing patterns in USH2A arising from previously unknown deep intronic variants and previously reported variants of uncertain significance, confirming the pathogenicity of the variants.


Assuntos
Proteínas da Matriz Extracelular , Splicing de RNA , Síndromes de Usher , Humanos , Proteínas da Matriz Extracelular/genética , Síndromes de Usher/genética , Feminino , Masculino , Splicing de RNA/genética , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Éxons/genética , Mutação/genética , Retinose Pigmentar/genética , Adulto , RNA Mensageiro/genética , RNA Mensageiro/metabolismo , Íntrons/genética , Pessoa de Meia-Idade
11.
Mar Drugs ; 22(8)2024 Jul 27.
Artigo em Inglês | MEDLINE | ID: mdl-39195458

RESUMO

Dinoflagellates are one of the largest groups of marine microalgae and exhibit diverse trophic strategies. Some dinoflagellates can produce secondary metabolites that are known to be toxic, which can lead to ecologically harmful blooms. Amphidinium carterae is one species of dinoflagellate that produces toxic compounds and is used as a model for dinoflagellate studies. The impact of the microbiome on A. carterae growth and metabolite synthesis is not yet fully understood, nor is the impact of bacterial data on sequencing and assembly. An antibiotic cocktail was previously shown to eliminate 16S amplification from the dinoflagellate culture. Even with drastically reduced bacterial numbers during antibiotic treatment, bacterial sequences were still present. In this experiment, we used novel Nanopore long-read sequencing techniques on A. carterae cultures to assemble 15 full bacterial genomes ranging from 2.9 to 6.0 Mb and found that the use of antibiotics decreased the percentage of reads mapping back to bacteria. We also identified shifts in the microbiome composition and identified a potentially deleterious bacterial species arising in the absence of the antibiotic treatment. Multiple antibiotic resistance genes were identified, as well as evidence that the bacterial population does not contribute to toxic secondary metabolite synthesis.


Assuntos
Antibacterianos , Dinoflagellida , Genoma Bacteriano , Microbiota , Dinoflagellida/genética , Antibacterianos/farmacologia , Bactérias/genética , Bactérias/metabolismo , Bactérias/efeitos dos fármacos
12.
mSphere ; : e0035624, 2024 Aug 27.
Artigo em Inglês | MEDLINE | ID: mdl-39191390

RESUMO

In this study, we employed short- and long-read sequencing technologies to delineate the transcriptional architecture of the human monkeypox virus and to identify key regulatory elements that govern its gene expression. Specifically, we conducted a transcriptomic analysis to annotate the transcription start sites (TSSs) and transcription end sites (TESs) of the virus by utilizing Cap Analysis of gene expression sequencing on the Illumina platform and direct RNA sequencing on the Oxford Nanopore technology device. Our investigations uncovered significant complexity in the use of alternative TSSs and TESs in viral genes. In this research, we also detected the promoter elements and poly(A) signals associated with the viral genes. Additionally, we identified novel genes in both the left and right variable regions of the viral genome.IMPORTANCEGenerally, gaining insight into how the transcription of a virus is regulated offers insights into the key mechanisms that control its life cycle. The recent outbreak of the human monkeypox virus has underscored the necessity of understanding the basic biology of its causative agent. Our results are pivotal for constructing a comprehensive transcriptomic atlas of the human monkeypox virus, providing valuable resources for future studies.

13.
medRxiv ; 2024 Jun 15.
Artigo em Inglês | MEDLINE | ID: mdl-39108517

RESUMO

Background: Mutations within the genes PRKN and PINK1 are the leading cause of early onset autosomal recessive Parkinson's disease (PD). However, the genetic cause of most early-onset PD (EOPD) cases still remains unresolved. Long-read sequencing has successfully identified many pathogenic structural variants that cause disease, but this technology has not been widely applied to PD. We recently identified the genetic cause of EOPD in a pair of monozygotic twins by uncovering a complex structural variant that spans over 7 Mb, utilizing Oxford Nanopore Technologies (ONT) long-read sequencing. In this study, we aimed to expand on this and assess whether a second variant could be detected with ONT long-read sequencing in other unresolved EOPD cases reported to carry one heterozygous variant in PRKN or PINK1. Methods: ONT long-read sequencing was performed on patients with one reported PRKN/PINK1 pathogenic variant. EOPD patients with an age at onset younger than 50 were included in this study. As a positive control, we also included EOPD patients who had already been identified to carry two known PRKN pathogenic variants. Initial genetic testing was performed using either short-read targeted panel sequencing for single nucleotide variants and multiplex ligation-dependent probe amplification (MLPA) for copy number variants. Results: 48 patients were included in this study (PRKN "one-variant" n = 24, PINK1 "one-variant" n = 12, PRKN "two-variants" n = 12). Using ONT long-read sequencing, we detected a second pathogenic variant in six PRKN "one-variant" patients (26%, 6/23) but none in the PINK1 "one-variant" patients (0%, 0/12). Long-read sequencing identified one case with a complex inversion, two instances of structural variant overlap, and three cases of duplication. In addition, in the positive control PRKN "two-variants" group, we were able to identify both pathogenic variants in PRKN in all the patients (100%, 12/12). Conclusions: This data highlights that ONT long-read sequencing is a powerful tool to identify a pathogenic structural variant at the PRKN locus that is often missed by conventional methods. Therefore, for cases where conventional methods fail to detect a second variant for EOPD, long-read sequencing should be considered as an alternative and complementary approach.

14.
Mitochondrial DNA B Resour ; 9(8): 1020-1023, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-39119347

RESUMO

Heptathela kimurai (Kishida, 1920) is a spider that belongs to the family Heptathelidae which is a basial lineage of spiders. The molecular information of ancestral species belonging to families like Heptathelidae is comparatively limited when compared to spider species from derived families. Here we present the complete mitochondrial genome sequence (mtDNA) of H. kimurai. The sequence was obtained using massively parallel sequencing technology. The circular genome was 14,224 bp in length, and the AT content was 69.53%. The H. kimurai mitochondrial genome contains 13 protein-coding genes (PCGs), 21 tRNA genes, and 2 rRNA genes. The majority of PCGs were found in the heavy strand.

15.
Genome Biol ; 25(1): 226, 2024 Aug 19.
Artigo em Inglês | MEDLINE | ID: mdl-39160564

RESUMO

Long-read sequencing holds great potential for characterizing complex microbial communities, yet taxonomic profiling tools designed specifically for long reads remain lacking. We introduce Melon, a novel marker-based taxonomic profiler that capitalizes on the unique attributes of long reads. Melon employs a two-stage classification scheme to reduce computational time and is equipped with an expectation-maximization-based post-correction module to handle ambiguous reads. Melon achieves superior performance compared to existing tools in both mock and simulated samples. Using wastewater metagenomic samples, we demonstrate the applicability of Melon by showing it provides reliable estimates of overall genome copies, and species-level taxonomic profiles.


Assuntos
Metagenômica , Metagenômica/métodos , Metagenoma , Marcadores Genéticos , Águas Residuárias/microbiologia , Software
16.
Clin Chim Acta ; 562: 119884, 2024 Aug 15.
Artigo em Inglês | MEDLINE | ID: mdl-39038592

RESUMO

BACKGROUND: Hemophilia A (HA) is an X-linked recessive genetic disorder caused by pathogenic variations of the factor VIII -encoding gene, F8 gene. Due to the large size and diverse types of variations in the F8 gene, causative mutations in F8 cannot be simultaneously detected in one step by traditional molecular analysis, and genetic molecular diagnosis and prenatal screening of HA still face significant difficulties and challenges in clinical practice. Therefore, we aimed to develop and validate an efficient, accurate, and time-saving method for the genetic detection of HA. METHODS: A comprehensive analysis of hemophilia A (CAHEA) method based on long-range PCR and long-read sequencing (LRS) was used to detect F8 gene mutations in 14 clinical HA samples. The LRS results were compared with those of the conventional methods to evaluate the accuracy and sensitivity of the proposed approach. RESULTS: The CAHEA method successfully identified 14 F8 variants in all probands, including 3 small insertion deletions, 4 single nucleotide variants, and 7 intron 22 inversions in a "one-step" manner, of which 2 small deletions have not been reported previously. Moreover, this method provided an opportunity to analyze the mechanism of rearrangement and the pathogenicity of F8 variants. The LRS results were validated and found to be in 100% agreement with those obtained using the conventional method. CONCLUSION: Our proposed LRS-based F8 gene detection method is an accurate and reproducible genetic screening and diagnostic method with significant clinical value. It provides efficient, comprehensive, and accurate genetic screening and diagnostic services for individuals at high risk of HA as well as for premarital and prenatal populations.


Assuntos
Fator VIII , Hemofilia A , Hemofilia A/genética , Hemofilia A/diagnóstico , Humanos , Fator VIII/genética , Variação Genética , Mutação , Reação em Cadeia da Polimerase/métodos , Análise de Sequência de DNA
17.
Cancer Lett ; 598: 217121, 2024 Aug 28.
Artigo em Inglês | MEDLINE | ID: mdl-39009069

RESUMO

Retinoblastoma (RB) is the most common intraocular malignancy in childhood. The causal variants in RB are mostly characterized by previously used short-read sequencing (SRS) analysis, which has technical limitations in identifying structural variants (SVs) and phasing information. Long-read sequencing (LRS) technology has advantages over SRS in detecting SVs, phased genetic variants, and methylation. In this study, we comprehensively characterized the genetic landscape of RB using combinatorial LRS and SRS of 16 RB tumors and 16 matched blood samples. We detected a total of 232 somatic SVs, with an average of 14.5 SVs per sample across the whole genome in our cohort. We identified 20 distinct pathogenic variants disrupting RB1 gene, including three novel small variants and five somatic SVs. We found more somatic SVs were detected from LRS than SRS (140 vs. 122) in RB samples with WGS data, particularly the insertions (18 vs. 1). Furthermore, our analysis shows that, with the exception of one sample who lacked the methylation data, all samples presented biallelic inactivation of RB1 in various forms, including two cases with the biallelic hypermethylated promoter and four cases with compound heterozygous mutations which were missing in SRS analysis. By inferring relative timing of somatic events, we reveal the genetic progression that RB1 disruption early and followed by copy number changes, including amplifications of Chr2p and deletions of Chr16q, during RB tumorigenesis. Altogether, we characterize the comprehensive genetic landscape of RB, providing novel insights into the genetic alterations and mechanisms contributing to RB initiation and development. Our work also establishes a framework to analyze genomic landscape of cancers based on LRS data.


Assuntos
Metilação de DNA , Neoplasias da Retina , Proteínas de Ligação a Retinoblastoma , Retinoblastoma , Humanos , Retinoblastoma/genética , Retinoblastoma/patologia , Proteínas de Ligação a Retinoblastoma/genética , Neoplasias da Retina/genética , Neoplasias da Retina/patologia , Masculino , Feminino , Mutação , Ubiquitina-Proteína Ligases/genética , Pré-Escolar , Criança , Lactente , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Sequenciamento Completo do Genoma/métodos
18.
BMC Genom Data ; 25(1): 70, 2024 Jul 15.
Artigo em Inglês | MEDLINE | ID: mdl-39009995

RESUMO

OBJECTIVES: Ants are ecologically dominant insects in most terrestrial ecosystems, with more than 14,000 extant species in about 340 genera recorded to date. However, genomic resources are still scarce for most species, especially for species endemic in East or Southeast Asia, limiting the study of phylogeny, speciation and adaptation of this evolutionarily successful animal lineage. Here, we assemble and annotate the genomes of Odontoponera transversa and Camponotus friedae, two ant species with a natural distribution in China, to facilitate future study of ant evolution. DATA DESCRIPTION: We obtained a total of 16 Gb and 51 Gb PacBio HiFi data for O. transversa and C. friedae, respectively, which were assembled into the draft genomes of 339 Mb for O. transversa and 233 Mb for C. friedae. Genome assessments by multiple metrics showed good completeness and high accuracy of the two assemblies. Gene annotations assisted by RNA-seq data yielded a comparable number of protein-coding genes in the two genomes (10,892 for O. transversa and 11,296 for C. friedae), while repeat annotations revealed a remarkable difference of repeat content between these two ant species (149.4 Mb for O. transversa versus 49.7 Mb for C. friedae). Besides, complete mitochondrial genomes for the two species were assembled and annotated.


Assuntos
Formigas , Genoma de Inseto , Animais , Formigas/genética , Formigas/classificação , Genoma de Inseto/genética , Anotação de Sequência Molecular , Filogenia , Genômica/métodos
19.
Mol Genet Genomics ; 299(1): 65, 2024 Jul 07.
Artigo em Inglês | MEDLINE | ID: mdl-38972030

RESUMO

BACKGROUND: A large number of challenging medically relevant genes (CMRGs) are situated in complex or highly repetitive regions of the human genome, hindering comprehensive characterization of genetic variants using next-generation sequencing technologies. In this study, we employed long-read sequencing technology, extensively utilized in studying complex genomic regions, to characterize genetic alterations, including short variants (single nucleotide variants and short insertions and deletions) and copy number variations, in 370 CMRGs across 41 individuals from 19 global populations. RESULTS: Our analysis revealed high levels of genetic variants in CMRGs, with 68.73% exhibiting copy number variations and 65.20% containing short variants that may disrupt protein function across individuals. Such variants can influence pharmacogenomics, genetic disease susceptibility, and other clinical outcomes. We observed significant differences in CMRG variation across populations, with individuals of African ancestry harboring the highest number of copy number variants and short variants compared to samples from other continents. Notably, 15.79% to 33.96% of short variants were exclusively detectable through long-read sequencing. While the T2T-CHM13 reference genome significantly improved the assembly of CMRG regions, thereby facilitating variant detection in these regions, some regions still lacked resolution. CONCLUSION: Our results provide an important reference for future clinical and pharmacogenetic studies, highlighting the need for a comprehensive representation of global genetic diversity in the reference genome and improved variant calling techniques to fully resolve medically relevant genes.


Assuntos
Variações do Número de Cópias de DNA , Genoma Humano , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Variações do Número de Cópias de DNA/genética , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Genoma Humano/genética , Polimorfismo de Nucleotídeo Único/genética , Variação Genética/genética , Predisposição Genética para Doença , Genética Populacional/métodos , Mutação INDEL
20.
BMC Genomics ; 25(1): 679, 2024 Jul 08.
Artigo em Inglês | MEDLINE | ID: mdl-38978005

RESUMO

BACKGROUND: Oxford Nanopore provides high throughput sequencing platforms able to reconstruct complete bacterial genomes with 99.95% accuracy. However, even small levels of error can obscure the phylogenetic relationships between closely related isolates. Polishing tools have been developed to correct these errors, but it is uncertain if they obtain the accuracy needed for the high-resolution source tracking of foodborne illness outbreaks. RESULTS: We tested 132 combinations of assembly and short- and long-read polishing tools to assess their accuracy for reconstructing the genome sequences of 15 highly similar Salmonella enterica serovar Newport isolates from a 2020 onion outbreak. While long-read polishing alone improved accuracy, near perfect accuracy (99.9999% accuracy or ~ 5 nucleotide errors across the 4.8 Mbp genome, excluding low confidence regions) was only obtained by pipelines that combined both long- and short-read polishing tools. Notably, medaka was a more accurate and efficient long-read polisher than Racon. Among short-read polishers, NextPolish showed the highest accuracy, but Pilon, Polypolish, and POLCA performed similarly. Among the 5 best performing pipelines, polishing with medaka followed by NextPolish was the most common combination. Importantly, the order of polishing tools mattered i.e., using less accurate tools after more accurate ones introduced errors. Indels in homopolymers and repetitive regions, where the short reads could not be uniquely mapped, remained the most challenging errors to correct. CONCLUSIONS: Short reads are still needed to correct errors in nanopore sequenced assemblies to obtain the accuracy required for source tracking investigations. Our granular assessment of the performance of the polishing pipelines allowed us to suggest best practices for tool users and areas for improvement for tool developers.


Assuntos
Benchmarking , Surtos de Doenças , Genoma Bacteriano , Nanoporos , Sequenciamento por Nanoporos/métodos , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Salmonella enterica/genética , Salmonella enterica/isolamento & purificação , Humanos , Filogenia
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA