Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 23.367
Filtrar
1.
BMC Genomics ; 25(1): 558, 2024 Jun 05.
Artigo em Inglês | MEDLINE | ID: mdl-38834950

RESUMO

BACKGROUND: Indigenous Chinese cattle have abundant genetic diversity and a long history of artificial selection, giving local breeds advantages in adaptability, forage tolerance and resistance. The detection of selective sweeps and comparative genome analysis of selected breeds and ancestral populations provide a basis for understanding differences among breeds and for the identification and utilization of candidate genes. We investigated genetic diversity, population structure, and signatures of selection using genome-wide sequencing data for a new breed of Qinchuan cattle (QNC, n = 21), ancestral Qinchuan cattle (QCC, n = 20), and Zaosheng cattle (ZSC, n = 19). RESULTS: A population structure analysis showed that the ancestry components of QNC and ZSC were similar. In addition, the QNC and ZSC groups showed higher proportions of European taurine ancestry than that of QCC, and this may explain the larger body size of QNC, approaching that of European cattle under long-term domestication and selection. A neighbor-joining tree revealed that QCC individuals were closely related, whereas QNC formed a distinct group. To search for signatures of selection in the QNC genome, we evaluated nucleotide diversity (θπ), the fixation index (FST) and Tajima's D. Overlapping selective sweeps were enriched for one KEGG pathway, the apelin signaling pathway, and included five candidate genes (MEF2A, SMAD2, CAMK4, RPS6, and PIK3CG). We performed a comprehensive review of genomic variants in QNC, QCC, and ZSC using whole-genome sequencing data. QCC was rich in novel genetic diversity, while diversity in QNC and ZSC cattle was reduced due to strong artificial selection, with divergence from the original cattle. CONCLUSIONS: We identified candidate genes associated with production traits. These results support the success of selective breeding and can guide further breeding and resource conservation of Qinchuan cattle.


Assuntos
Variação Genética , Seleção Genética , Animais , Bovinos/genética , Genômica/métodos , Polimorfismo de Nucleotídeo Único , Genética Populacional , Estudo de Associação Genômica Ampla , Genoma , Cruzamento
2.
Cancer Epidemiol Biomarkers Prev ; 33(6): 766-768, 2024 Jun 03.
Artigo em Inglês | MEDLINE | ID: mdl-38826080

RESUMO

Mitochondrial DNA (mtDNA) has emerged as a pivotal component in understanding the etiology and susceptibility of cancer. A recent study by Chen and colleagues delineated the germline genetic effect of mtDNA single-nucleotide polymorphisms (SNP) and haplogroups across pan-cancer risk. They identified a subset of mtSNPs and the corresponding risk score, as well as haplogroups A and M7 alongside their genetic interactions, conferring a protective effect against various cancers. These findings underscored the value of mtDNA variations as biomarkers for cancer etiology and as tools for cancer risk stratification. Future investigations are encouraged to integrate comprehensive omics data of genomics, transcriptomics, proteomics, and metabolomics, etc., from nuclear DNA with mtDNA variations, alongside single-cell and spatial technologies, to unravel the tumor mechanism and identify the drug targets. Moreover, the incorporation of polygenic risk score, that included mtDNA variations with both rare and common frequencies, and liquid biopsy-based biomarkers would enhance the predictive performance of cancer risk assessment and refine the risk stratification of population-based cancer screening. This commentary advocates for the validation across diverse populations to harness the full potential of mitochondrial genomics, and ultimately paves the prospective way for advancements in personalized cancer therapeutics and prevention strategies. See related article by Chen and colleagues, Cancer Epidemiol Biomarkers Prev 2024;33:381-8.


Assuntos
DNA Mitocondrial , Genômica , Neoplasias , Humanos , DNA Mitocondrial/genética , Neoplasias/genética , Genômica/métodos , Polimorfismo de Nucleotídeo Único , Biomarcadores Tumorais/genética , Predisposição Genética para Doença , Estudos Prospectivos
3.
Curr Protoc ; 4(6): e1055, 2024 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-38837690

RESUMO

Data harmonization involves combining data from multiple independent sources and processing the data to produce one uniform dataset. Merging separate genotypes or whole-genome sequencing datasets has been proposed as a strategy to increase the statistical power of association tests by increasing the effective sample size. However, data harmonization is not a widely adopted strategy due to the difficulties with merging data (including confounding produced by batch effects and population stratification). Detailed data harmonization protocols are scarce and are often conflicting. Moreover, data harmonization protocols that accommodate samples of admixed ancestry are practically non-existent. Existing data harmonization procedures must be modified to ensure the heterogeneous ancestry of admixed individuals is incorporated into additional downstream analyses without confounding results. Here, we propose a set of guidelines for merging multi-platform genetic data from admixed samples that can be adopted by any investigator with elementary bioinformatics experience. We have applied these guidelines to aggregate 1544 tuberculosis (TB) case-control samples from six separate in-house datasets and conducted a genome-wide association study (GWAS) of TB susceptibility. The GWAS performed on the merged dataset had improved power over analyzing the datasets individually and produced summary statistics free from bias introduced by batch effects and population stratification. © 2024 Wiley Periodicals LLC. Basic Protocol 1: Processing separate datasets comprising array genotype data Alternate Protocol 1: Processing separate datasets comprising array genotype and whole-genome sequencing data Alternate Protocol 2: Performing imputation using a local reference panel Basic Protocol 2: Merging separate datasets Basic Protocol 3: Ancestry inference using ADMIXTURE and RFMix Basic Protocol 4: Batch effect correction using pseudo-case-control comparisons.


Assuntos
Estudo de Associação Genômica Ampla , Humanos , Estudo de Associação Genômica Ampla/métodos , Estudo de Associação Genômica Ampla/normas , Genômica/métodos , Genômica/normas , Tuberculose/genética , Estudos de Casos e Controles , Guias como Assunto , Predisposição Genética para Doença
4.
BMC Cancer ; 24(1): 673, 2024 Jun 03.
Artigo em Inglês | MEDLINE | ID: mdl-38825709

RESUMO

Hepatocellular carcinoma (HCC) genomic research has discovered actionable genetic changes that might guide treatment decisions and clinical trials. Nonetheless, due to a lack of large-scale multicenter clinical validation, these putative targets have not been converted into patient survival advantages. So, it's crucial to ascertain whether genetic analysis is clinically feasible, useful, and whether it can be advantageous for patients. We sequenced tumour tissue and blood samples (as normal controls) from 111 Chinese HCC patients at Qingdao University Hospital using the 508-gene panel and the 688-gene panel, respectively. Approximately 95% of patients had gene variations related to targeted treatment, with 50% having clinically actionable mutations that offered significant information for targeted therapy. Immune cell infiltration was enhanced in individuals with TP53 mutations but decreased in patients with CTNNB1 and KMT2D mutations. More notably, we discovered that SPEN, EPPK1, and BRCA2 mutations were related to decreased median overall survival, although MUC16 mutations were not. Furthermore, we found mutant MUC16 as an independent protective factor for the prognosis of HCC patients after curative hepatectomy. In conclusion, this study connects genetic abnormalities to clinical practice and potentially identifies individuals with poor prognoses who may benefit from targeted treatment or immunotherapy.


Assuntos
Carcinoma Hepatocelular , Neoplasias Hepáticas , Mutação , Humanos , Neoplasias Hepáticas/genética , Neoplasias Hepáticas/terapia , Neoplasias Hepáticas/mortalidade , Neoplasias Hepáticas/patologia , Carcinoma Hepatocelular/genética , Carcinoma Hepatocelular/terapia , Carcinoma Hepatocelular/mortalidade , Carcinoma Hepatocelular/patologia , Masculino , Feminino , Prognóstico , Pessoa de Meia-Idade , Idoso , Adulto , Biomarcadores Tumorais/genética , Genômica/métodos , Proteína BRCA2/genética , Terapia de Alvo Molecular , Hepatectomia , Perfilação da Expressão Gênica , Proteína Supressora de Tumor p53/genética , Proteínas de Ligação a DNA , Proteínas de Neoplasias , beta Catenina
5.
Microb Genom ; 10(6)2024 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-38833287

RESUMO

It is now possible to assemble near-perfect bacterial genomes using Oxford Nanopore Technologies (ONT) long reads, but short-read polishing is usually required for perfection. However, the effect of short-read depth on polishing performance is not well understood. Here, we introduce Pypolca (with default and careful parameters) and Polypolish v0.6.0 (with a new careful parameter). We then show that: (1) all polishers other than Pypolca-careful, Polypolish-default and Polypolish-careful commonly introduce false-positive errors at low read depth; (2) most of the benefit of short-read polishing occurs by 25× depth; (3) Polypolish-careful almost never introduces false-positive errors at any depth; and (4) Pypolca-careful is the single most effective polisher. Overall, we recommend the following polishing strategies: Polypolish-careful alone when depth is very low (<5×), Polypolish-careful and Pypolca-careful when depth is low (5-25×), and Polypolish-default and Pypolca-careful when depth is sufficient (>25×).


Assuntos
Genoma Bacteriano , Nanoporos , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Análise de Sequência de DNA/métodos , Sequenciamento por Nanoporos/métodos , Bactérias/genética , Bactérias/classificação , Software , Genômica/métodos
6.
BMC Genomics ; 25(1): 559, 2024 Jun 05.
Artigo em Inglês | MEDLINE | ID: mdl-38840048

RESUMO

BACKGROUND: The crossbreeding of specialized beef cattle breeds with Chinese indigenous cattle is a common method of genetic improvement. Xia'nan cattle, a crossbreed of Charolais and Nanyang cattle, is China's first specialized beef cattle breed with independent intellectual property rights. After more than two decades of selective breeding, Xia'nan cattle exhibit a robust physique, good environmental adaptability, good tolerance to coarse feed, and high meat production rates. This study analyzed the population genetic structure, genetic diversity, and genomic variations of Xia'nan cattle using whole-genome sequencing data from 30 Xia'nan cattle and 178 published cattle genomic data. RESULT: The ancestry estimating composition analysis showed that the ancestry proportions for Xia'nan cattle were mainly Charolais with a small amount of Nanyang cattle. Through the genetic diversity studies (nucleotide diversity and linkage disequilibrium decay), we found that the genomic diversity of Xia'nan cattle is higher than that of specialized beef cattle breeds in Europe but lower than that of Chinese native cattle. Then, we used four methods to detect genome candidate regions influencing the excellent traits of Xia'nan cattle. Among the detected results, 42 genes (θπ and CLR) and 131 genes (FST and XP-EHH) were detected by two different detection strategies. In addition, we found a region in BTA8 with strong selection signals. Finally, we conducted functional annotation on the detected genes and found that these genes may influence body development (NR6A1), meat quality traits (MCCC1), growth traits (WSCD1, TMEM68, MFN1, NCKAP5), and immunity (IL11RA, CNTFR, CCL27, SLAMF1, SLAMF7, NAA35, and GOLM1). CONCLUSION: We elucidated the genomic features and population structure of Xia'nan cattle and detected some selection signals in genomic regions potentially associated with crucial economic traits in Xia'nan cattle. This research provided a basis for further breeding improvements in Xia'nan cattle and served as a reference for genetic enhancements in other crossbreed cattle.


Assuntos
Variação Genética , Seleção Genética , Sequenciamento Completo do Genoma , Bovinos/genética , Animais , Sequenciamento Completo do Genoma/métodos , Desequilíbrio de Ligação , Genômica/métodos , Polimorfismo de Nucleotídeo Único , Genoma , Genética Populacional , Cruzamento , Locos de Características Quantitativas , Fenótipo
7.
Planta ; 260(1): 18, 2024 Jun 05.
Artigo em Inglês | MEDLINE | ID: mdl-38837044

RESUMO

MAIN CONCLUSION: We have developed and optimized a rapid, versatile Agrobacterium-mediated transient expression system for cannabis seedlings that can be used in functional genomics studies of both hemp-type and drug-type cannabis. Cannabis (Cannabis sativa L.) holds great promise in the medical and food industries due to its diverse chemical composition, including specialized cannabinoids. However, the study of key genes involved in various biological processes, including secondary metabolite biosynthesis, has been hampered by the lack of efficient in vivo functional analysis methods. Here, we present a novel, short-cycle, high-efficiency transformation method for cannabis seedlings using Agrobacterium tumefaciens. We used the RUBY reporter system to monitor transformation results without the need for chemical treatments or specialized equipment. Four strains of A. tumefaciens (GV3101, EHA105, LBA4404, and AGL1) were evaluated for transformation efficiency, with LBA4404 and AGL1 showing superior performance. The versatility of the system was further demonstrated by successful transformation with GFP and GUS reporter genes. In addition, syringe infiltration was explored as an alternative to vacuum infiltration, offering simplicity and efficiency for high-throughput applications. Our method allows rapid and efficient in vivo transformation of cannabis seedlings, facilitating large-scale protein expression and high-throughput characterization studies.


Assuntos
Agrobacterium tumefaciens , Cannabis , Genômica , Plântula , Transformação Genética , Agrobacterium tumefaciens/genética , Plântula/genética , Genômica/métodos , Cannabis/genética , Cannabis/metabolismo , Plantas Geneticamente Modificadas , Genes Reporter , Proteínas de Fluorescência Verde/genética , Proteínas de Fluorescência Verde/metabolismo
8.
Commun Biol ; 7(1): 675, 2024 Jun 01.
Artigo em Inglês | MEDLINE | ID: mdl-38824179

RESUMO

The three-dimensional (3D) organization of genome is fundamental to cell biology. To explore 3D genome, emerging high-throughput approaches have produced billions of sequencing reads, which is challenging and time-consuming to analyze. Here we present Microcket, a package for mapping and extracting interacting pairs from 3D genomics data, including Hi-C, Micro-C, and derivant protocols. Microcket utilizes a unique read-stitch strategy that takes advantage of the long read cycles in modern DNA sequencers; benchmark evaluations reveal that Microcket runs much faster than the current tools along with improved mapping efficiency, and thus shows high potential in accelerating and enhancing the biological investigations into 3D genome. Microcket is freely available at https://github.com/hellosunking/Microcket .


Assuntos
Genômica , Software , Genômica/métodos , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Humanos , Análise de Sequência de DNA/métodos , Análise de Dados
9.
PLoS One ; 19(6): e0303628, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-38843230

RESUMO

Genes strictly regulate the development of teeth and their surrounding oral structures. Alteration of gene regulation leads to tooth disorders and developmental anomalies in tooth, oral, and facial regions. With the advancement of gene sequencing technology, genomic data is rapidly increasing. However, the large sets of genomic and proteomic data related to tooth development and dental disorders are currently dispersed in many primary databases and literature, making it difficult for users to navigate, extract, study, or analyze. We have curated the scattered genetic data on tooth development and created a knowledgebase called 'Bioinformatics for Dentistry' (https://dentalbioinformatics.com/). This database compiles genomic and proteomic data on human tooth development and developmental anomalies and organizes them according to their roles in different stages of tooth development. The database is built by systemically curating relevant data from the National Library of Medicine (NCBI) GenBank, OMIM: Online Mendelian Inheritance in Man, AlphaFold Protein Structure Database, Reactome pathway knowledgebase, Wiki Pathways, and PubMed. The accuracy of the included data was verified from supporting primary literature. Upon data curation and validation, a simple, easy-to-navigate browser interface was created on WordPress version 6.3.2, with PHP version 8.0. The website is hosted in a cloud hosting service to provide fast and reliable data transfer rate. Plugins are used to ensure the browser's compatibility across different devices. Bioinformatics for Dentistry contains four embedded filters for complex and specific searches and free-text search options for quick and simple searching through the datasets. Bioinformatics for Dentistry is made freely available worldwide, with the hope that this knowledgebase will improve our understanding of the complex genetic regulation of tooth development and will open doors to research initiatives and discoveries. This database will be expanded in the future by incorporating resources and built-in sequence analysis tools, and it will be maintained and updated annually.


Assuntos
Biologia Computacional , Bases de Dados Genéticas , Dente , Humanos , Biologia Computacional/métodos , Dente/crescimento & desenvolvimento , Odontogênese/genética , Odontologia , Proteômica/métodos , Genômica/métodos
10.
Circ Res ; 134(12): 1681-1702, 2024 Jun 07.
Artigo em Inglês | MEDLINE | ID: mdl-38843288

RESUMO

Throughout our lifetime, each beat of the heart requires the coordinated action of multiple cardiac cell types. Understanding cardiac cell biology, its intricate microenvironments, and the mechanisms that govern their function in health and disease are crucial to designing novel therapeutical and behavioral interventions. Recent advances in single-cell and spatial omics technologies have significantly propelled this understanding, offering novel insights into the cellular diversity and function and the complex interactions of cardiac tissue. This review provides a comprehensive overview of the cellular landscape of the heart, bridging the gap between suspension-based and emerging in situ approaches, focusing on the experimental and computational challenges, comparative analyses of mouse and human cardiac systems, and the rising contextualization of cardiac cells within their niches. As we explore the heart at this unprecedented resolution, integrating insights from both mouse and human studies will pave the way for novel diagnostic tools and therapeutic interventions, ultimately improving outcomes for patients with cardiovascular diseases.


Assuntos
Análise de Célula Única , Humanos , Animais , Análise de Célula Única/métodos , Miocárdio/metabolismo , Miocárdio/patologia , Miócitos Cardíacos/metabolismo , Genômica/métodos , Camundongos
11.
Sci Rep ; 14(1): 13058, 2024 06 06.
Artigo em Inglês | MEDLINE | ID: mdl-38844774

RESUMO

The incidence of vulvar carcinoma varies by race; however, it is a rare disease, and its genomic profiles remain largely unknown. This study examined the characteristics of vulvar squamous cell carcinoma (VSCC) in Japanese patients, focusing on genomic profiles and potential racial disparities. The study included two Japanese groups: the National Cancer Center Hospital (NCCH) group comprised 19 patients diagnosed between 2015 and 2023, and the Center for Cancer Genomics and Advanced Therapeutics group comprised 29 patients diagnosed between 2019 and 2022. Somatic mutations were identified by targeted or panel sequencing, and TP53 was identified as the most common mutation (52-81%), followed by HRAS (7-26%), CDKN2A (21-24%), and PIK3CA (5-10%). The mutation frequencies, except for TP53, were similar to those of Caucasian cohorts. In the NCCH group, 16 patients of HPV-independent tumors were identified by immunohistochemistry and genotyping. Univariate analysis revealed that TP53-mutated patients were associated with a poor prognosis (log-rank test, P = 0.089). Japanese VSCC mutations resembled those of Caucasian vulvar carcinomas, and TP53 mutations predicted prognosis regardless of ethnicity. The present findings suggest potential molecular-targeted therapies for select VSCC patients.


Assuntos
Carcinoma de Células Escamosas , Mutação , Proteína Supressora de Tumor p53 , Neoplasias Vulvares , Humanos , Feminino , Neoplasias Vulvares/genética , Neoplasias Vulvares/patologia , Carcinoma de Células Escamosas/genética , Carcinoma de Células Escamosas/patologia , Pessoa de Meia-Idade , Idoso , Proteína Supressora de Tumor p53/genética , Japão/epidemiologia , Idoso de 80 Anos ou mais , Inibidor p16 de Quinase Dependente de Ciclina/genética , Classe I de Fosfatidilinositol 3-Quinases/genética , Prognóstico , Adulto , Povo Asiático/genética , Genômica/métodos , Proteínas Proto-Oncogênicas p21(ras)/genética , População do Leste Asiático
12.
BMC Genom Data ; 25(1): 53, 2024 Jun 06.
Artigo em Inglês | MEDLINE | ID: mdl-38844844

RESUMO

OBJECTIVES: The new data provide an important genomic resource for the Critically Endangered Cuban crocodile (Crocodylus rhombifer). Cuban crocodiles are restricted to the Zapata Swamp in southern Matanzas Province, Cuba, and readily hybridize with the widespread American crocodile (Crocodylus acutus) in areas of sympatry. The reported de novo assembly will contribute to studies of crocodylian evolutionary history and provide a resource for informing Cuban crocodile conservation. DATA DESCRIPTION: The final 2.2 Gb draft genome for C. rhombifer consists of 41,387 scaffolds (contigs: N50 = 104.67 Kb; scaffold: N50-518.55 Kb). Benchmarking Universal Single-Copy Orthologs (BUSCO) identified 92.3% of the 3,354 genes in the vertebrata_odb10 database. Approximately 42% of the genome (960Mbp) comprises repeat elements. We predicted 30,138 unique protein-coding sequences (17,737 unique genes) in the genome assembly. Functional annotation found the top Gene Ontology annotations for Biological Processes, Molecular Function, and Cellular Component were regulation, protein, and intracellular, respectively. This assembly will support future macroevolutionary, conservation, and molecular studies of the Cuban crocodile.


Assuntos
Jacarés e Crocodilos , Genoma , Anotação de Sequência Molecular , Jacarés e Crocodilos/genética , Animais , Genoma/genética , Cuba , Genômica/métodos
13.
Genome Med ; 16(1): 76, 2024 Jun 04.
Artigo em Inglês | MEDLINE | ID: mdl-38835075

RESUMO

BACKGROUND: Accurate prediction of an individual's predisposition to diseases is vital for preventive medicine and early intervention. Various statistical and machine learning models have been developed for disease prediction using clinico-genomic data. However, the accuracy of clinico-genomic prediction of diseases may vary significantly across ancestry groups due to their unequal representation in clinical genomic datasets. METHODS: We introduced a deep transfer learning approach to improve the performance of clinico-genomic prediction models for data-disadvantaged ancestry groups. We conducted machine learning experiments on multi-ancestral genomic datasets of lung cancer, prostate cancer, and Alzheimer's disease, as well as on synthetic datasets with built-in data inequality and distribution shifts across ancestry groups. RESULTS: Deep transfer learning significantly improved disease prediction accuracy for data-disadvantaged populations in our multi-ancestral machine learning experiments. In contrast, transfer learning based on linear frameworks did not achieve comparable improvements for these data-disadvantaged populations. CONCLUSIONS: This study shows that deep transfer learning can enhance fairness in multi-ancestral machine learning by improving prediction accuracy for data-disadvantaged populations without compromising prediction accuracy for other populations, thus providing a Pareto improvement towards equitable clinico-genomic prediction of diseases.


Assuntos
Genômica , Aprendizado de Máquina , Humanos , Genômica/métodos , Predisposição Genética para Doença , Doença de Alzheimer/genética , Masculino , Neoplasias da Próstata/genética , Neoplasias Pulmonares/genética
14.
Front Immunol ; 15: 1401320, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-38835769

RESUMO

Host-microbe interactions are complex and ever-changing, especially during infections, which can significantly impact human physiology in both health and disease by influencing metabolic and immune functions. Infections caused by pathogens such as bacteria, viruses, fungi, and parasites are the leading cause of global mortality. Microbes have evolved various immune evasion strategies to survive within their hosts, which presents a multifaceted challenge for detection. Intracellular microbes, in particular, target specific cell types for survival and replication and are influenced by factors such as functional roles, nutrient availability, immune evasion, and replication opportunities. Identifying intracellular microbes can be difficult because of the limitations of traditional culture-based methods. However, advancements in integrated host microbiome single-cell genomics and transcriptomics provide a promising basis for personalized treatment strategies. Understanding host-microbiota interactions at the cellular level may elucidate disease mechanisms and microbial pathogenesis, leading to targeted therapies. This article focuses on how intracellular microbes reside in specific cell types, modulating functions through persistence strategies to evade host immunity and prolong colonization. An improved understanding of the persistent intracellular microbe-induced differential disease outcomes can enhance diagnostics, therapeutics, and preventive measures.


Assuntos
Genômica , Análise de Célula Única , Humanos , Genômica/métodos , Animais , Interações Hospedeiro-Patógeno/imunologia , Interações Hospedeiro-Patógeno/genética , Interações entre Hospedeiro e Microrganismos/imunologia , Interações entre Hospedeiro e Microrganismos/genética , Evasão da Resposta Imune , Microbiota/imunologia , Bactérias/genética , Bactérias/imunologia , Índice de Gravidade de Doença
15.
Brief Bioinform ; 25(4)2024 May 23.
Artigo em Inglês | MEDLINE | ID: mdl-38836702

RESUMO

Non-invasive prenatal testing (NIPT) is a quite popular approach for detecting fetal genomic aneuploidies. However, due to the limitations on sequencing read length and coverage, NIPT suffers a bottleneck on further improving performance and conducting earlier detection. The errors mainly come from reference biases and population polymorphism. To break this bottleneck, we proposed NIPT-PG, which enables the NIPT algorithm to learn from population data. A pan-genome model is introduced to incorporate variant and polymorphic loci information from tested population. Subsequently, we proposed a sequence-to-graph alignment method, which considers the read mis-match rates during the mapping process, and an indexing method using hash indexing and adjacency lists to accelerate the read alignment process. Finally, by integrating multi-source aligned read and polymorphic sites across the pan-genome, NIPT-PG obtains a more accurate z-score, thereby improving the accuracy of chromosomal aneuploidy detection. We tested NIPT-PG on two simulated datasets and 745 real-world cell-free DNA sequencing data sets from pregnant women. Results demonstrate that NIPT-PG outperforms the standard z-score test. Furthermore, combining experimental and theoretical analyses, we demonstrate the probably approximately correct learnability of NIPT-PG. In summary, NIPT-PG provides a new perspective for fetal chromosomal aneuploidies detection. NIPT-PG may have broad applications in clinical testing, and its detection results can serve as a reference for false positive samples approaching the critical threshold.


Assuntos
Aneuploidia , Teste Pré-Natal não Invasivo , Humanos , Feminino , Gravidez , Teste Pré-Natal não Invasivo/métodos , Algoritmos , Genômica/métodos , Diagnóstico Pré-Natal/métodos , Análise de Sequência de DNA/métodos
16.
J Cell Mol Med ; 28(11): e18408, 2024 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-38837585

RESUMO

We employed single-cell analysis techniques, specifically the inferCNV method, to dissect the complex progression of lung adenocarcinoma (LUAD) from adenocarcinoma in situ (AIS) through minimally invasive adenocarcinoma (MIA) to invasive adenocarcinoma (IAC). This approach enabled the identification of Cluster 6, which was significantly associated with LUAD progression. Our comprehensive analysis included intercellular interaction, transcription factor regulatory networks, trajectory analysis, and gene set variation analysis (GSVA), leading to the development of the lung progression associated signature (LPAS). Interestingly, we discovered that the LPAS not only accurately predicts the prognosis of LUAD patients but also forecasts genomic alterations, distinguishes between 'cold' and 'hot' tumours, and identifies potential candidates suitable for immunotherapy. PSMB1, identified within Cluster 6, was experimentally shown to significantly enhance cancer cell invasion and migration, highlighting the clinical relevance of LPAS in predicting LUAD progression and providing a potential target for therapeutic intervention. Our findings suggest that LPAS offers a novel biomarker for LUAD patient stratification, with significant implications for improving prognostic accuracy and guiding treatment decisions.


Assuntos
Adenocarcinoma de Pulmão , Progressão da Doença , Regulação Neoplásica da Expressão Gênica , Genômica , Neoplasias Pulmonares , Análise de Célula Única , Humanos , Adenocarcinoma de Pulmão/genética , Adenocarcinoma de Pulmão/patologia , Prognóstico , Análise de Célula Única/métodos , Neoplasias Pulmonares/genética , Neoplasias Pulmonares/patologia , Genômica/métodos , Biomarcadores Tumorais/genética , Biomarcadores Tumorais/metabolismo , Redes Reguladoras de Genes , Linhagem Celular Tumoral , Perfilação da Expressão Gênica , Invasividade Neoplásica
17.
Gigascience ; 132024 Jan 02.
Artigo em Inglês | MEDLINE | ID: mdl-38837943

RESUMO

Genomic information is increasingly used to inform medical treatments and manage future disease risks. However, any personal and societal gains must be carefully balanced against the risk to individuals contributing their genomic data. Expanding our understanding of actionable genomic insights requires researchers to access large global datasets to capture the complexity of genomic contribution to diseases. Similarly, clinicians need efficient access to a patient's genome as well as population-representative historical records for evidence-based decisions. Both researchers and clinicians hence rely on participants to consent to the use of their genomic data, which in turn requires trust in the professional and ethical handling of this information. Here, we review existing and emerging solutions for secure and effective genomic information management, including storage, encryption, consent, and authorization that are needed to build participant trust. We discuss recent innovations in cloud computing, quantum-computing-proof encryption, and self-sovereign identity. These innovations can augment key developments from within the genomics community, notably GA4GH Passports and the Crypt4GH file container standard. We also explore how decentralized storage as well as the digital consenting process can offer culturally acceptable processes to encourage data contributions from ethnic minorities. We conclude that the individual and their right for self-determination needs to be put at the center of any genomics framework, because only on an individual level can the received benefits be accurately balanced against the risk of exposing private information.


Assuntos
Genômica , Humanos , Genômica/métodos , Genômica/ética , Segurança Computacional , Computação em Nuvem , Consentimento Livre e Esclarecido
18.
Gigascience ; 132024 Jan 02.
Artigo em Inglês | MEDLINE | ID: mdl-38837945

RESUMO

BACKGROUND: Traditional Chinese medicine has used Peucedanum praeruptorum Dunn (Apiaceae) for a long time. Various coumarins, including the significant constituents praeruptorin (A-E), are the active constituents in the dried roots of P. praeruptorum. Previous transcriptomic and metabolomic studies have attempted to elucidate the distribution and biosynthetic network of these medicinal-valuable compounds. However, the lack of a high-quality reference genome impedes an in-depth understanding of genetic traits and thus the development of better breeding strategies. RESULTS: A telomere-to-telomere (T2T) genome was assembled for P. praeruptorum by combining PacBio HiFi, ONT ultra-long, and Hi-C data. The final genome assembly was approximately 1.798 Gb, assigned to 11 chromosomes with genome completeness >98%. Comparative genomic analysis suggested that P. praeruptorum experienced 2 whole-genome duplication events. By the transcriptomic and metabolomic analysis of the coumarin metabolic pathway, we presented coumarins' spatial and temporal distribution and the expression patterns of critical genes for its biosynthesis. Notably, the COSY and cytochrome P450 genes showed tandem duplications on several chromosomes, which may be responsible for the high accumulation of coumarins. CONCLUSIONS: A T2T genome for P. praeruptorum was obtained, providing molecular insights into the chromosomal distribution of the coumarin biosynthetic genes. This high-quality genome is an essential resource for designing engineering strategies for improving the production of these valuable compounds.


Assuntos
Apiaceae , Cumarínicos , Genoma de Planta , Telômero , Cumarínicos/metabolismo , Apiaceae/genética , Apiaceae/metabolismo , Telômero/genética , Telômero/metabolismo , Evolução Molecular , Filogenia , Genômica/métodos , Vias Biossintéticas/genética
19.
Gigascience ; 132024 Jan 02.
Artigo em Inglês | MEDLINE | ID: mdl-38837946

RESUMO

BACKGROUND: Theobroma grandiflorum (Malvaceae), known as cupuassu, is a tree indigenous to the Amazon basin, valued for its large fruits and seed pulp, contributing notably to the Amazonian bioeconomy. The seed pulp is utilized in desserts and beverages, and its seed butter is used in cosmetics. Here, we present the sequenced telomere-to-telomere genome of cupuassu, disclosing its genomic structure, evolutionary features, and phylogenetic relationships within the Malvaceae family. FINDINGS: The cupuassu genome spans 423 Mb, encodes 31,381 genes distributed in 10 chromosomes, and exhibits approximately 65% gene synteny with the Theobroma cacao genome, reflecting a conserved evolutionary history, albeit punctuated with unique genomic variations. The main changes are pronounced by bursts of long-terminal repeat retrotransposons at postspecies divergence, retrocopied and singleton genes, and gene families displaying distinctive patterns of expansion and contraction. Furthermore, positively selected genes are evident, particularly among retained and dispersed tandem and proximal duplicated genes associated with general fruit and seed traits and defense mechanisms, supporting the hypothesis of potential episodes of subfunctionalization and neofunctionalization following duplication, as well as impact from distinct domestication process. These genomic variations may underpin the differences observed in fruit and seed morphology, ripening, and disease resistance between cupuassu and the other Malvaceae species. CONCLUSIONS: The cupuassu genome offers a foundational resource for both breeding improvement and conservation biology, yielding insights into the evolution and diversity within the genus Theobroma.


Assuntos
Evolução Molecular , Genoma de Planta , Filogenia , Cromossomos de Plantas , Genômica/métodos , Malvaceae/genética
20.
BMC Genomics ; 25(1): 549, 2024 Jun 01.
Artigo em Inglês | MEDLINE | ID: mdl-38824509

RESUMO

BACKGROUND: Despite Spirochetales being a ubiquitous and medically important order of bacteria infecting both humans and animals, there is extremely limited information regarding their bacteriophages. Of the genus Treponema, there is just a single reported characterised prophage. RESULTS: We applied a bioinformatic approach on 24 previously published Treponema genomes to identify and characterise putative treponemal prophages. Thirteen of the genomes did not contain any detectable prophage regions. The remaining eleven contained 38 prophage sequences, with between one and eight putative prophages in each bacterial genome. The prophage regions ranged from 12.4 to 75.1 kb, with between 27 and 171 protein coding sequences. Phylogenetic analysis revealed that 24 of the prophages formed three distinct sequence clusters, identifying putative myoviral and siphoviral morphology. ViPTree analysis demonstrated that the identified sequences were novel when compared to known double stranded DNA bacteriophage genomes. CONCLUSIONS: In this study, we have started to address the knowledge gap on treponeme bacteriophages by characterising 38 prophage sequences in 24 treponeme genomes. Using bioinformatic approaches, we have been able to identify and compare the prophage-like elements with respect to other bacteriophages, their gene content, and their potential to be a functional and inducible bacteriophage, which in turn can help focus our attention on specific prophages to investigate further.


Assuntos
Genoma Bacteriano , Genômica , Filogenia , Prófagos , Treponema , Prófagos/genética , Treponema/genética , Treponema/virologia , Genômica/métodos , Biologia Computacional/métodos , Genoma Viral , Bacteriófagos/genética , Bacteriófagos/classificação
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...