Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 13 de 13
Filtrar
Mais filtros










Base de dados
Intervalo de ano de publicação
1.
Mol Metab ; : 101987, 2024 Jul 06.
Artigo em Inglês | MEDLINE | ID: mdl-38977131

RESUMO

The advent of liquid biopsies presents a novel, minimally invasive methodology for the detection of disease biomarkers, offering a significant advantage over traditional biopsy techniques. Particularly, the analysis of cell-free RNA (cfRNA) has garnered interest due to its dynamic expression profiles and the capability to study various RNA species, including messenger RNA (mRNA) and long non-coding RNA (lncRNA). These attributes position cfRNA as a versatile biomarker with broad potential applications in clinical research and diagnostics. This review delves into the utility of cfRNA biomarkers as prognostic tools for obesity-related comorbidities, such as diabetes, dyslipidemia, and non-alcoholic fatty liver disease. We evaluate the efficacy of cfRNA in forecasting metabolic outcomes associated with obesity and in identifying patients likely to experience favorable clinical outcomes following bariatric surgery. Additionally, this review synthesizes evidence from studies examining circulating cfRNA across different physiological and pathological states, with a focus on its role in diabetes, including disease progression monitoring and treatment efficacy assessment. Through this exploration, we underscore the emerging relevance of cfRNA signatures in the context of obesity and its comorbidities, setting the stage for future investigative efforts in this rapidly advancing domain.

2.
Comput Struct Biotechnol J ; 23: 2289-2303, 2024 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-38840832

RESUMO

The rapid progression of genomics and proteomics has been driven by the advent of advanced sequencing technologies, large, diverse, and readily available omics datasets, and the evolution of computational data processing capabilities. The vast amount of data generated by these advancements necessitates efficient algorithms to extract meaningful information. K-mers serve as a valuable tool when working with large sequencing datasets, offering several advantages in computational speed and memory efficiency and carrying the potential for intrinsic biological functionality. This review provides an overview of the methods, applications, and significance of k-mers in genomic and proteomic data analyses, as well as the utility of absent sequences, including nullomers and nullpeptides, in disease detection, vaccine development, therapeutics, and forensic science. Therefore, the review highlights the pivotal role of k-mers in addressing current genomic and proteomic problems and underscores their potential for future breakthroughs in research.

3.
Comput Struct Biotechnol J ; 23: 1919-1928, 2024 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-38711760

RESUMO

The decrease in sequencing expenses has facilitated the creation of reference genomes and proteomes for an expanding array of organisms. Nevertheless, no established repository that details organism-specific genomic and proteomic sequences of specific lengths, referred to as kmers, exists to our knowledge. In this article, we present kmerDB, a database accessible through an interactive web interface that provides kmer-based information from genomic and proteomic sequences in a systematic way. kmerDB currently contains 202,340,859,107 base pairs and 19,304,903,356 amino acids, spanning 54,039 and 21,865 reference genomes and proteomes, respectively, as well as 6,905,362 and 149,305,183 genomic and proteomic species-specific sequences, termed quasi-primes. Additionally, we provide access to 5,186,757 nucleic and 214,904,089 peptide sequences absent from every genome and proteome, termed primes. kmerDB features a user-friendly interface offering various search options and filters for easy parsing and searching. The service is available at: www.kmerdb.com.

4.
Bioessays ; 46(7): e2300210, 2024 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-38715516

RESUMO

Understanding the influence of cis-regulatory elements on gene regulation poses numerous challenges given complexities stemming from variations in transcription factor (TF) binding, chromatin accessibility, structural constraints, and cell-type differences. This review discusses the role of gene regulatory networks in enhancing understanding of transcriptional regulation and covers construction methods ranging from expression-based approaches to supervised machine learning. Additionally, key experimental methods, including MPRAs and CRISPR-Cas9-based screening, which have significantly contributed to understanding TF binding preferences and cis-regulatory element functions, are explored. Lastly, the potential of machine learning and artificial intelligence to unravel cis-regulatory logic is analyzed. These computational advances have far-reaching implications for precision medicine, therapeutic target discovery, and the study of genetic variations in health and disease.


Assuntos
Sistemas CRISPR-Cas , Redes Reguladoras de Genes , Aprendizado de Máquina , Humanos , Sistemas CRISPR-Cas/genética , Biologia Computacional/métodos , Fatores de Transcrição/metabolismo , Fatores de Transcrição/genética , Regulação da Expressão Gênica/genética , Animais , Elementos Reguladores de Transcrição/genética
5.
NAR Genom Bioinform ; 6(2): lqae029, 2024 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-38584871

RESUMO

The prevalence of nucleic and peptide short sequences across organismal genomes and proteomes has not been thoroughly investigated. We examined 45 785 reference genomes and 21 871 reference proteomes, spanning archaea, bacteria, eukaryotes and viruses to calculate the rarity of short sequences in them. To capture this, we developed a metric of the rarity of each sequence in nature, the rarity index. We find that the frequency of certain dipeptides in rare oligopeptide sequences is hundreds of times lower than expected, which is not the case for any dinucleotides. We also generate predictive regression models that infer the rarity of nucleic and proteomic sequences across nature or within each domain of life and viruses separately. When examining each of the three domains of life and viruses separately, the R² performance of the model predicting rarity for 5-mer peptides from mono- and dipeptides ranged between 0.814 and 0.932. A separate model predicting rarity for 10-mer oligonucleotides from mono- and dinucleotides achieved R² performance between 0.408 and 0.606. Our results indicate that the mono- and dinucleotide composition of nucleic sequences and the mono- and dipeptide composition of peptide sequences can explain a significant proportion of the variance in their frequencies in nature.

6.
Cancer Gene Ther ; 31(6): 861-870, 2024 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-38351138

RESUMO

Early detection of cancer can significantly improve patient outcomes; however, sensitive and highly specific biomarkers for cancer detection are currently missing. Nullomers are the shortest sequences that are absent from the human genome but can emerge due to somatic mutations in cancer. We examine over 10,000 whole exome sequencing matched tumor-normal samples to characterize nullomer emergence across exonic regions of the genome. We also identify nullomer emerging mutational hotspots within tumor genes. Finally, we provide evidence for the identification of nullomers in cell-free RNA from peripheral blood samples, enabling detection of multiple tumor types. We show multiple tumor classification models with an AUC greater than 0.9, including a hepatocellular carcinoma classifier with an AUC greater than 0.99.


Assuntos
Ácidos Nucleicos Livres , Detecção Precoce de Câncer , Humanos , Detecção Precoce de Câncer/métodos , Ácidos Nucleicos Livres/sangue , Ácidos Nucleicos Livres/genética , Neoplasias/genética , Neoplasias/diagnóstico , Neoplasias/sangue , Biomarcadores Tumorais/genética , Biomarcadores Tumorais/sangue , Mutação , Sequenciamento do Exoma/métodos
7.
Eur J Cancer ; 196: 113421, 2024 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-37952501

RESUMO

Early diagnosis of cancer can significantly improve survival of cancer patients; however sensitive and highly specific biomarkers for cancer detection are currently lacking for most cancer types. Nullpeptides are short peptides that are absent from the human proteome. Here, we examined the emergence of nullpeptides during cancer development. We analyzed 3,600,964 somatic mutations across 10,064 whole exome sequencing tumor samples spanning 32 cancer types. We analyze RNA-seq data from primary tumor samples to identify the subset of nullpeptides that emerge in highly expresed genes. We show that nullpeptides, and particularly the subset that is highly recurrent across cancer patients, can be identified in tumor biopsy samples. We find that cancer genes show an excess of nullpeptides and detect nullpeptide hotspots in specific loci of oncogenes and tumor suppressors. We also observe that recurrent nullpeptides are more likely to be found in neoantigens, which have been shown to be effective targets for immunotherapy, suggesting that they can be used to prioritize candidates. Our findings provide evidence for the utility of nullpeptides as cancer detection and therapeutic biomarkers.


Assuntos
Neoplasias , Humanos , Neoplasias/terapia , Oncogenes , Peptídeos/genética , Imunoterapia , Biomarcadores , Mutação , Antígenos de Neoplasias
8.
bioRxiv ; 2023 Nov 22.
Artigo em Inglês | MEDLINE | ID: mdl-38045264

RESUMO

Massively parallel reporter assays (MPRAs) represent a set of high-throughput technologies that measure the functional effects of thousands of sequences/variants on gene regulatory activity. There are several different variations of MPRA technology and they are used for numerous applications, including regulatory element discovery, variant effect measurement, saturation mutagenesis, synthetic regulatory element generation or characterization of evolutionary gene regulatory differences. Despite their many designs and uses, there is no comprehensive database that incorporates the results of these experiments. To address this, we developed MPRAbase, a manually curated database that currently harbors 129 experiments, encompassing 17,718,677 elements tested across 35 cell types and 4 organisms. The MPRAbase web interface (http://www.mprabase.com) serves as a centralized user-friendly repository to download existing MPRA data for independent analysis and is designed with the ability to allow researchers to share their published data for rapid dissemination to the community.

9.
BMC Genomics ; 24(1): 768, 2023 Dec 12.
Artigo em Inglês | MEDLINE | ID: mdl-38087204

RESUMO

Early detection of human disease is associated with improved clinical outcomes. However, many diseases are often detected at an advanced, symptomatic stage where patients are past efficacious treatment periods and can result in less favorable outcomes. Therefore, methods that can accurately detect human disease at a presymptomatic stage are urgently needed. Here, we introduce "frequentmers"; short sequences that are specific and recurrently observed in either patient or healthy control samples, but not in both. We showcase the utility of frequentmers for the detection of liver cirrhosis using metagenomic Next Generation Sequencing data from stool samples of patients and controls. We develop classification models for the detection of liver cirrhosis and achieve an AUC score of 0.91 using ten-fold cross-validation. A small subset of 200 frequentmers can achieve comparable results in detecting liver cirrhosis. Finally, we identify the microbial organisms in liver cirrhosis samples, which are associated with the most predictive frequentmer biomarkers.


Assuntos
Sequenciamento de Nucleotídeos em Larga Escala , Cirrose Hepática , Humanos , Cirrose Hepática/diagnóstico , Cirrose Hepática/genética , Nível de Saúde , Metagenoma , Metagenômica , Sensibilidade e Especificidade
10.
Lung Cancer ; 186: 107424, 2023 12.
Artigo em Inglês | MEDLINE | ID: mdl-37979487

RESUMO

INTRODUCTION: NELSON and NLST prompted the implementation of lung cancer screening programs in the United States followed by several European countries. This study aimed to assess the sensitivity of different screening criteria among patients with lung cancer in Greece and investigate reasons for ineligibility. METHODS: We performed a retrospective analysis on patients with lung cancer referred to the largest referral center in Athens, Greece, between June 2014 and May 2023. The proportion of patients who would meet the updated USPSTF and NLST criteria was compared to the corresponding proportion of the Greek population over 15 years of age. RESULTS: Out of 2434 patients with lung cancer, 77.4 % (N = 1883) would meet the updated USPSTF criteria, and 58.9 % (N = 1439) would meet the NLST criteria at diagnosis; the corresponding proportions for the Greek population over 15 years would be 13.8 % and 8.2 %, respectively. Ineligible patients were more likely to be female, former or never-smokers, have adenocarcinoma histology, and have driver mutations (p < 0.001). CONCLUSIONS: Although the updated USPSTF criteria demonstrated good sensitivity, a substantial proportion of patients with lung cancer would still not be eligible for screening. Future studies to shape a comprehensive screening strategy should focus on the incorporation of additional risk factors for lung cancer, including air pollution and individual genetic susceptibility.


Assuntos
Neoplasias Pulmonares , Humanos , Feminino , Estados Unidos , Masculino , Neoplasias Pulmonares/diagnóstico por imagem , Neoplasias Pulmonares/epidemiologia , Grécia/epidemiologia , Estudos Retrospectivos , Detecção Precoce de Câncer , Fumar/efeitos adversos , Programas de Rastreamento , Tomografia Computadorizada por Raios X
11.
NAR Genom Bioinform ; 5(2): lqad039, 2023 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-37101657

RESUMO

Determining the organisms present in a biosample has many important applications in agriculture, wildlife conservation, and healthcare. Here, we develop a universal fingerprint based on the identification of short peptides that are unique to a specific organism. We define quasi-prime peptides as sequences that are found in only one species, and we analyzed proteomes from 21 875 species, from viruses to humans, and annotated the smallest peptide kmer sequences that are unique to a species and absent from all other proteomes. We also perform simulations across all reference proteomes and observe a lower than expected number of peptide kmers across species and taxonomies, indicating an enrichment for nullpeptides, sequences absent from a proteome. For humans, we find that quasi-primes are found in genes enriched for specific gene ontology terms, including proteasome and ATP and GTP catalysis. We also provide a set of quasi-prime peptides for a number of human pathogens and model organisms and further showcase its utility via two case studies for Mycobacterium tuberculosis and Vibrio cholerae, where we identify quasi-prime peptides in two transmembrane and extracellular proteins with relevance for pathogen detection. Our catalog of quasi-prime peptides provides the smallest unit of information that is specific to a single organism at the protein level, providing a versatile tool for species identification.

12.
Genome Biol ; 22(1): 245, 2021 08 25.
Artigo em Inglês | MEDLINE | ID: mdl-34433494

RESUMO

Nullomers and nullpeptides are short DNA or amino acid sequences that are absent from a genome or proteome, respectively. One potential cause for their absence could be their having a detrimental impact on an organism. RESULTS: Here, we identify all possible nullomers and nullpeptides in the genomes and proteomes of thirty eukaryotes and demonstrate that a significant proportion of these sequences are under negative selection. We also identify nullomers that are unique to specific functional categories: coding sequences, exons, introns, 5'UTR, 3'UTR, promoters, and show that coding sequence and promoter nullomers are most likely to be selected against. By analyzing all protein sequences across the tree of life, we further identify 36,081 peptides up to six amino acids in length that do not exist in any known organism, termed primes. We next characterize all possible single base pair mutations that can lead to the appearance of a nullomer in the human genome, observing a significantly higher number of mutations than expected by chance for specific nullomer sequences in transposable elements, likely due to their suppression. We also annotate nullomers that appear due to naturally occurring variants and show that a subset of them can be used to distinguish between different human populations. Analysis of nullomers and nullpeptides across vertebrate evolution shows they can also be used as phylogenetic classifiers. CONCLUSIONS: We provide a catalog of nullomers and nullpeptides in distinct functional categories, develop methods to systematically study them, and highlight the use of variability in these sequences in other analyses.


Assuntos
DNA/metabolismo , Evolução Molecular , Genoma Humano , Peptídeos/metabolismo , Proteínas/metabolismo , Animais , Pareamento de Bases , Sequência de Bases , Frequência do Gene/genética , Variação Genética , Humanos , Filogenia , Proteoma/metabolismo , Especificidade da Espécie
13.
Nucleic Acids Res ; 49(1): e4, 2021 01 11.
Artigo em Inglês | MEDLINE | ID: mdl-33211865

RESUMO

DNA strand asymmetries can have a major effect on several biological functions, including replication, transcription and transcription factor binding. As such, DNA strand asymmetries and mutational strand bias can provide information about biological function. However, a versatile tool to explore this does not exist. Here, we present Asymmetron, a user-friendly computational tool that performs statistical analysis and visualizations for the evaluation of strand asymmetries. Asymmetron takes as input DNA features provided with strand annotation and outputs strand asymmetries for consecutive occurrences of a single DNA feature or between pairs of features. We illustrate the use of Asymmetron by identifying transcriptional and replicative strand asymmetries of germline structural variant breakpoints. We also show that the orientation of the binding sites of 45% of human transcription factors analyzed have a significant DNA strand bias in transcribed regions, that is also corroborated in ChIP-seq analyses, and is likely associated with transcription. In summary, we provide a novel tool to assess DNA strand asymmetries and show how it can be used to derive new insights across a variety of biological disciplines.


Assuntos
Biologia Computacional/métodos , Replicação do DNA/genética , DNA/genética , Mutação , Transcrição Gênica/genética , Células A549 , Algoritmos , Linhagem Celular Transformada , DNA/química , DNA/metabolismo , Células Hep G2 , Humanos , Células K562 , Células MCF-7 , Modelos Genéticos , Ligação Proteica , Fatores de Transcrição/genética , Fatores de Transcrição/metabolismo
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...