Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 2.056
Filtrar
1.
Genome Biol ; 25(1): 166, 2024 Jun 25.
Artigo em Inglês | MEDLINE | ID: mdl-38918865

RESUMO

Nucleotide conversion RNA sequencing techniques interrogate chemical RNA modifications in cellular transcripts, resulting in mismatch-containing reads. Biases in mapping the resulting reads to reference genomes remain poorly understood. We present splice_sim, a splice-aware RNA-seq simulation and evaluation pipeline that introduces user-defined nucleotide conversions at set frequencies, creates mixture models of converted and unconverted reads, and calculates mapping accuracies per genomic annotation. By simulating nucleotide conversion RNA-seq datasets under realistic experimental conditions, including metabolic RNA labeling and RNA bisulfite sequencing, we measure mapping accuracies of state-of-the-art spliced-read mappers for mouse and human transcripts and derive strategies to prevent biases in the data interpretation.


Assuntos
RNA-Seq , Camundongos , Animais , Humanos , RNA-Seq/métodos , Splicing de RNA , Análise de Sequência de RNA/métodos , Software , Nucleotídeos/genética , Simulação por Computador
2.
Mol Biol Rep ; 51(1): 639, 2024 May 10.
Artigo em Inglês | MEDLINE | ID: mdl-38727924

RESUMO

BACKGROUND: Peucedani Radix, also known as "Qian-hu" is a traditional Chinese medicine derived from Peucedanum praeruptorum Dunn. It is widely utilized for treating wind-heat colds and coughs accompanied by excessive phlegm. However, due to morphological similarities, limited resources, and heightened market demand, numerous substitutes and adulterants of Peucedani Radix have emerged within the herbal medicine market. Moreover, Peucedani Radix is typically dried and sliced for sale, rendering traditional identification methods challenging. MATERIALS AND METHODS: We initially examined and compared 104 commercial "Qian-hu" samples from various Chinese medicinal markets and 44 species representing genuine, adulterants or substitutes, utilizing the mini barcode ITS2 region to elucidate the botanical origins of the commercial "Qian-hu". The nucleotide signature specific to Peucedani Radix was subsequently developed by analyzing the polymorphic sites within the aligned ITS2 sequences. RESULTS: The results demonstrated a success rate of 100% and 93.3% for DNA extraction and PCR amplification, respectively. Forty-five samples were authentic "Qian-hu", while the remaining samples were all adulterants, originating from nine distinct species. Peucedani Radix, its substitutes, and adulterants were successfully identified based on the neighbor-joining tree. The 24-bp nucleotide signature (5'-ATTGTCGTACGAATCCTCGTCGTC-3') revealed distinct differences between Peucedani Radix and its common substitutes and adulterants. The newly designed specific primers (PR-F/PR-R) can amplify the nucleotide signature region from commercial samples and processed materials with severe DNA degradation. CONCLUSIONS: We advocate for the utilization of ITS2 and nucleotide signature for the rapid and precise identification of herbal medicines and their adulterants to regulate the Chinese herbal medicine industry.


Assuntos
Código de Barras de DNA Taxonômico , DNA de Plantas , DNA de Plantas/genética , Código de Barras de DNA Taxonômico/métodos , Medicamentos de Ervas Chinesas/normas , Apiaceae/genética , Apiaceae/classificação , Medicina Tradicional Chinesa/normas , DNA Espaçador Ribossômico/genética , Contaminação de Medicamentos , Plantas Medicinais/genética , Filogenia , Análise de Sequência de DNA/métodos , Reação em Cadeia da Polimerase/métodos , Nucleotídeos/genética , Nucleotídeos/análise
3.
Sci Rep ; 14(1): 11930, 2024 05 24.
Artigo em Inglês | MEDLINE | ID: mdl-38789717

RESUMO

Nucleotide-binding site (NBS) domain genes are one of the superfamily of resistance genes involved in plant responses to pathogens. The current study identified 12,820 NBS-domain-containing genes across 34 species covering from mosses to monocots and dicots. These identified genes are classified into 168 classes with several novel domain architecture patterns encompassing significant diversity among plant species. Several classical (NBS, NBS-LRR, TIR-NBS, TIR-NBS-LRR, etc.) and species-specific structural patterns (TIR-NBS-TIR-Cupin_1-Cupin_1, TIR-NBS-Prenyltransf, Sugar_tr-NBS etc.) were discovered. We observed 603 orthogroups (OGs) with some core (most common orthogroups; OG0, OG1, OG2, etc.) and unique (highly specific to species; OG80, OG82, etc.) OGs with tandem duplications. The expression profiling presented the putative upregulation of OG2, OG6, and OG15 in different tissues under various biotic and abiotic stresses in susceptible and tolerant plants to cotton leaf curl disease (CLCuD). The genetic variation between susceptible (Coker 312) and tolerant (Mac7) Gossypium hirsutum accessions identified several unique variants in NBS genes of Mac7 (6583 variants) and Coker312 (5173 variants). The protein-ligand and proteins-protein interaction showed a strong interaction of some putative NBS proteins with ADP/ATP and different core proteins of the cotton leaf curl disease virus. The silencing of GaNBS (OG2) in resistant cotton through virus-induced gene silencing (VIGS) demonstrated its putative role in virus tittering. The presented study will be further helpful in understanding the plant adaptation mechanism.


Assuntos
Proteínas de Plantas , Sítios de Ligação , Proteínas de Plantas/genética , Proteínas de Plantas/metabolismo , Nucleotídeos/genética , Nucleotídeos/metabolismo , Resistência à Doença/genética , Regulação da Expressão Gênica de Plantas , Doenças das Plantas/genética , Doenças das Plantas/virologia , Genes de Plantas , Filogenia , Plantas/genética , Perfilação da Expressão Gênica , Domínios Proteicos
4.
Infect Genet Evol ; 122: 105608, 2024 Aug.
Artigo em Inglês | MEDLINE | ID: mdl-38796047

RESUMO

Several studies have showed that the nucleotide and dinucleotide composition of viruses possibly follows their host species or protein coding region. Nevertheless, the influence of viral segment on viral nucleotide and dinucleotide composition is still unknown. Here, we explored through tomato spotted wilt virus (TSWV), a segmented virus that seriously threatens the production of tomatoes all over the world. Through nucleotide composition analysis, we found the same over-representation of A across all viral segments at the first and second codon position, but it exhibited distinct in segments at the third codon position. Interestingly, the protein coding regions which encoded by the same or different segments exhibit obvious distinct nucleotide preference. Then, we found that the dinucleotides UpG and CpU were overrepresented and the dinucleotides UpA, CpG and GpU were underrepresented, not only in the complete genomic sequences, but also in different segments, protein coding regions and host species. Notably, 100% of the data investigated here were predicted to the correct viral segment and protein coding region, despite the fact that only 67% of the data analyzed here were predicted to the correct viral host species. In conclusion, in case study of TSWV, nucleotide composition and dinucleotide preference of segment viruses are more strongly dependent on segment and protein coding region than on host species. This research provides a novel perspective on the molecular evolutionary mechanisms of TSWV and provides reference for future research on genetic diversity of segmented viruses.


Assuntos
Genoma Viral , Nucleotídeos , Solanum lycopersicum , Tospovirus , Tospovirus/genética , Solanum lycopersicum/virologia , Nucleotídeos/genética , Doenças das Plantas/virologia , RNA Viral/genética
5.
Nat Methods ; 21(6): 994-1002, 2024 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-38755321

RESUMO

Searching vast and rapidly growing nucleotide content in resources, such as runs in the Sequence Read Archive and assemblies for whole-genome shotgun sequencing projects in GenBank, is currently impractical for most researchers. Here we present Pebblescout, a tool that navigates such content by providing indexing and search capabilities. Indexing uses dense sampling of the sequences in the resource. Search finds subjects (runs or assemblies) that have short sequence matches to a user query, with well-defined guarantees and ranks them using informativeness of the matches. We illustrate the functionality of Pebblescout by creating eight databases that index over 3.7 petabases. The web service of Pebblescout can be reached at https://pebblescout.ncbi.nlm.nih.gov . We show that for a wide range of query lengths, Pebblescout provides a data-driven way for finding relevant subsets of large nucleotide resources, reducing the effort for downstream analysis substantially. We also show that Pebblescout results compare favorably to MetaGraph and Sourmash.


Assuntos
Software , Nucleotídeos/genética , Humanos , Bases de Dados Genéticas , Biologia Computacional/métodos , Bases de Dados de Ácidos Nucleicos , Algoritmos
6.
J Mol Biol ; 436(12): 168595, 2024 Jun 15.
Artigo em Inglês | MEDLINE | ID: mdl-38724003

RESUMO

During the late stage of infection, alphabaculoviruses produce many occlusion bodies (OBs) in the nuclei of the insect host's cells through the hyperexpression of polyhedrin (POLH), a major OB component encoded by polh. The strong polh promoter has been used to develop a baculovirus expression vector system for recombinant protein expression in cultured insect cells and larvae. However, the relationship between POLH accumulation and the polh coding sequence remains largely unelucidated. This study aimed to assess the importance of polh codon usage and/or nucleotide sequences in POLH accumulation by generating a baculovirus Bombyx mori nucleopolyhedrovirus (BmNPV) expressing mutant polh (co-polh) optimized according to the codon preference of its host insect. Although the deduced amino acid sequence of CO-POLH was the same as that of wild-type POLH, POLH accumulation was significantly lower in cells infected with the co-polh mutant. This reduction was due to decreased polh mRNA levels rather than translational repression. Analysis of mutant viruses with chimeric polh revealed that a 30 base-pair (bp) 5' proximal polh coding region was necessary for maintaining high polh mRNA levels. Sequence comparison of wild-type polh and co-polh identified five nucleotide differences in this region, indicating that these nucleotides were critical for polh hyperexpression. Furthermore, luciferase reporter assays showed that the 30 bp 5' coding region was sufficient for maintaining the polh promoter-driven high level of polh mRNA. Thus, our whole-gene scanning by codon optimization identified important hidden nucleotides for polh hyperexpression in alphabaculoviruses.


Assuntos
Bombyx , Nucleopoliedrovírus , Proteínas de Matriz de Corpos de Inclusão , Nucleopoliedrovírus/genética , Animais , Proteínas de Matriz de Corpos de Inclusão/genética , Bombyx/virologia , Bombyx/genética , Nucleotídeos/genética , Nucleotídeos/metabolismo , Regiões Promotoras Genéticas , Proteínas Estruturais Virais/genética , Proteínas Estruturais Virais/metabolismo , Códon/genética , Regulação Viral da Expressão Gênica , Linhagem Celular
7.
J Cancer Res Clin Oncol ; 150(5): 258, 2024 May 16.
Artigo em Inglês | MEDLINE | ID: mdl-38753091

RESUMO

PURPOSE: Breast cancer (BC) is the most prevalent malignant tumor worldwide among women, with the highest incidence rate. The mechanisms underlying nucleotide metabolism on biological functions in BC remain incompletely elucidated. MATERIALS AND METHODS: We harnessed differentially expressed nucleotide metabolism-related genes from The Cancer Genome Atlas-BRCA, constructing a prognostic risk model through univariate Cox regression and LASSO regression analyses. A validation set and the GSE7390 dataset were used to validate the risk model. Clinical relevance, survival and prognosis, immune infiltration, functional enrichment, and drug sensitivity analyses were conducted. RESULTS: Our findings identified four signature genes (DCTPP1, IFNG, SLC27A2, and MYH3) as nucleotide metabolism-related prognostic genes. Subsequently, patients were stratified into high- and low-risk groups, revealing the risk model's independence as a prognostic factor. Nomogram calibration underscored superior prediction accuracy. Gene Set Variation Analysis (GSVA) uncovered activated pathways in low-risk cohorts and mobilized pathways in high-risk cohorts. Distinctions in immune cells were noted between risk cohorts. Subsequent experiments validated that reducing SLC27A2 expression in BC cell lines or using the SLC27A2 inhibitor, Lipofermata, effectively inhibited tumor growth. CONCLUSIONS: We pinpointed four nucleotide metabolism-related prognostic genes, demonstrating promising accuracy as a risk prediction tool for patients with BC. SLC27A2 appears to be a potential therapeutic target for BC among these genes.


Assuntos
Neoplasias da Mama , Humanos , Feminino , Neoplasias da Mama/genética , Neoplasias da Mama/patologia , Prognóstico , Medição de Risco/métodos , Nucleotídeos/genética , Nomogramas , Biomarcadores Tumorais/genética , Biomarcadores Tumorais/metabolismo , Animais , Regulação Neoplásica da Expressão Gênica , Camundongos , Linhagem Celular Tumoral
8.
Biomolecules ; 14(5)2024 May 02.
Artigo em Inglês | MEDLINE | ID: mdl-38785954

RESUMO

In the cell, DNA polymerase ß (Polß) is involved in many processes aimed at maintaining genome stability and is considered the main repair DNA polymerase participating in base excision repair (BER). Polß can fill DNA gaps formed by other DNA repair enzymes. Single-nucleotide polymorphisms (SNPs) in the POLB gene can affect the enzymatic properties of the resulting protein, owing to possible amino acid substitutions. For many SNP-associated Polß variants, an association with cancer, owing to changes in polymerase activity and fidelity, has been shown. In this work, kinetic analyses and molecular dynamics simulations were used to examine the activity of naturally occurring polymorphic variants G274R, G290C, and R333W. Previously, the amino acid substitutions at these positions have been found in various types of tumors, implying a specific role of Gly-274, Gly-290, and Arg-333 in Polß functioning. All three polymorphic variants had reduced polymerase activity. Two substitutions-G274R and R333W-led to the almost complete disappearance of gap-filling and primer elongation activities, a decrease in the deoxynucleotide triphosphate-binding ability, and a lower polymerization constant, due to alterations of local contacts near the replaced amino acid residues. Thus, variants G274R, G290C, and R333W may be implicated in an elevated level of unrepaired DNA damage.


Assuntos
Substituição de Aminoácidos , DNA Polimerase beta , Simulação de Dinâmica Molecular , Polimorfismo de Nucleotídeo Único , DNA Polimerase beta/metabolismo , DNA Polimerase beta/genética , DNA Polimerase beta/química , Humanos , Cinética , Reparo do DNA/genética , Nucleotídeos/metabolismo , Nucleotídeos/genética
9.
Genome Biol Evol ; 16(4)2024 Apr 02.
Artigo em Inglês | MEDLINE | ID: mdl-38608148

RESUMO

Nucleotide diversity at a site is influenced by the relative strengths of neutral and selective population genetic processes. Therefore, attempts to estimate Effective population size based on the diversity of synonymous sites demand a better understanding of their selective constraints. The nucleotide diversity of a gene was previously found to correlate with its length. In this work, I measure nucleotide diversity at synonymous sites and uncover a pattern of low diversity towards the translation initiation site of a gene. The degree of reduction in diversity at the translation initiation site and the length of this region of reduced diversity can be quantified as "Effect Size" and "Effect Length" respectively, using parameters of an asymptotic regression model. Estimates of Effect Length across bacteria covaried with recombination rates as well as with a multitude of translation-associated traits such as the avoidance of mRNA secondary structure around translation initiation site, the number of rRNAs, and relative codon usage of ribosomal genes. Evolutionary simulations under purifying selection reproduce the observed patterns and diversity-length correlation and highlight that selective constraints on the 5'-region of a gene may be more extensive than previously believed. These results have implications for the estimation of effective population size, and relative mutation rates, and for genome scans of genes under positive selection based on "silent-site" diversity.


Assuntos
Evolução Molecular , Variação Genética , Seleção Genética , Modelos Genéticos , Nucleotídeos/genética , Uso do Códon , Iniciação Traducional da Cadeia Peptídica
10.
Sci Rep ; 14(1): 9466, 2024 04 24.
Artigo em Inglês | MEDLINE | ID: mdl-38658614

RESUMO

Long extrachromosomal circular DNA (leccDNA) regulates several biological processes such as genomic instability, gene amplification, and oncogenesis. The identification of leccDNA holds significant importance to investigate its potential associations with cancer, autoimmune, cardiovascular, and neurological diseases. In addition, understanding these associations can provide valuable insights about disease mechanisms and potential therapeutic approaches. Conventionally, wet lab-based methods are utilized to identify leccDNA, which are hindered by the need for prior knowledge, and resource-intensive processes, potentially limiting their broader applicability. To empower the process of leccDNA identification across multiple species, the paper in hand presents the very first computational predictor. The proposed iLEC-DNA predictor makes use of SVM classifier along with sequence-derived nucleotide distribution patterns and physicochemical properties-based features. In addition, the study introduces a set of 12 benchmark leccDNA datasets related to three species, namely Homo sapiens (HM), Arabidopsis Thaliana (AT), and Saccharomyces cerevisiae (SC/YS). It performs large-scale experimentation across 12 benchmark datasets under different experimental settings using the proposed predictor, more than 140 baseline predictors, and 858 encoder ensembles. The proposed predictor outperforms baseline predictors and encoder ensembles across diverse leccDNA datasets by producing average performance values of 81.09%, 62.2% and 81.08% in terms of ACC, MCC and AUC-ROC across all the datasets. The source code of the proposed and baseline predictors is available at https://github.com/FAhtisham/Extrachrosmosomal-DNA-Prediction . To facilitate the scientific community, a web application for leccDNA identification is available at https://sds_genetic_analysis.opendfki.de/iLEC_DNA/.


Assuntos
DNA Circular , Saccharomyces cerevisiae , DNA Circular/genética , Humanos , Saccharomyces cerevisiae/genética , Arabidopsis/genética , Biologia Computacional/métodos , Nucleotídeos/genética , Máquina de Vetores de Suporte
11.
BMC Genomics ; 25(1): 405, 2024 Apr 24.
Artigo em Inglês | MEDLINE | ID: mdl-38658835

RESUMO

Graph-based pangenome is gaining more popularity than linear pangenome because it stores more comprehensive information of variations. However, traditional linear genome browser has its own advantages, especially the tremendous resources accumulated historically. With the fast-growing number of individual genomes and their annotations available, the demand for a genome browser to visualize genome annotation for many individuals together with a graph-based pangenome is getting higher and higher. Here we report a new pangenome browser PPanG, a precise pangenome browser enabling nucleotide-level comparison of individual genome annotations together with a graph-based pangenome. Nine rice genomes with annotations were provided by default as potential references, and any individual genome can be selected as the reference. Our pangenome browser provides unprecedented insights on genome variations at different levels from base to gene, and reveals how the structures of a gene could differ for individuals. PPanG can be applied to any species with multiple individual genomes available and it is available at https://cgm.sjtu.edu.cn/PPanG .


Assuntos
Genômica , Genômica/métodos , Oryza/genética , Anotação de Sequência Molecular , Genoma de Planta , Variação Genética , Software , Navegador , Bases de Dados Genéticas , Nucleotídeos/genética , Genoma
12.
PLoS Comput Biol ; 20(4): e1011995, 2024 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-38656999

RESUMO

Genomes contain conserved non-coding sequences that perform important biological functions, such as gene regulation. We present a phylogenetic method, PhyloAcc-C, that associates nucleotide substitution rates with changes in a continuous trait of interest. The method takes as input a multiple sequence alignment of conserved elements, continuous trait data observed in extant species, and a background phylogeny and substitution process. Gibbs sampling is used to assign rate categories (background, conserved, accelerated) to lineages and explore whether the assigned rate categories are associated with increases or decreases in the rate of trait evolution. We test our method using simulations and then illustrate its application using mammalian body size and lifespan data previously analyzed with respect to protein coding genes. Like other studies, we find processes such as tumor suppression, telomere maintenance, and p53 regulation to be related to changes in longevity and body size. In addition, we also find that skeletal genes, and developmental processes, such as sprouting angiogenesis, are relevant.


Assuntos
Evolução Molecular , Modelos Genéticos , Filogenia , Animais , Longevidade/genética , Humanos , Biologia Computacional/métodos , Simulação por Computador , Tamanho Corporal/genética , Nucleotídeos/genética , Alinhamento de Sequência/métodos
13.
Nucleic Acids Res ; 52(10): 5950-5958, 2024 Jun 10.
Artigo em Inglês | MEDLINE | ID: mdl-38452198

RESUMO

Loss of the translational reading frame leads to misincorporation and premature termination, which can have lethal consequences. Based on structural evidence that A1503 of 16S rRNA intercalates between specific mRNA bases, we tested the possibility that it plays a role in maintenance of the reading frame by constructing ribosomes with an abasic nucleotide at position 1503. This was done by specific cleavage of 16S rRNA at position 1493 using the colicin E3 endonuclease and replacing the resulting 3'-terminal 49mer fragment with a synthetic oligonucleotide containing the abasic site using a novel splinted RNA ligation method. Ribosomes reconstituted from the abasic 1503 16S rRNA were highly active in protein synthesis but showed elevated levels of spontaneous frameshifting into the -1 reading frame. We then asked whether the residual frameshifting persisting in control ribosomes containing an intact A1503 is due to the absence of the N6-dimethyladenosine modifications at positions 1518 and 1519. Indeed, this frameshifting was rescued by site-specific methylation in vitro by the ksgA methylase. These findings thus implicate two different sites near the 3' end of 16S rRNA in maintenance of the translational reading frame, providing yet another example of a functional role for ribosomal RNA in protein synthesis.


Assuntos
Mudança da Fase de Leitura do Gene Ribossômico , Biossíntese de Proteínas , RNA Ribossômico 16S , Ribossomos , RNA Ribossômico 16S/genética , Ribossomos/metabolismo , Ribossomos/genética , Nucleotídeos/química , Nucleotídeos/genética , Metilação , Fases de Leitura Aberta , Escherichia coli/genética , Escherichia coli/metabolismo , RNA Mensageiro/genética , RNA Mensageiro/metabolismo , RNA Mensageiro/química , Conformação de Ácido Nucleico , Adenosina/análogos & derivados , Adenosina/metabolismo , Adenosina/química
14.
Methods Mol Biol ; 2760: 133-145, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-38468086

RESUMO

Efficient preparation of DNA oligonucleotides containing unnatural nucleobases (UBs) that can pair with their cognates to form unnatural base pairs (UBPs) is an essential prerequisite for the application of UBPs in vitro and in vivo. Traditional preparation of oligonucleotides containing unnatural nucleobases largely relies on solid-phase synthesis, which needs to use unstable nucleoside phosphoramidites and a DNA synthesizer, and is environmentally unfriendly and limited in product length. To overcome these limitations of solid-phase synthesis, we developed enzymatic methods for daily laboratory preparation of DNA oligonucleotides containing unnatural nucleobase dNaM, dTPT3, or one of the functionalized dTPT3 derivatives, which can be used for orthogonal DNA labeling or the preparation of DNAs containing UBP dNaM-dTPT3, one of the most successful UBPs to date, based on the template-independent polymerase terminal deoxynucleotidyl transferase (TdT). Here, we first provide a detailed procedure for the TdT-based preparation of DNA oligonucleotides containing 3'-nucleotides of dNaM, dTPT3, or one of dTPT3 derivatives. We then present the procedures for enzyme-linked oligonucleotide assay (ELONA) and imaging of bacterial cells using DNA oligonucleotides containing 3'-nucleotides of dTPT3 derivatives with different functional groups. The procedure for enzymatic synthesis of DNAs containing an internal UBP dNaM-dTPT3 is also described. Hopefully, these methods will greatly facilitate the application of UBPs and the construction of semi-synthetic organisms with an expanded genetic alphabet.


Assuntos
DNA Nucleotidilexotransferase , Biologia Sintética , DNA Nucleotidilexotransferase/genética , Biologia Sintética/métodos , DNA/genética , DNA Polimerase Dirigida por DNA , Nucleotídeos/genética , Oligonucleotídeos/genética
15.
Org Biomol Chem ; 22(15): 2963-2967, 2024 04 17.
Artigo em Inglês | MEDLINE | ID: mdl-38529657

RESUMO

A type of modified nucleotide, deoxynucleotide γ-amidotriphosphates (dNTPγNH2s), exhibited around five times higher stability than dNTPs. These phosphamide nucleotides can be utilized by several DNA polymerases, and the amplification of a 10 kb DNA fragment through the polymerase chain reaction (PCR) can be accomplished even under conditions of high temperature, extended storage, or repeated freeze-thaw cycles. However, the control PCR with standard dNTPs was unsuccessful. These results indicate that dNTPγNH2s have the potential to substitute dNTPs in PCR.


Assuntos
DNA , Dimetoato , DNA Polimerase Dirigida por DNA , Nucleotídeos/genética
16.
Mol Genet Genomics ; 299(1): 23, 2024 Mar 02.
Artigo em Inglês | MEDLINE | ID: mdl-38431687

RESUMO

Nucleotide mutations in human genes have long been a hot subject for study because some of them may lead to severe human diseases. Understanding the general mutational process and evolutionary trend of human genes could help answer such questions as why certain diseases occur and what challenges we face in protecting human health. In this study, we conducted statistics on 89,895 single-nucleotide variations identified in coding regions of 18,339 human genes. The results show that C and G are frequently mutated into T and A in human genes. C/G (C or G)-to-T/A mutations lead to reduction of hydrogen bonds in double-stranded DNA because C-G and T-A base pairs are maintained by three and two hydrogen bonds respectively. C-to-T and G-to-A mutations occur predominantly in human genes because they not only reduce hydrogen bonds but also belong to transition mutation. Reduction of hydrogen bonds could reduce energy consumption not only in separating double strands of mutated DNA for transcription and replication but also in disrupting stem-loop structure of mutated mRNA for translation. It is thus considered that to reduce hydrogen bonds (and thus to reduce energy consumption in gene expression) is one of the driving forces for nucleotide mutation. Moreover, codon mutation is positively correlated to its content, suggesting that most mutations are not targeted on changing any specific codons (amino acids) but are merely for reducing hydrogen bonds. Our study provides an example of utilizing single-nucleotide variation data to infer evolutionary trend of human genes, which can be referenced to conduct similar studies in other organisms.


Assuntos
Evolução Biológica , DNA , Humanos , Mutação , DNA/genética , Códon , Nucleotídeos/genética
17.
Nucleic Acids Res ; 52(7): e35, 2024 Apr 24.
Artigo em Inglês | MEDLINE | ID: mdl-38381903

RESUMO

Nucleoside analogues like 4-thiouridine (4sU) are used to metabolically label newly synthesized RNA. Chemical conversion of 4sU before sequencing induces T-to-C mismatches in reads sequenced from labelled RNA, allowing to obtain total and labelled RNA expression profiles from a single sequencing library. Cytotoxicity due to extended periods of labelling or high 4sU concentrations has been described, but the effects of extensive 4sU labelling on expression estimates from nucleotide conversion RNA-seq have not been studied. Here, we performed nucleotide conversion RNA-seq with escalating doses of 4sU with short-term labelling (1h) and over a progressive time course (up to 2h) in different cell lines. With high concentrations or at later time points, expression estimates were biased in an RNA half-life dependent manner. We show that bias arose by a combination of reduced mappability of reads carrying multiple conversions, and a global, unspecific underrepresentation of labelled RNA emerging during library preparation and potentially global reduction of RNA synthesis. We developed a computational tool to rescue unmappable reads, which performed favourably compared to previous read mappers, and a statistical method, which could fully remove remaining bias. All methods developed here are freely available as part of our GRAND-SLAM pipeline and grandR package.


Assuntos
RNA-Seq , Tiouridina , Tiouridina/metabolismo , Tiouridina/química , RNA-Seq/métodos , Humanos , RNA/genética , Análise de Sequência de RNA/métodos , Nucleotídeos/genética
18.
J Alzheimers Dis ; 97(4): 1645-1660, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-38306048

RESUMO

Background: Previous reports have demonstrated post-operative dementia and Alzheimer's disease (AD), and increased amyloid-ß levels and tau hyperphosphorylation have been observed in animal models post-anesthesia. Objective: After surgical interventions, loss in memory has been observed that has been found linked with genes modulated after anesthesia. Present study aimed to study molecular pattern present in genes modulated post anesthesia and involved in characters progressing towards AD. Methods: In the present study, 17 transcript variants belonging to eight genes, which have been found to modulate post-anesthesia and contribute to AD progression, were envisaged for their compositional features, molecular patterns, and codon and codon context-associated studies. Results: The sequences' composition was G/C rich, influencing dinucleotide preference, codon preference, codon usage, and codon context. The G/C nucleotides being highly occurring nucleotides, CpGdinucleotides were also preferred; however, CpG was highly disfavored at p3-1 at the codon junction. The nucleotide composition of Cytosine exhibited a unique feature, and unlike other nucleotides, it did not correlate with codon bias. Contrarily, it correlated with the sequence lengths. The sequences were leucine-rich, and multiple leucine repeats were present, exhibiting the functional role of neuroprotection from neuroinflammation post-anesthesia. Conclusions: The analysis pave the way to elucidate unique molecular patterns in genes modulated during anesthetic treatment and might help ameliorate the ill effects of anesthetics in the future.


Assuntos
Doença de Alzheimer , Anestesia , Anestésicos , Animais , Doença de Alzheimer/genética , Doença de Alzheimer/metabolismo , Agregados Proteicos , Leucina/genética , Proteínas tau/genética , Proteínas tau/metabolismo , Códon/genética , Nucleotídeos/genética
19.
Immunogenetics ; 76(2): 109-121, 2024 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-38400869

RESUMO

In the past, identification of HLA alleles was limited to sequencing the region of the gene coding for the peptide binding groove, resulting in a lack of sequence information in the HLA database, challenging HLA allele assignment software programs. We investigated full-length sequences of 19 HLA class I and 7 HLA class II alleles, and we extended another 47 HLA class I alleles with sequences of 5' and 3' UTR regions that were all not yet available in the IPD-IMGT/HLA database. We resolved 8638 unknown nucleotides in the coding sequence of HLA class I and 2139 of HLA class II. Furthermore, with full-length sequencing of the 26 alleles, more than 90 kb of sequence information was added to the non-coding sequences, whereas extension of the 47 alleles resulted in the addition of 5.5 kb unknown nucleotides to the 5' UTR and > 31.7 kb to the 3' UTR region. With this information, some interesting features were observed, like possible recombination events and lineage evolutionary origins. The continuing increase in the availability of full-length sequences in the HLA database will enable the identification of the evolutionary origin and will help the community to improve the alignment and assignment accuracy of HLA alleles.


Assuntos
Evolução Biológica , Nucleotídeos , Alelos , Regiões 3' não Traduzidas/genética , Membrana Celular , Nucleotídeos/genética
20.
Virol J ; 21(1): 6, 2024 01 04.
Artigo em Inglês | MEDLINE | ID: mdl-38178191

RESUMO

BACKGROUND: In cellular organisms, inosine triphosphate pyrophosphatases (ITPases) prevent the incorporation of mutagenic deaminated purines into nucleic acids. These enzymes have also been detected in the genomes of several plant RNA viruses infecting two euphorbia species. In particular, two ipomoviruses produce replicase-associated ITPases to cope with high concentration of non-canonical nucleotides found in cassava tissues. METHOD: Using high-throughput RNA sequencing on the wild euphorbia species Mercurialis perennis, two new members of the families Potyviridae and Secoviridae were identified. Both viruses encode for a putative ITPase, and were found in mixed infection with a new partitivirid. Following biological and genomic characterization of these viruses, the origin and function of the phytoviral ITPases were investigated. RESULTS: While the potyvirid was shown to be pathogenic, the secovirid and partitivirid could not be transmitted. The secovirid was found belonging to a proposed new Comovirinae genus tentatively named "Mercomovirus", which also accommodates other viruses identified through transcriptome mining, and for which an asymptomatic pollen-associated lifestyle is suspected. Homology and phylogenetic analyses inferred that the ITPases encoded by the potyvirid and secovirid were likely acquired through independent horizontal gene transfer events, forming lineages distinct from the enzymes found in cassava ipomoviruses. Possible origins from cellular organisms are discussed for these proteins. In parallel, the endogenous ITPase of M. perennis was predicted to encode for a C-terminal nuclear localization signal, which appears to be conserved among the ITPases of euphorbias but absent in other plant families. This subcellular localization is in line with the idea that nucleic acids remain protected in the nucleus, while deaminated nucleotides accumulate in the cytoplasm where they act as antiviral molecules. CONCLUSION: Three new RNA viruses infecting M. perennis are described, two of which encoding for ITPases. These enzymes have distinct origins, and are likely required by viruses to circumvent high level of cytoplasmic non-canonical nucleotides. This putative plant defense mechanism has emerged early in the evolution of euphorbias, and seems to specifically target certain groups of RNA viruses infecting perennial hosts.


Assuntos
Coinfecção , Euphorbia , Ácidos Nucleicos , Vírus de Plantas , Potyviridae , Vírus de RNA , Inosina Trifosfatase , Filogenia , Vírus de RNA/genética , Nucleotídeos/genética , Potyviridae/genética , Vírus de Plantas/genética , Plantas/genética , RNA Viral/genética , Genoma Viral
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...