Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 127
Filter
1.
Article in English | MEDLINE | ID: mdl-38862433

ABSTRACT

During the last decade, the generation and accumulation of petabase-scale high-throughput sequencing data have resulted in great challenges, including access to human data, as well as transfer, storage, and sharing of enormous amounts of data. To promote data-driven biological research, the Korean government announced that all biological data generated from government-funded research projects should be deposited at the Korea BioData Station (K-BDS), which consists of multiple databases for individual data types. Here, we introduce the Korean Nucleotide Archive (KoNA), a repository of nucleotide sequence data. As of July 2022, the Korean Read Archive in KoNA has collected over 477 TB of raw next-generation sequencing data from national genome projects. To ensure data quality and prepare for international alignment, a standard operating procedure was adopted, which is similar to that of the International Nucleotide Sequence Database Collaboration. The standard operating procedure includes quality control processes for submitted data and metadata using an automated pipeline, followed by manual examination. To ensure fast and stable data transfer, a high-speed transmission system called GBox is used in KoNA. Furthermore, the data uploaded to or downloaded from KoNA through GBox can be readily processed using a cloud computing service called Bio-Express. This seamless coupling of KoNA, GBox, and Bio-Express enhances the data experience, including submission, access, and analysis of raw nucleotide sequences. KoNA not only satisfies the unmet needs for a national sequence repository in Korea but also provides datasets to researchers globally and contributes to advances in genomics. The KoNA is available at https://www.kobic.re.kr/kona/.


Subject(s)
Databases, Nucleic Acid , Republic of Korea , Humans , High-Throughput Nucleotide Sequencing/methods
2.
Vopr Virusol ; 69(2): 162-174, 2024 May 06.
Article in English | MEDLINE | ID: mdl-38843022

ABSTRACT

The objective is to determine the complete nucleotide sequence and conduct a phylogenetic analysis of genome variants of the Puumala virus isolated in the Saratov region. MATERIALS AND METHODS: The samples for the study were field material collected in the Gagarinsky (formerly Saratovsky), Engelssky, Novoburassky and Khvalynsky districts of the Saratov region in the period from 2019 to 2022. To specifically enrich the Puumala virus genome in the samples, were used PCR and developed a specific primer panel. Next, the resulting PCR products were sequenced and the fragments were assembled into one sequence for each segment of the virus genome. To construct phylogenetic trees, the maximum parsimony algorithm was used. RESULTS: Genetic variants of the Puumala virus isolated in the Saratov region have a high degree of genome similarity to each other, which indicates their unity of origin. According to phylogenetic analysis, they all form a separate branch in the cluster formed by hantaviruses from other subjects of the Volga Federal District. The virus variants from the Republics of Udmurtia and Tatarstan, as well as from the Samara and Ulyanovsk regions, are closest to the samples from the Saratov region. CONCLUSION: The data obtained show the presence of a pronounced territorial confinement of strains to certain regions or areas that are the natural biotopes of their carriers. This makes it possible to fairly accurately determine the territory of possible infection of patients and/or the circulation of carriers of these virus variants based on the sequence of individual segments of their genome.


Subject(s)
Genome, Viral , Phylogeny , Puumala virus , Puumala virus/genetics , Puumala virus/classification , Puumala virus/isolation & purification , Humans , Russia/epidemiology , Genetic Variation , Hemorrhagic Fever with Renal Syndrome/virology , Animals
3.
Biomedica ; 44(1): 54-66, 2024 03 31.
Article in English, Spanish | MEDLINE | ID: mdl-38648352

ABSTRACT

Introduction. During the development of the SARS-CoV-2 pandemic in Antioquia, we experienced epidemiological peaks related to the α, É£, ß, ƛ, and δ variants. δ had the highest incidence and prevalence. This lineage is of concern due to its clinical manifestations and epidemiological characteristics. A total of 253 δ sublineages have been reported in the PANGOLIN database. The sublineage identification through genomic analysis has made it possible to trace their evolution and propagation. Objective. To characterize the genetic diversity of the different SARS-CoV-2 δ sublineages in Antioquia and to describe its prevalence. Materials and methods. We collected sociodemographic information from 2,675 samples, and obtained 1,115 genomes from the GISAID database between July 12th, 2021, and January 18th, 2022. From the analyzed genomes, 515 were selected because of their high coverage values (>90%) to perform phylogenetic analysis and to infer allele frequencies of mutations of interest. Results. We characterized 24 sublineages. The most prevalent was AY.25. Mutations of interest as L452R, P681R, and P681H were identified in this sublineage, comprising a frequency close to 0.99. Conclusions. This study identified that the AY.25 sublineage has a transmission advantage compared to the other δ sublineages. This attribute may be related to the presence of the L452R and P681R mutations associated in other studies with higher evasion of the immune system and less efficacy of drugs against SARS-CoV-2.


Introducción. Durante el desarrollo de la pandemia por SARS-CoV-2 en Antioquia se presentaron picos epidemiológicos relacionados con las variantes α, É£, ß, ƛ y δ, donde δ tuvo la mayor incidencia y prevalencia. Este linaje se considera una variante de preocupación dadas las manifestaciones clínicas que desencadena y sus características epidemiológicas. Se han informado 253 sublinajes δ en la base de datos PANGOLIN. La identificación de estos sublinajes mediante análisis genómico ha permitido rastrear su evolución y propagación. Objetivo. Caracterizar la diversidad genética de los diferentes sublinajes δ de SARSCoV-2 en Antioquia y determinar su prevalencia. Materiales y métodos. Se recopiló información sociodemográfica de 2.675 muestras y de 1.115 genomas del repositorio GISAID entre el 12 de julio de 2021 y el 18 de enero de 2022. Se seleccionaron 501 por su alto porcentaje de cobertura (>90 %) para realizar análisis filogenéticos e inferencia de frecuencias alélicas de mutaciones de interés. Resultados. Se caracterizaron 24 sublinajes donde el más prevalente fue AY.25. En este sublinaje se identificaron mutaciones de interés como L452R, P681R y P681H, que comprendían una frecuencia cercana a 0,99. Conclusiones. Este estudio permitió identificar que el sublinaje AY.25 tiene una ventaja de transmisión en comparación con los otros sublinajes δ. Esto puede estar relacionado con la presencia de las mutaciones L452R y P681R que en otros estudios se han visto asociadas con una mayor transmisibilidad, evasión del sistema inmunitario y menor eficacia de los medicamentos contra SARS-CoV-2.


Subject(s)
COVID-19 , Genome, Viral , Phylogeny , SARS-CoV-2 , Colombia/epidemiology , COVID-19/epidemiology , Humans , SARS-CoV-2/genetics , Male , Female , Mutation , Adult , Middle Aged , Pandemics , Young Adult , Aged , Adolescent , Gene Frequency , Genetic Variation
4.
BioData Min ; 17(1): 4, 2024 Feb 15.
Article in English | MEDLINE | ID: mdl-38360720

ABSTRACT

BACKGROUND: 1-methyladenosine (m1A) is a variant of methyladenosine that holds a methyl substituent in the 1st position having a prominent role in RNA stability and human metabolites. OBJECTIVE: Traditional approaches, such as mass spectrometry and site-directed mutagenesis, proved to be time-consuming and complicated. METHODOLOGY: The present research focused on the identification of m1A sites within RNA sequences using novel feature development mechanisms. The obtained features were used to train the ensemble models, including blending, boosting, and bagging. Independent testing and k-fold cross validation were then performed on the trained ensemble models. RESULTS: The proposed model outperformed the preexisting predictors and revealed optimized scores based on major accuracy metrics. CONCLUSION: For research purpose, a user-friendly webserver of the proposed model can be accessed through https://taseersuleman-m1a-ensem1.streamlit.app/ .

5.
World J Microbiol Biotechnol ; 40(3): 93, 2024 Feb 13.
Article in English | MEDLINE | ID: mdl-38349588

ABSTRACT

The aim of the study in this article is to systematise the newly introduced strains of Lactobacillus based on determining the nucleotide sequence of a particular set of their genes (loci). The primary approach employed to address this issue involves conducting a laboratory experiment. During this experiment, a thorough examination was carried out on a set of organic compounds consisting of small DNA elements from the Lactobacillus genus. The Multilocus genotyping method served as the central technique, complemented by additional molecular-biological and population methods. These additional methods were utilized to determine the extent of phylogenetic similarity among pure cultures of Lactobacillus and to classify them accordingly. The article presents the gene isolates that were used for Multilocus typing; the number of L. casei isolates suitable for Multilocus genotyping was revealed; the gene alleles that allowed classifying L. casei isolates into five sequencing types were revealed; the effectiveness of genetic typing method for Multilocus sequencing was substantiated. The article is of practical value for microbiologists and geneticists in the field of molecular biology, as well as for technologists in the food industry. With the development of applied methods in genetic systematics, it has become possible to study pure culture of Lactobacillus species. The application of modern methods of genotypic classification of Lactobacillus species will make it possible to increase the efficiency of using better and safer products in the food industry and medicine.


Subject(s)
Lacticaseibacillus casei , Lacticaseibacillus casei/genetics , Genotype , Phylogeny , Lactobacillus/genetics , Molecular Biology
6.
Int J Mol Sci ; 25(4)2024 Feb 07.
Article in English | MEDLINE | ID: mdl-38396705

ABSTRACT

Various attempts to amplify an AQP11 cDNA from tissues of the spiny dogfish (Squalus acanthias) were made. Two pairs of deoxy-inosine-containing degenerate primers were designed based on conserved amino acid sequences from an AQP11 alignment. These primers yielded some faint bands from gill cDNA that were sequenced. Blast searches with the sequences showed they were not AQP11. An elasmobranch AQP11 nucleotide sequence alignment was produced to identify conserved regions to make further degenerate primers. One primer pair produced a short 148 bp fragment showing particularly strong amplification in gill and intestine. It was sequenced and represented a piece of the AQP11 gene. However, as the fragment may have resulted from contaminating genomic DNA (in total RNA used to make cDNA), 5' and 3' RACE were performed to amplify the two ends of the putative cDNA. Furthermore, 5' and 3' RACE amplifications depend on the presence of a 5' cap nucleotide and a poly A tail, respectively on the putative AQP11 mRNA. Hence, successful amplification was only possible from cDNA and not genomic DNA. Nested RACE amplifications were performed using gill and intestinal RACE cDNA, but none of the DNA fragments sequenced were AQP11. Consequently, the spiny dogfish AQP11 gene may represent a pseudogene.


Subject(s)
Squalus acanthias , Animals , Squalus acanthias/genetics , DNA, Complementary/genetics , Pseudogenes/genetics , Base Sequence , DNA/genetics
7.
Naunyn Schmiedebergs Arch Pharmacol ; 397(7): 5049-5066, 2024 Jul.
Article in English | MEDLINE | ID: mdl-38194106

ABSTRACT

Human gene research studies that describe wrongly identified nucleotide sequence reagents have been mostly identified in journals of low to moderate impact factor, where unreliable findings could be considered to have limited influence on future research. This study examined whether papers describing wrongly identified nucleotide sequences are also published in high-impact-factor cancer research journals. We manually verified nucleotide sequence identities in original Molecular Cancer articles published in 2014, 2016, 2018, and 2020, including nucleotide sequence reagents that were claimed to target circRNAs. Using keywords identified in some 2018 and 2020 Molecular Cancer papers, we also verified nucleotide sequence identities in 2020 Oncogene papers that studied miRNA(s) and/or circRNA(s). Overall, 3.8% (251/6647) and 4.0% (47/1165) nucleotide sequences that were verified in Molecular Cancer and Oncogene papers, respectively, were found to be wrongly identified. Wrongly identified nucleotide sequences were distributed across 18% (91/500) original Molecular Cancer papers, including 38% (31/82) Molecular Cancer papers from 2020, and 40% (21/52) selected Oncogene papers from 2020. Original papers with wrongly identified nucleotide sequences were therefore unexpectedly frequent in two high-impact-factor cancer research journals, highlighting the risks of employing journal impact factors or citations as proxies for research quality.


Subject(s)
Journal Impact Factor , Neoplasms , Periodicals as Topic , Humans , Neoplasms/genetics , Base Sequence , MicroRNAs/genetics , RNA, Circular/genetics , Biomedical Research
8.
J Genet Eng Biotechnol ; 21(1): 94, 2023 Oct 06.
Article in English | MEDLINE | ID: mdl-37801152

ABSTRACT

BACKGROUND: Vibrio species are among the autochthonous bacterial  populations found in surface waters and associated with various life-threatening extraintestinal diseases, especially in human populations with underlying illnesses and wound infections. Presently, very diminutive information exists regarding these species' mutational diversity of virulence and resistance genes. This study evaluated variations in endonucleases and mutational diversity of the virulence and resistance genes of Vibrio isolates, harboring virulence-correlated gene (vcgCPI), dihydropteroate synthase type 1 and type II genes (Sul 1 and 11), (aadA) aminoglycoside (3'') (9) adenylyltransferase gene, (aac(3)-IIa, (aacC2)a, aminoglycoside N(3)-acetyltransferase III, and (strA) aminoglycoside 3'-phosphotransferase resistance genes. METHODS: Using combinations of molecular biology techniques, bioinformatics tools, and sequence analysis. RESULTS: Our result revealed various nucleotide variations in virulence determinants of V. vulnificus (vcgCPI) at nucleotide positions (codon) 73-75 (A → G) and 300-302 (N → S). The aminoglycosides resistance gene (aadA) of Vibrio species depicts a nucleotide difference at position 482 (A → G), while the aminoglycosides resistance gene (sul 1 and 11) showed two variable regions of nucleotide polymorphism (102 and 140). The amino acid differences exist with the nucleotide polymorphism at position 140 (A → E). The banding patterns produced by the restriction enzymes HinP1I, MwoI, and StyD4I showed significant variations. Also, the restriction enzyme digestion of protein dihydropteroate synthase type 1 and type II genes (Sul 1 and 11) differed significantly, while enzymes DpnI and Hinf1 indicate no significant differences. The restriction enzyme NlaIV showed no band compared to reference isolates from the GenBank. However, the resistant determinants show significant point nucleotide mutation, which does not produce any amino acid change with diverse polymorphic regions, as revealed in the restriction digest profile. CONCLUSION: The described virulence and resistance determinants possess specific polymorphic locus relevant to pathogenomics studies, pharmacogenomic, and control of such water-associated strains.

9.
Open Vet J ; 13(6): 732-741, 2023 06.
Article in English | MEDLINE | ID: mdl-37545700

ABSTRACT

Background: Infectious diseases of young and adult birds with respiratory syndrome are a significant deterrent to the development of industrial poultry farming due to decreased productivity and significant mortality. The only effective method of combating viral diseases is timely and targeted vaccination, which largely depends on laboratory diagnostic results. Aim: This article aims to study the real-time reverse transcription polymerase chain reaction, (RT-PCR) which has the prospect of more effective diagnosis of vaccine strains of chicken infectious bronchitis and Newcastle disease. Methods: The fastest and most accurate method for the differential diagnosis of pathogens in an associative viral infection is RT-PCR. The method proposed in the article for selecting primers for amplification made it possible to use this method for the simultaneous interspecies differential diagnosis of two or more viral agents, significantly accelerating their diagnosis. Results: The correlation of the nucleotide sequence obtained from sequencing to a specific virus strain is complicated by the lack of a single nomenclature mechanism for separating genetic groups. Conclusion: The results of this study will allow easy and fast typing of sequences into known and databased virus strains and avoid further confusion in the nomenclature of genetic groups in the future.


Subject(s)
Bronchitis , Communicable Diseases , Coronavirus Infections , Newcastle Disease , Poultry Diseases , Vaccines , Animals , Chickens , Newcastle Disease/diagnosis , Newcastle Disease/prevention & control , Reverse Transcriptase Polymerase Chain Reaction/veterinary , Poultry Diseases/diagnosis , Poultry Diseases/prevention & control , Coronavirus Infections/diagnosis , Coronavirus Infections/prevention & control , Coronavirus Infections/veterinary , Communicable Diseases/veterinary , Bronchitis/veterinary
10.
Comput Biol Med ; 160: 107030, 2023 06.
Article in English | MEDLINE | ID: mdl-37196456

ABSTRACT

Methylation is a major DNA epigenetic modification for regulating the biological processes without altering the DNA sequence, and multiple types of DNA methylations have been discovered, including 6mA, 5hmC, and 4mC. Multiple computational approaches were developed to automatically identify the DNA methylation residues using machine learning or deep learning algorithms. The machine learning (ML) based methods are difficult to be transferred to the other predicting tasks of the DNA methylation sites using additional knowledge. Deep learning (DL) may facilitate the transfer learning of knowledge from similar tasks, but they are often ineffective on small datasets. This study proposes an integrated feature representation framework EpiTEAmDNA based on the strategies of transfer learning and ensemble learning, which is evaluated on multiple DNA methylation types across 15 species. EpiTEAmDNA integrates convolutional neural network (CNN) and conventional machine learning methods, and shows improved performances than the existing DL-based methods on small datasets when no additional knowledge is available. The experimental data suggests that the EpiTEAmDNA models may be further improved via transfer learning based on additional knowledge. The evaluation experiments on the independent test datasets also suggest that the proposed EpiTEAmDNA framework outperforms the existing models in most prediction tasks of the 3 DNA methylation types across 15 species. The source code, pre-trained global model, and the EpiTEAmDNA feature representation framework are freely available at http://www.healthinformaticslab.org/supp/.


Subject(s)
Machine Learning , Neural Networks, Computer , DNA/genetics , Epigenesis, Genetic , DNA Methylation
11.
Bull Exp Biol Med ; 174(6): 749-753, 2023 Apr.
Article in English | MEDLINE | ID: mdl-37160796

ABSTRACT

A recombinant form of pneumolysin from Streptococcus pneumoniae was obtained. By using Vector NTI Advance 11.0 bioinformatic analysis software, specific primers were designed in order to amplify the genome fragment of strain No. 3358 S. pneumoniae serotype 19F containing the nucleotide sequence encoding the full-length pneumolysin protein. A PCR product with a molecular weight corresponding to the nucleotide sequence of the S. pneumoniae genome fragment encoding the full-length pneumolysin was obtained. An expression system for recombinant pneumolysin in E. coli was constructed. Sequencing confirmed the identity of the inserted nucleotide sequence encoding the full-length recombinant pneumolysin synthesized in E. coli M15 strain. Purification of the recombinant protein was performed by affinity chromatography using Ni-Sepharose in 8 M urea buffer solution. Confirmation of the recombinant protein was performed by immunoblotting with monoclonal antibodies to pneumolysin.


Subject(s)
Escherichia coli , Streptococcus pneumoniae , Streptococcus pneumoniae/genetics , Escherichia coli/genetics , Escherichia coli/metabolism , Streptolysins/genetics , Bacterial Proteins/genetics , Bacterial Proteins/metabolism , Recombinant Proteins/genetics , Recombinant Proteins/metabolism
12.
Biomedicines ; 11(3)2023 Mar 08.
Article in English | MEDLINE | ID: mdl-36979802

ABSTRACT

Estrogens enhance cellular mitochondrial activity. The diminution of female hormones during menopause may have an effect on the mitochondrial genome and the expression of mitochondrial proteins. Hence, oxidative stress and the pro-inflammatory state contribute to the formation of systemic illnesses including arterial hypertension (AH). This study aimed to determine the types and frequency of mutations in the mitochondrial DNA (mtDNA) nucleotide sequence in the hypervariable regions 1 and 2 (HV1 and HV2) and the 12S RNA coding sequence of the D-loop in postmenopausal women with hypertension. In our study, 100 women were investigated, 53 of whom were postmenopausal and 47 of whom were premenopausal (53.9 ± 3.7 years vs. 47.7 ± 4.2 years, respectively). Of those studied, 35 premenopausal and 40 postmenopausal women were diagnosed with AH. A medical checkup with 24 h monitoring of blood pressure (RR) and heart rate was undertaken (HR). The polymorphism of the D-loop and 12S rDNA region of mtDNA was examined. Changes in the nucleotide sequence of mtDNA were observed in 23% of the group of 100 women. The changes were identified in 91.3% of HV1 and HV2 regions, 60.9% of HV1 segments, 47.5% of HV2 regions, and 43.5% of 12S rDNA regions. The frequency of nucleotide sequence alterations in mtDNA was substantially higher in postmenopausal women (34%) than in premenopausal women (10.6%), p = 0.016. A higher frequency of changes in HV1 + HV2 sections in postmenopausal women (30.2%) compared to the premenopausal group (10.6%) was detected, p = 0.011. Only postmenopausal women were found to have modifications to the HV2 segment and the 12S rDNA region. After menopause, polymorphism in the mtDNA region was substantially more frequent in women with arterial hypertension than before menopause (p = 0.030; 37.5% vs. 11.5%). Comparable findings were observed in the HV2 and HV1 regions of the AH group (35% vs. 11.5%), p = 0.015, in the HV1 segment (25% vs. 11.5%), p = 0.529, and in the HV2 segment, 12S rDNA (25% vs. 0%). More than 80% of all changes in nucleotide sequence were homoplasmic. The mtDNA polymorphisms of the nucleotide sequence in the HV1 and HV2 regions, the HV2 region alone, and the 12S RNA coding sequence were associated with estrogen deficiency and a more severe course of arterial hypertension, accompanied by symptoms of adrenergic stimulation.

13.
Viruses ; 15(2)2023 01 19.
Article in English | MEDLINE | ID: mdl-36851496

ABSTRACT

Pepper (Capsicum spp.; Family: Solanaceae; 2n = 24) is an important crop cultivated worldwide for the consumption of its fresh and dried processed fruits. Pepper fruits are used as raw materials in a wide variety of industrial processes. As a multipurpose vegetable crop, there is a need to increase the yield. However, yield productivity of pepper is severely constrained by infectious plant pathogens, including viruses, bacteria, fungi, and oomycetes. The pepper mild mottle virus (PMMoV) is currently one of the most damaging pathogens associated with yield losses in pepper production worldwide. In addition to impacts on pepper productivity, PMMoV has been detected in domestic and aquatic water resources, as well as in the excreta of animals, including humans. Therefore, PMMoV has been suggested as a potential indicator of domestic water quality. These findings present additional concerns and trigger the need to control the infectious pathogen in crop production. This review provides an overview of the distribution, economic impacts, management, and genome sequence variation of some isolates of PMMoV. We also describe genetic resources available for crop breeding against PMMoV.


Subject(s)
Communicable Diseases , Tobamovirus , Animals , Humans , Water Quality , Plant Breeding , Tobamovirus/genetics
14.
Anim Biotechnol ; 34(3): 658-663, 2023 Jun.
Article in English | MEDLINE | ID: mdl-34895066

ABSTRACT

The thyroid stimulating hormone receptor (TSHR), a glycoprotein hormone receptor, plays an important role in metabolic regulation and photoperiod control in the time of reproduction in birds and mammals. Previous genome-wide association studies revealed that the TSHR gene was related to reproduction and its function was identified in female reproduction, but rare studies reported the polymorphism of TSHR gene. However, the molecular mutations of the TSHR gene in sheep have not been reported so far. Herein, we explored potential polymorphisms of the sheep TSHR gene, and a 29 bp nucleotide sequence variant (rs1089565492) was identified in the AUW sheep. There were three genotypes of the 29 bp variant locus detected which named 'II' 'DD' and 'ID' been identified. Association analysis results showed the 29 bp variant was significantly associated with the litter size of the AUW sheep (p < 0.05). This finding suggests that the 29 bp nucleotide sequence variant within TSHR gene could be a candidate marker of reproduction traits for sheep breeding improving through the marker-assisted selection (MAS).


Subject(s)
Genome-Wide Association Study , Receptors, Thyrotropin , Sheep/genetics , Animals , Female , Pregnancy , Receptors, Thyrotropin/genetics , Litter Size/genetics , Genotype , Mutation/genetics , Polymorphism, Single Nucleotide , Mammals/genetics
15.
Cytogenet Genome Res ; 162(6): 323-333, 2022.
Article in English | MEDLINE | ID: mdl-36535261

ABSTRACT

The greater long-tailed hamster (Tscherskia triton, Cricetinae) has a unique karyotype (2n = 28), containing 11 pairs of acrocentric chromosomes with large C-band-positive centromeric heterochromatin blocks. To understand the origin and evolutionary process of heterochromatin in this species, we isolated novel families of chromosome site-specific highly repetitive DNA sequences from TaqI-digested genomic DNA and then characterized them by chromosome in situ and filter hybridization. The TaqI-families of repetitive sequences were classified into 2 types by their genome organization and chromosomal distribution: the 110-bp repeated sequence organized in large tandem arrays (as satellite DNA), localized to centromeric C-positive heterochromatin of acrocentric autosomes (chromosomes 1-11) and submetacentric X chromosome, and the 405-bp repeated sequence that was composed of 30-32-bp internal repeats, distributed in the pericentromeric region on the short arms of X and Y chromosomes. The repetitive sequences did not cross-hybridize with genomic DNA of any genera of Cricetinae (Mesocricetus, Cricetulus, and Phodopus). These results suggest that the 110-bp and 405-bp repeats rapidly diverged in the lineage of T. triton, evolving in a concerted manner among autosomes and X chromosome and within X and Y chromosomes, respectively. The 110-bp centromeric repeat contained a 17-bp motif in which 9 bases are essential for binding with the centromere-associated protein CENP-B, suggesting the possibility that the 110-bp major satellite DNA carrying the 17-bp motif may have a role in the formation of specified structure and/or function of centromeres in T. triton.


Subject(s)
DNA, Satellite , Heterochromatin , Cricetinae , Animals , Base Sequence , Heterochromatin/genetics , DNA, Satellite/genetics , In Situ Hybridization, Fluorescence , Repetitive Sequences, Nucleic Acid/genetics , Centromere/genetics , DNA , Karyotyping
16.
Plant Dis ; 2022 Nov 28.
Article in English | MEDLINE | ID: mdl-36444140

ABSTRACT

Perilla mosaic virus (PerMV; the genus Emaravirus in the family Fimoviridae) has a multiple, negative-sense, single-stranded RNA genome (ICTV, 2018). PerMV has been reported in Japan, where it was transmitted by an eriophyid mite species (Acari: Eriophyidae) to Perilla frutescens (L.) Britton var. crispa (Kubota et al., 2020). In September 2021, typical symptoms of the virus including yellow flecks, mosaic symptoms, and malformation were observed in leaves of P. frutescens in a cultivated field in Iseo-myeon, Wanju, South Korea (Suppl. Fig. 1). Visual estimates indicated that symptom incidence reached 70%, and the top leaves of perilla plants exhibited more severe symptoms and leaf distortion. To identify the virus species accurately, total RNA was extracted from five symptomatic perilla leaves collected using the RNeasy Plant Mini Kit (Qiagen, Hilden, Germany) then cDNAs were amplified by reverse-transcription polymerase chain reaction (RT-PCR) using two pairs of primers to PerMV specific primer set designed to amplify 412- and 491-bp cDNAs of the nucleocapsid protein gene RNA 3 and movement protein gene RNA 4, respectively (Suppl. Table). Single-infection of PerMV in symptomatic Korean perilla plants was confirmed by high-throughput sequence (HTS) analysis and de novo transcriptome assembly using the Illumina HiSeq 4000 platform (Macrogen Inc., Seoul, Korea). The assembled sequences were aligned with viral reference genomes through searches performed using the BLASTn tool. Seven contigs (597-7,213 bp) revealed 92.09-97.37% nucleotide homology with RNAs of the isolate PerMV_Kochi_Nankoku_2011 (accession numbers LC496090 to LC496099) in the GenBank database. Other viruses including turnip mosaic virus and cucumber green mottle mosaic virus were not identified by HTS analysis (Cho et al., 2021; Park et al., 2020; Song et al., 2022). Seven RNA genomes of PerMV were confirmed by RT-PCR using specific primer sets designed to amplify part of each genome (Suppl. Table 1 and Fig. 2). The complete nucleotide sequences of PerMV (named IS isolate) RNA 1-7 were determined to be 7,177, 2,089, 1,094, 1,302, 1,079, 1,098, and 995 bp in length, respectively; these were deposited in GenBank (LC721296-LC721303). Sap from a symptomatic leaf sample confirmed for single infection was inoculated mechanically onto the leaves of 10 healthy P. frutescens seedlings, which developed the same PerMV symptoms within 3 weeks. These results indicate that PerMV is the causal agent of viral disease in Korean perilla plants cultivated in South Korea. To our knowledge, this is the first report of a perilla mosaic emaravirus infecting to Korean perilla, P. frutescens in South Korea.

17.
Genes (Basel) ; 13(9)2022 08 25.
Article in English | MEDLINE | ID: mdl-36140696

ABSTRACT

Identification of miRNA-mRNA interactions is critical to understand the new paradigms in gene regulation. Existing methods show suboptimal performance owing to inappropriate feature selection and limited integration of intuitive biological features of both miRNAs and mRNAs. The present regularized least square-based method, mintRULS, employs features of miRNAs and their target sites using pairwise similarity metrics based on free energy, sequence and repeat identities, and target site accessibility to predict miRNA-target site interactions. We hypothesized that miRNAs sharing similar structural and functional features are more likely to target the same mRNA, and conversely, mRNAs with similar features can be targeted by the same miRNA. Our prediction model achieved an impressive AUC of 0.93 and 0.92 in LOOCV and LmiTOCV settings, respectively. In comparison, other popular tools such as miRDB, TargetScan, MBSTAR, RPmirDIP, and STarMir scored AUCs at 0.73, 0.77, 0.55, 0.84, and 0.67, respectively, in LOOCV setting. Similarly, mintRULS outperformed other methods using metrics such as accuracy, sensitivity, specificity, and MCC. Our method also demonstrated high accuracy when validated against experimentally derived data from condition- and cell-specific studies and expression studies of miRNAs and target genes, both in human and mouse.


Subject(s)
MicroRNAs , Animals , Gene Expression Regulation , Humans , Least-Squares Analysis , Mice , MicroRNAs/metabolism , RNA, Messenger/metabolism
18.
Virusdisease ; 33(2): 194-207, 2022 Jun.
Article in English | MEDLINE | ID: mdl-35991698

ABSTRACT

Garden croton (Codiaeum variegatum L.) plants showing typical begomovirus symptoms of vein twisting, enation and curling were collected from different gardens at Varanasi, Uttar Pradesh state of India ranged from 20 to 30%. All the 10 ten (CR1-CR10) infected samples of garden croton resulted in expected amplicon of 1.2 Kb in PCR specific to begomoviruses. No amplification was obtained for betasatellite and alphasatellite specific primers. The complete genome sequence of DNA-A and DNA-B for two isolates (CR1 and CR2) was obtained through rolling cycle amplification (RCA) and comparisons were made with other begomoviruses using Sequence Demarcation Tool (SDT) which revealed that, DNA-A of two isolates, CR1 (Acc.No.: MW816855) and CR2 (Acc.No.: MW816856) showed maximum nucleotide (nt) identity of 85.7-85.9% with Tomato leaf curl Karnataka virus, which is below the threshold percentage of begomovirus species demarcation, hence considered as novel begomovirus and proposed the name Garden croton enation leaf curl virus (CroELCuV) [IN: Varanasi: Croton: 18]. Further, DNA-B these isolates shared maximum nt identity of 91.0-92.2% (DNA-B) with Tomato leaf curl New Delhi virus. Recombination and GC plot analysis showed that the recombination occured at in low GC content regions of DNA-A and DNA-B of the CroELCuV and are derived from the previously reported several begomoviruses. This is the first record of novel bipartite begomovirus associated with vein twisting, enation and leaf curling of disease of garden garden croton in India and world. Supplementary Information: The online version contains supplementary material available at 10.1007/s13337-022-00772-0.

19.
J Infect Chemother ; 28(11): 1523-1530, 2022 Nov.
Article in English | MEDLINE | ID: mdl-35963598

ABSTRACT

PURPOSE: In Japan, the introduction of pneumococcal conjugate vaccine (PCV) in children has decreased vaccine-type (VT) pneumococcal infections caused by penicillin (PEN)-non-susceptible Streptococcus pneumoniae. PEN-non-susceptible strains have gradually emerged among non-vaccine types (NVT). In this study, we aim to investigate the pbp gene mutations and the characteristics of PEN-binding proteins (PBPs) that mediate PEN resistance in NVT strains. MATERIALS AND METHODS: Pneumococcal 41 strains of NVT isolated from patients with invasive pneumococcal infection were randomly selected. Nucleotide sequences for pbp genes encoding PBP1A, PBP2X, and PBP2B were analyzed, and amino acid (AA) substitutions that contribute to ß-lactam resistance were identified. In addition, the three-dimensional (3D) structure of abnormal PBPs in the resistant strain was compared with that of a reference R6 strain via homology modeling. RESULTS: In PEN-non-susceptible NVT strains, Thr to Ala or Ser substitutions in the conserved AA motif (STMK) were important in PBP1A and PBP2X. In PBP2B, substitutions from Thr to Ala, adjacent to the SSN motif, and from Glu to Gly were essential. The 3D structure modeling indicated that AA substitutions are characterized by accumulation around the enzymatic active pocket in PBPs. Many AA substitutions detected throughout the PBP domains were not associated with resistance, except for AA substitutions in or adjacent to AA motifs. Clonal complexes and sequence types showed that almost all NVT cases originated in other countries and spread to Japan via repeat mutations. CONCLUSIONS: NVT with diverse AA substitutions increased gradually with pressure from both antimicrobial agents and vaccines.


Subject(s)
Pneumococcal Infections , Streptococcus pneumoniae , Amino Acid Substitution , Bacterial Proteins/genetics , Bacterial Proteins/metabolism , Child , Humans , Microbial Sensitivity Tests , Penicillin Resistance/genetics , Penicillin-Binding Proteins/genetics , Penicillins , Pneumococcal Infections/genetics , Pneumococcal Infections/prevention & control
20.
Patterns (N Y) ; 3(9): 100562, 2022 Sep 09.
Article in English | MEDLINE | ID: mdl-35818472

ABSTRACT

Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) genome data are essential for epidemiology, vaccine development, and tracking emerging variants. Millions of SARS-CoV-2 genomes have been sequenced during the pandemic. However, downloading SARS-CoV-2 genomes from databases is slow and unreliable, largely due to suboptimal choice of compression method. We evaluated the available compressors and found that Nucleotide Archival Format (NAF) would provide a drastic improvement compared with current methods. For Global Initiative on Sharing Avian Flu Data's (GISAID) pre-compressed datasets, NAF would increase efficiency 52.2 times for gzip-compressed data and 3.7 times for xz-compressed data. For DNA DataBank of Japan (DDBJ), NAF would improve throughput 40 times for gzip-compressed data. For GenBank and European Nucleotide Archive (ENA), NAF would accelerate data distribution by a factor of 29.3 times compared with uncompressed FASTA. This article provides a tutorial for installing and using NAF. Offering a NAF download option in sequence databases would provide a significant saving of time, bandwidth, and disk space and accelerate biological and medical research worldwide.

SELECTION OF CITATIONS
SEARCH DETAIL
...