Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 17 de 17
Filter
Add more filters










Publication year range
1.
Anim Reprod ; 21(2): e20230131, 2024.
Article in English | MEDLINE | ID: mdl-38912163

ABSTRACT

In reproductive technologies, uncovering the molecular aspects of oocyte and embryo competence under different conditions is crucial for refining protocols and enhancing efficiency. RNA-seq generates high-throughput data and provides transcriptomes that can undergo additional computational analyses. This study presented the transcriptomic profiles of in vitro matured oocytes and blastocysts produced in vitro from buffalo crossbred (Bubalus bubalis), coupled with gene co-expression and module preservation analysis. Cumulus Oophorus Complexes, obtained from slaughterhouse-derived ovaries, were subjected to in vitro maturation to yield metaphase II oocytes (616) or followed in vitro fertilization and culture to yield blastocysts for sequencing (526). Oocyte maturation (72%, ±3.34 sd) and embryo development (21.3%, ±4.18 sd) rates were obtained from three in vitro embryo production routines following standard protocols. Sequencing of 410 metaphase II oocytes and 70 hatched blastocysts (grade 1 and 2) identified a total of 13,976 genes, with 62% being ubiquitously expressed (8,649). Among them, the differentially expressed genes (4,153) and the strongly variable genes with the higher expression (fold-change above 11) were highlighted in oocytes (BMP15, UCHL1, WEE1, NLRPs, KPNA7, ZP2, and ZP4) and blastocysts (APOA1, KRT18, ANXA2, S100A14, SLC34A2, PRSS8 and ANXA2) as representative indicators of molecular quality. Additionally, genes exclusively found in oocytes (224) and blastocysts (2,200) with specific biological functions were identified. Gene co-expression network and module preservation analysis revealed strong preservation of functional modules related to exosome components, steroid metabolism, cell proliferation, and morphogenesis. However, cell cycle and amino acid transport modules exhibited weak preservation, which may reflect differences in embryo development kinetics and the activation of cell signaling pathways between buffalo and bovine. This comprehensive transcriptomic profile serves as a valuable resource for assessing the molecular quality of buffalo oocytes and embryos in future in vitro embryo production assays.

2.
BMC Bioinformatics ; 25(1): 217, 2024 Jun 18.
Article in English | MEDLINE | ID: mdl-38890569

ABSTRACT

BACKGROUND: Tandem repeats are specific sequences in genomic DNA repeated in tandem that are present in all organisms. Among the subcategories of TRs we have Satellite repeats, that is divided into macrosatellites, minisatellites, and microsatellites, being the last two of specific interest because they can identify polymorphisms between organisms due to their instability. Currently, most mining tools focus on Simple Sequence Repeats (SSR) mining, and only a few can identify SSRs in the coding regions. RESULTS: We developed a microsatellite mining software called SATIN (Micro and Mini SATellite IdentificatioN tool) based on a new sliding window algorithm written in C and Python. It represents a new approach to SSR mining by addressing the limitations of existing tools, particularly in coding region SSR mining. SATIN is available at https://github.com/labgm/SATIN.git . It was shown to be the second fastest for perfect and compound SSR mining. It can identify SSRs from coding regions plus SSRs with motif sizes bigger than 6. Besides the SSR mining, SATIN can also analyze SSRs polymorphism on coding-regions from pre-determined groups, and identify SSRs differentially abundant among them on a per-gene basis. To validate, we analyzed SSRs from two groups of Escherichia coli (K12 and O157) and compared the results with 5 known SSRs from coding regions. SATIN identified all 5 SSRs from 237 genes with at least one SSR on it. CONCLUSIONS: The SATIN is a novel microsatellite search software that utilizes an innovative sliding window technique based on a numerical list for repeat region search to identify perfect, and composite SSRs while generating comprehensible and analyzable outputs. It is a tool capable of using files in fasta or GenBank format as input for microsatellite mining, also being able to identify SSRs present in coding regions for GenBank files. In conclusion, we expect SATIN to help identify potential SSRs to be used as genetic markers.


Subject(s)
Data Mining , Microsatellite Repeats , Polymorphism, Genetic , Software , Microsatellite Repeats/genetics , Data Mining/methods , Algorithms , Open Reading Frames/genetics , DNA, Satellite/genetics
3.
J Appl Microbiol ; 135(5)2024 May 01.
Article in English | MEDLINE | ID: mdl-38627246

ABSTRACT

AIMS: The present study aimed to use a conventional and metagenomic approach to investigate the microbiological diversity of water bodies in a network of drainage channels and rivers located in the central area of the city of Belém, northern Brazil, which is considered one of the largest cities in the Brazilian Amazon. METHODS AND RESULTS: In eight of the analyzed points, both bacterial and viral microbiological indicators of environmental contamination-physical-chemical and metals-were assessed. The bacterial resistance genes, drug resistance mechanisms, and viral viability in the environment were also assessed. A total of 473 families of bacteria and 83 families of viruses were identified. Based on the analysis of metals, the levels of three metals (Cd, Fe, and Mn) were found to be above the recommended acceptable level by local legislation. The levels of the following three physicochemical parameters were also higher than recommended: biochemical oxygen demand, dissolved oxygen, and turbidity. Sixty-three bacterial resistance genes that conferred resistance to 13 different classes of antimicrobials were identified. Further, five mechanisms of antimicrobial resistance were identified and viral viability in the environment was confirmed. CONCLUSIONS: Intense human actions combined with a lack of public policies and poor environmental education of the population cause environmental degradation, especially in water bodies. Thus, urgent interventions are warranted to restore the quality of this precious and scarce asset worldwide.


Subject(s)
Bacteria , Metagenomics , Water Microbiology , Brazil , Bacteria/genetics , Bacteria/isolation & purification , Bacteria/classification , Bacteria/drug effects , Environmental Health , Rivers/microbiology , Rivers/virology , Viruses/genetics , Viruses/isolation & purification , Environmental Monitoring , Drug Resistance, Bacterial/genetics , Humans , Cities , Metals/pharmacology
4.
Braz J Vet Med ; 45: e000723, 2023.
Article in English | MEDLINE | ID: mdl-37859867

ABSTRACT

The canine circovirus (CanCV) is a single-stranded DNA virus that has become an important emerging virus associated with gastroenteritis in dogs worldwide. In the present study, the CanCV was detected by PCR in 15% (22/147) of dogs from animal shelters in Belém, between 2019 and 2020. We observed an association between the CanCV infection and the presence of diarrhea in animals younger than one year of age (p > 0.01). The Brazilian strains were grouped in Chinese genotypes, with 99.54 to 100% nucleotilde homology. The GMRF Bayesian Skyride used the molecular clock model, which was the best suited technique to plot the dataset. The most recent common ancestor (TMRCA) was estimated in 2017, with the evolution rate of 1.6 x 10-3 s/s/y. The viral family diversity was also investigated, with emphasis on the families of the enteric pathogenic viruses Parvoviridae, Picornaviridae and Astroviridae, which were detected in the CanCV positive pooled samples. This study highlights the importance of the CanCV as an emergent virus that causes diarrhea in Brazilian dogs. The results found herein contribute to the understanding of the role of CanCV in enteric diseases and in the evolutionary molecular characterization of the circulating genotypes. Furthermore, we increased the understanding of the fecal virome in dogs with diarrhea, providing data for the monitoring and prevention viral gastroenteric diseases in domestic animals.


O circovírus canino (CanCV) é um vírus de DNA de fita simples que se tornou um importante vírus emergente associado à gastroenterite em cães em todo o mundo. No presente estudo, o CanCV foi detectado por PCR em 15% (22/147) dos cães de abrigos de animais em Belém, entre 2019 e 2020. Observamos uma associação entre a infecção pelo CanCV e a presença de diarreia em animais menores de um ano de idade (p > 0,01). As linhagens brasileiras foram agrupadas em genótipos chineses, com 99,54 a 100% de homologia nucleotídica. O GMRF Bayesian Skyride foi o modelo de relógio molecular utilizado, sendo o método mais adequado para o conjunto de dados. O ancestral comum mais recente (TMRCA) foi estimado em 2017, com taxa de evolução de 1,6 x 10-3 s/s/ano. A diversidade de famílias virais também foi investigada, com destaque para as famílias dos vírus patogênicos entéricos Parvoviridae, Picornaviridae e Astroviridae, que foram detectadas nos pools de amostras positivas para CanCV. Este estudo destaca a importância do CanCV como um vírus emergente que causa diarreia em cães brasileiros. Os resultados aqui encontrados contribuem para a compreensão do papel do CanCV nas doenças entéricas e na caracterização molecular evolutiva dos genótipos circulantes. Além disso, aumentamos a compreensão do viroma fecal em cães com diarreia, fornecendo dados para o monitoramento e prevenção de doenças gastroentéricas virais em animais domésticos.

5.
Front Bioinform ; 2: 931583, 2022.
Article in English | MEDLINE | ID: mdl-36304273

ABSTRACT

Corynebacterium pseudotuberculosis is the causative bacterial agent of the zoonotic disease known as caseous lymphadenitis, and it presents several mechanisms of response to host defenses, including the presence of virulence factors (VFs). The genomes of these bacteria have several polymorphic markers known as microsatellites, or simple sequence repeats (SSRs), that can be used to characterize the genome, to study possible polymorphisms existing among strains, and to verify the effects of such polymorphic markers in coding regions and regions associated with VFs. In this study, several SSRs were identified within coding regions throughout the 54 genomes of this species, revealing possible polymorphisms associated with coding regions that could be used as strain-specific or serotype-specific identifiers of C. pseudotuberculosis. The similarities associated with SSRs amongst the different serum variants of C. pseudotuberculosis, biovars equi and ovis, were also evaluated, and it was possible to identify SSRs located in coding regions responsible for a VF enrolled in pathogenesis known to mediate bacterial adherence (SpaH-type pili virulence factor). Phylogenetic analyses revealed that strains sharing SSR patterns, including the possible polymorphisms identified in the same position of gene-coding regions, were displayed by strains with a common ancestor, corroborating with the Genome Tree Report of the NCBI. Statistical analysis showed that the microsatellite groups belonging to equi and ovis biovars have a significance of 0.006 (p-value) in similarity, thus indicating them as good biomarker candidates for C. pseudotuberculosis.

6.
Front Public Health ; 10: 869886, 2022.
Article in English | MEDLINE | ID: mdl-35548089

ABSTRACT

Despite the importance of understanding the ecology of freshwater viruses, there are not many studies on the subject compared to marine viruses. The microbiological interactions in these environments are still poorly known, especially between bacteriophages and their host bacteria and between cyanophages and cyanobacteria. Lake Bologna, Belém, capital of the Brazilian State of Pará, is a water source that supplies the city and its metropolitan region. However, it remains unexplored regarding the contents of its virome and viral diversity composition. Therefore, this work aims to explore the taxonomic diversity of DNA viruses in this lake, especially bacteriophages and cyanophages, since they can act as transducers of resistance genes and reporters of water quality for human consumption. We used metagenomic sequencing data generated by previous studies. We analyzed it at the taxonomic level using the tools Kraken2, Bracken, and Pavian; later, the data was assembled using Genome Detective, which performs the assembly of viruses. The results observed here suggest the existence of a widely diverse viral community and established microbial phage-regulated dynamics in Lake Bolonha. This work is the first ever to describe the virome of Lake Bolonha using a metagenomic approach based on high-throughput sequencing, as it contributes to the understanding of water-related public health concerns regarding the spreading of antibiotic resistance genes and population control of native bacteria and cyanobacteria.


Subject(s)
Bacteriophages , Viruses , Bacteria/genetics , Bacteriophages/genetics , Humans , Lakes/microbiology , Metagenome , Metagenomics , Viruses/genetics
8.
Sci Rep ; 8(1): 9670, 2018 06 25.
Article in English | MEDLINE | ID: mdl-29942087

ABSTRACT

High-throughput sequencing technologies are a milestone in molecular biology for facilitating great advances in genomics by enabling the deposit of large volumes of biological data to public databases. The availability of such data has made possible the comparative genomic analysis through pipelines, using the entire gene repertoire of genomes. However, a large number of unfinished genomes exist in public databases; their number is approximately 16-fold higher than the number of complete genomes, which creates bias during comparative analyses. Therefore, the present work proposes a new tool called Pan4Drafts, an automated pipeline for pan-genomic analysis of draft prokaryotic genomes to maximize the representation and accuracy of the gene repertoire of unfinished genomes by using reads from sequencing data. Pan4Draft allows to perform comparative analyses using different methodologies such as combining complete and draft genomes, using only draft genomes or only complete genomes. Pan4Draft is available at http://www.computationalbiology.ufpa.br/pan4drafts and the test dataset is available at https://sourceforge.net/projects/pan4drafts .


Subject(s)
Genomics/methods , Software , Databases, Genetic , Genomics/standards , Molecular Sequence Annotation , Reference Standards , User-Computer Interface
9.
Data Brief ; 18: 1581-1587, 2018 Jun.
Article in English | MEDLINE | ID: mdl-29904660

ABSTRACT

Herein, we present the draft genome of Trametes villosa isolate CCMB561, a wood-decaying Basidiomycota commonly found in tropical semiarid climate. The genome assembly was 57.98 Mb in size with an L50 of 691. A total of 16,711 putative protein-encoding genes was predicted, including 590 genes coding for carbohydrate-active enzymes (CAZy), directly involved in the decomposition of lignocellulosic materials. This is the first genome of this species of high interest in bioenergy research. The draft genome of Trametes villosa isolate CCMB561 will provide an important resource for future investigations in biofuel production, bioremediation and other green technologies.

10.
Front Genet ; 9: 644, 2018.
Article in English | MEDLINE | ID: mdl-30619469

ABSTRACT

This study developed a computational tool with a graphical interface and a web-service that allows the identification of phage regions through homology search and gene clustering. It uses G+C content variation evaluation and tRNA prediction sites as evidence to reinforce the presence of prophages in indeterminate regions. Also, it performs the functional characterization of the prophages regions through data integration of biological databases. The performance of PhageWeb was compared to other available tools (PHASTER, Prophinder, and PhiSpy) using Sensitivity (Sn) and Positive Predictive Value (PPV) tests. As a reference for the tests, more than 80 manually annotated genomes were used. In the PhageWeb analysis, the Sn index was 86.1% and the PPV was approximately 87%, while the second best tool presented Sn and PPV values of 83.3 and 86.5%, respectively. These numbers allowed us to observe a greater precision in the regions identified by PhageWeb while compared to other prediction tools submitted to the same tests. Additionally, PhageWeb was much faster than the other computational alternatives, decreasing the processing time to approximately one-ninth of the time required by the second best software. PhageWeb is freely available at http://computationalbiology.ufpa.br/phageweb.

11.
PLoS One ; 12(5): e0178154, 2017.
Article in English | MEDLINE | ID: mdl-28542514

ABSTRACT

With increased production of genomic data since the advent of next-generation sequencing (NGS), there has been a need to develop new bioinformatics tools and areas, such as comparative genomics. In comparative genomics, the genetic material of an organism is directly compared to that of another organism to better understand biological species. Moreover, the exponentially growing number of deposited prokaryote genomes has enabled the investigation of several genomic characteristics that are intrinsic to certain species. Thus, a new approach to comparative genomics, termed pan-genomics, was developed. In pan-genomics, various organisms of the same species or genus are compared. Currently, there are many tools that can perform pan-genomic analyses, such as PGAP (Pan-Genome Analysis Pipeline), Panseq (Pan-Genome Sequence Analysis Program) and PGAT (Prokaryotic Genome Analysis Tool). Among these software tools, PGAP was developed in the Perl scripting language and its reliance on UNIX platform terminals and its requirement for an extensive parameterized command line can become a problem for users without previous computational knowledge. Thus, the aim of this study was to develop a web application, known as PanWeb, that serves as a graphical interface for PGAP. In addition, using the output files of the PGAP pipeline, the application generates graphics using custom-developed scripts in the R programming language. PanWeb is freely available at http://www.computationalbiology.ufpa.br/panweb.


Subject(s)
Genomics , Software , User-Computer Interface , Algorithms , Computational Biology , Computer Graphics , Databases, Genetic , Escherichia coli/classification , Escherichia coli/genetics , Genome, Bacterial , High-Throughput Nucleotide Sequencing , Internet , Phylogeny , Programming Languages
12.
Genome Announc ; 5(13)2017 Mar 30.
Article in English | MEDLINE | ID: mdl-28360158

ABSTRACT

We report here the draft genome sequence of Corynebacterium pseudotuberculosis PA05, isolated from an ovine host in Pará State, Brazil. C. pseudotuberculosis is an etiological agent of diseases with veterinary and medical importance. The genome contains 2,435,137 bp, a G+C content of 52.2%, 2,295 coding sequences, five pseudogenes, 53 tRNAs, and six rRNAs.

13.
PLoS One ; 12(1): e0170676, 2017.
Article in English | MEDLINE | ID: mdl-28125655

ABSTRACT

Seven genomes of Corynebacterium pseudotuberculosis biovar equi were sequenced on the Ion Torrent PGM platform, generating high-quality scaffolds over 2.35 Mbp. This bacterium is the causative agent of disease known as "pigeon fever" which commonly affects horses worldwide. The pangenome of biovar equi was calculated and two phylogenomic approaches were used to identify clustering patterns within Corynebacterium genus. Furthermore, other comparative analyses were performed including the prediction of genomic islands and prophages, and SNP-based phylogeny. In the phylogenomic tree, C. pseudotuberculosis was divided into two distinct clades, one formed by nitrate non-reducing species (biovar ovis) and another formed by nitrate-reducing species (biovar equi). In the latter group, the strains isolated from California were more related to each other, while the strains CIP 52.97 and 1/06-A formed the outermost clade of the biovar equi. A total of 1,355 core genes were identified, corresponding to 42.5% of the pangenome. This pangenome has one of the smallest core genomes described in the literature, suggesting a high genetic variability of biovar equi of C. pseudotuberculosis. The analysis of the similarity between the resistance islands identified a higher proximity between the strains that caused more severe infectious conditions (infection in the internal organs). Pathogenicity islands were largely conserved between strains. Several genes that modulate the pathogenicity of C. pseudotuberculosis were described including peptidases, recombination enzymes, micoside synthesis enzymes, bacteriocins with antimicrobial activity and several others. Finally, no genotypic differences were observed between the strains that caused the three different types of infection (external abscess formation, infection with abscess formation in the internal organs, and ulcerative lymphangitis). Instead, it was noted that there is a higher phenetic correlation between strains isolated at California compared to the other strains. Additionally, high variability of resistance islands suggests gene acquisition through several events of horizontal gene transfer.


Subject(s)
Corynebacterium Infections/genetics , Corynebacterium pseudotuberculosis/genetics , Genome, Bacterial/genetics , Horse Diseases/genetics , Rhodococcus equi/genetics , Animals , Corynebacterium Infections/microbiology , Corynebacterium pseudotuberculosis/pathogenicity , Genotype , High-Throughput Nucleotide Sequencing , Horse Diseases/microbiology , Horses/microbiology , Phylogeny , Polymorphism, Single Nucleotide/genetics , Rhodococcus equi/pathogenicity
14.
PLoS One ; 11(5): e0155327, 2016.
Article in English | MEDLINE | ID: mdl-27171416

ABSTRACT

The advent of NGS (Next Generation Sequencing) technologies has resulted in an exponential increase in the number of complete genomes available in biological databases. This advance has allowed the development of several computational tools enabling analyses of large amounts of data in each of the various steps, from processing and quality filtering to gap filling and manual curation. The tools developed for gap closure are very useful as they result in more complete genomes, which will influence downstream analyses of genomic plasticity and comparative genomics. However, the gap filling step remains a challenge for genome assembly, often requiring manual intervention. Here, we present GapBlaster, a graphical application to evaluate and close gaps. GapBlaster was developed via Java programming language. The software uses contigs obtained in the assembly of the genome to perform an alignment against a draft of the genome/scaffold, using BLAST or Mummer to close gaps. Then, all identified alignments of contigs that extend through the gaps in the draft sequence are presented to the user for further evaluation via the GapBlaster graphical interface. GapBlaster presents significant results compared to other similar software and has the advantage of offering a graphical interface for manual curation of the gaps. GapBlaster program, the user guide and the test datasets are freely available at https://sourceforge.net/projects/gapblaster2015/. It requires Sun JDK 8 and Blast or Mummer.


Subject(s)
Genome , High-Throughput Nucleotide Sequencing/methods , Prokaryotic Cells/metabolism , Software , Data Curation , Reference Standards
15.
Data Brief ; 5: 963-6, 2015 Dec.
Article in English | MEDLINE | ID: mdl-26702428

ABSTRACT

Corynebacterium pseudotuberculosis causes significant loss to goat and sheep farmers because it is the causal agent of the infectious disease caseous lymphadenitis, which may lead to outcomes ranging from skin injury to animal death (Ruiz et al., 2011) [1]. This bacterium was grown under osmotic (2 M), acid (pH) and heat (50 °C) stress and under control (Normal-BHI brain heart infusion) conditions, which simulate the conditions faced by the bacteria during the infectious process. Subsequently, cDNA of each condition was sequenced by the SOLiD3 Plus platform using the RNA-Seq technique [2], [3], [4]. The data produced was processed to evaluate the differential gene expression, which is helpful to understand the adaptation mechanisms during the infection in the host. The sequencing data of all conditions are available in the European Bioinformatics Institute (EBI) repository under accession number E-MTAB-2017.

16.
Gene ; 563(2): 165-71, 2015 Jun 01.
Article in English | MEDLINE | ID: mdl-25796604

ABSTRACT

BACKGROUND: With the emergence of large-scale sequencing platforms since 2005, there has been a great revolution regarding methods for decoding DNA sequences, which have also affected quantitative and qualitative gene expression analyses through the RNA-Sequencing technique. However, issues related to the amount of data required for the analyses have been considered because they affect the reliability of the experiments. Thus, RNA depletion during sample preparation may influence the results. Moreover, because data produced by these platforms show variations in quality, quality filters are often used to remove sequences likely to contain errors to increase the accuracy of the results. However, when reads of quality filters are removed, the expression profile in RNA-Seq experiments may be influenced. RESULT: The present study aimed to analyze the impact of different quality filter values for Corynebacterium pseudotuberculosis (sequenced by SOLiD platform), Microcystis aeruginosa and Kineococcus radiotolerans (sequenced by Illumina platform) RNA-Seq data. Although up to 47.9% of the reads produced by the SOLiD technology were removed after the QV20 quality filter is applied, and 15.85% were removed from K. radiotolerans data set using the QV30 filter, Illumina data showed the largest number of unique differentially expressed genes after applying the most stringent filter (QV30), with 69 genes. In contrast, for SOLiD, the acid stress condition with the QV20 filter yielded only 41 unique differentially expressed genes. Even for the highest quality M. aeruginosa data, the quality filter affected the expression profile. The most stringent quality filter generated a greater number of unique differentially expressed genes: 9 for high molecular weight dissolved organic matter condition and 12 for low P conditions. CONCLUSION: Even high-accuracy sequencing technologies are subject to the influence of quality filters when evaluating RNA-Seq data using the reference approach.


Subject(s)
RNA/genetics , Sequence Analysis, RNA/methods , Corynebacterium pseudotuberculosis/genetics , Microcystis/genetics , Reproducibility of Results
17.
Genome Announc ; 2(6)2014 Nov 13.
Article in English | MEDLINE | ID: mdl-25395628

ABSTRACT

The genome of Corynebacterium pseudotuberculosis MB20 bv. equi was sequenced using the Ion Personal Genome Machine (PGM) platform, and showed a size of 2,363,089 bp, with 2,365 coding sequences and a GC content of 52.1%. These results will serve as a basis for further studies on the pathogenicity of C. pseudotuberculosis bv. equi.

SELECTION OF CITATIONS
SEARCH DETAIL
...