Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 23
Filter
1.
Comput Biol Med ; 172: 108233, 2024 Apr.
Article in English | MEDLINE | ID: mdl-38452471

ABSTRACT

BACKGROUND: Cancer cachexia is a severe metabolic syndrome marked by skeletal muscle atrophy. A successful clinical intervention for cancer cachexia is currently lacking. The study of cachexia mechanisms is largely based on preclinical animal models and the availability of high-throughput transcriptomic datasets of cachectic mouse muscles is increasing through the extensive use of next generation sequencing technologies. METHODS: Cachectic mouse muscle transcriptomic datasets of ten different studies were combined and mined by seven attribute weighting models, which analysed both categorical variables and numerical variables. The transcriptomic signature of cancer cachexia was identified by attribute weighting algorithms and was used to evaluate the performance of eleven pattern discovery models. The signature was employed to find the best combination of drugs (drug repurposing) for developing cancer cachexia treatment strategies, as well as to evaluate currently used cachexia drugs by literature mining. RESULTS: Attribute weighting algorithms ranked 26 genes as the transcriptomic signature of muscle from mice with cancer cachexia. Deep Learning and Random Forest models performed better in differentiating cancer cachexia cases based on muscle transcriptomic data. Literature mining revealed that a combination of melatonin and infliximab has negative interactions with 2 key genes (Rorc and Fbxo32) upregulated in the transcriptomic signature of cancer cachexia in muscle. CONCLUSIONS: The integration of machine learning, meta-analysis and literature mining was found to be an efficient approach to identifying a robust transcriptomic signature for cancer cachexia, with implications for improving clinical diagnosis and management of this condition.


Subject(s)
Cachexia , Neoplasms , Animals , Mice , Cachexia/genetics , Cachexia/metabolism , Data Mining , Gene Expression Profiling , Machine Learning , Meta-Analysis as Topic , Muscle, Skeletal , Neoplasms/complications , Neoplasms/genetics , Neoplasms/metabolism
2.
Mol Neurobiol ; 2024 Jan 22.
Article in English | MEDLINE | ID: mdl-38252383

ABSTRACT

Amyotrophic lateral sclerosis (ALS) is a progressive motor neuron disease characterised by the deposition of aggregated proteins including TAR DNA-binding protein 43 (TDP-43) in vulnerable motor neurons and the brain. Extracellular vesicles (EVs) facilitate the spread of neurodegenerative diseases and can be easily accessed in the bloodstream. This study aimed to identify a panel of EV miRNAs that can capture the pathology occurring in the brain and peripheral circulation. EVs were isolated from the cortex (BDEVs) and serum (serum EVs) of 3 month-old and 6-month-old TDP-43*Q331K and TDP-43*WT mice. Following characterisation and miRNA isolation, the EVs underwent next-generation sequencing where 24 differentially packaged miRNAs were identified in the TDP-43*Q331K BDEVs and 7 in the TDP-43*Q331K serum EVs. Several miRNAs, including miR-183-5p, were linked to ALS. Additionally, miR-122-5p and miR-486b-5p were identified in both panels, demonstrating the ability of the serum EVs to capture the dysregulation occurring in the brain. This is the first study to identify miRNAs common to both the serum EVs and BDEVs in a mouse model of ALS.

3.
Front Behav Neurosci ; 17: 1257881, 2023.
Article in English | MEDLINE | ID: mdl-38094940

ABSTRACT

Calorie restriction (CR) is considered an effective intervention for anxiety, aging, and obesity. We investigated the effects of short- and long-term CR on behavior as well as transcriptome profiles in the hypothalamus, amygdala, prefrontal cortex, pituitary, and adrenal glands of Hooded Wistar and Long Evans male rats. A reduction in anxiety-like behavior, as assessed via the elevated plus maze, was observed in both short- and long-term CR. Despite this, short- and long-term CR regulated different sets of genes, leading to distinct transcriptomic signatures. The employed models were able to simultaneously analyze categorical and numerical variables, evaluating the effect of tissue type along with expression data. In all tissues, transcription factors, zinc finger protein 45-like and zinc finger BTB domain-containing two, were the top selected genes by the models in short and long-term CR treatments, respectively. Text mining identified associations between genes of the short-term CR signature and neurodegeneration, stress, and obesity and between genes of the long-term signature and the nervous system. Literature mining-based drug repurposing showed that alongside known CR mimetics such as resveratrol and rapamycin, candidates not typically associated with CR mimetics may be repurposed based on their interaction with transcriptomic signatures of CR. This study goes some way to unravelling the global effects of CR and opens new avenues for treatment for emotional disorders, neurodegeneration, and obesity.

4.
Animals (Basel) ; 13(9)2023 Apr 25.
Article in English | MEDLINE | ID: mdl-37174495

ABSTRACT

The literature has identified poor nutrition as the leading factor in the manifestation of many behavioural issues in animals, including aggression, hyperalertness, and stereotypies. Literature focused on all species of interest consistently reported that although there were no significant differences in the richness of specific bacterial taxa in the microbiota of individual subjects with abnormal behaviour (termed alpha diversity), there was variability in species diversity between these subjects compared to controls (termed beta diversity). As seen in humans with mental disorders, animals exhibiting abnormal behaviour often have an enrichment of pro-inflammatory and lactic acid-producing bacteria and a reduction in butyrate-producing bacteria. It is evident from the literature that an association exists between gut microbiota diversity (and by extension, the concurrent production of microbial metabolites) and abnormal behavioural phenotypes across various species, including pigs, dogs, and horses. Similar microbiota population changes are also evident in human mental health patients. However, there are insufficient data to identify this association as a cause or effect. This review provides testable hypotheses for future research to establish causal relationships between gut microbiota and behavioural issues in animals, offering promising potential for the development of novel therapeutic and/or preventative interventions aimed at restoring a healthy gut-brain-immune axis to mitigate behavioural issues and, in turn, improve health, performance, and production in animals.

5.
Nutrients ; 14(21)2022 Nov 04.
Article in English | MEDLINE | ID: mdl-36364936

ABSTRACT

Further examination of the molecular regulators of long-term calorie restriction (CR), reported to have an anxiolytic effect, may highlight novel therapeutic targets for anxiety disorders. Here, adult male Hooded Wistar rats were exposed to a 25% CR whilst anxiety-like behaviour was assessed at 6-, 12-, and 18-months of age via the elevated plus maze, open field, and acoustic startle tests. Next-generation sequencing was then used to measure transcriptome-wide gene expression in the hypothalamus, amygdala, pituitary, and adrenal glands. Results showed an anxiolytic behavioural profile across early, middle, and late adulthood by CR, with the strongest effects noted at 6-months. Transcriptomic analysis by seven attribute weighting algorithms, including Info Gain Ratio, Rule, Chi Squared, Gini Index, Uncertainty, Relief, and Info Gain, led to the development of a signature of long-term CR, independent of region. Complement C1q A chain (C1qa), an extracellular protein, expression was significantly decreased by CR in most regions examined. Furthermore, text mining highlighted the positive involvement of C1qa in anxiety, depression, neurodegeneration, stress, and ageing, collectively identifying a suitable biomarker candidate for CR. Overall, the current study identified anxiety-related phenotypic changes and a novel transcriptome signature of long-term CR, indicating potential therapeutic targets for anxiety, depression, and neurodegeneration.


Subject(s)
Anti-Anxiety Agents , Caloric Restriction , Rats , Animals , Male , Transcriptome , Rats, Wistar , Anxiety/genetics , Anxiety/metabolism , Aging , Brain , Adrenal Glands
6.
Mol Biol Rep ; 49(6): 4237-4246, 2022 Jun.
Article in English | MEDLINE | ID: mdl-35286517

ABSTRACT

BACKGROUND: Splice-disrupt genomic variants are one of the causes of cancer-causing errors in gene expression. Little is known about splice-disrupt genomic variants. METHODS AND RESULTS: Here, pattern of splice-disrupt variants was investigated using 21,842,764 genomic variants in different types of prostate cancer. A particular attention was paid to genomic locations of splice-disrupt variants on target genes. HLA-A in prostate cancer, MSR1 in familial prostate cancer, and EGFR in both castration-resistant prostate cancer and metastatic castration-resistant had the highest allele frequencies of splice-disrupt variations. Some splice-disrupt variants, located on coding sequences of NCOR2, PTPRC, and CRP, were solely present in the advanced metastatic castration-resistant prostate cancer. High-risk splice-disrupt variants were identified based on computationally calculated Polymorphism Phenotyping (PolyPhen), Sorting Intolerant From Tolerant (SIFT), and Genomic Evolutionary Rate Profiling (GERP) + + scores as well as the recorded clinical significance in dbSNP database of NCBI. Functional annotation of damaging splice-disrupt variants highlighted important cancer-associated functions, including endocrine resistance, lipid metabolic process, steroid metabolic process, regulation of mitotic cell cycle, and regulation of metabolic process. This is the first study that profiles the splice-disrupt genomic variants and their target genes in prostate cancer. Literature mining based variant analysis highlighted the importance of rs1800716 variant, located on the CYP2D6 gene, involved in a range of important functions, such as RNA spicing, drug interaction, death, and urotoxicity. CONCLUSIONS: This is the first study that profiles the splice-disrupt genomic variants and their target genes in different types of prostate cancer. Unravelling alternative splicing opens a new avenue towards the establishment of new diagnostic and prognostic markers for prostate cancer progression and metastasis.


Subject(s)
Prostatic Neoplasms, Castration-Resistant , Receptors, Androgen , Alternative Splicing/genetics , Genomics , Humans , Male , Prostate/metabolism , Prostatic Neoplasms, Castration-Resistant/genetics , Prostatic Neoplasms, Castration-Resistant/metabolism , Prostatic Neoplasms, Castration-Resistant/pathology , Receptors, Androgen/metabolism
7.
Inform Med Unlocked ; 27: 100805, 2021.
Article in English | MEDLINE | ID: mdl-34849394

ABSTRACT

School closures have been used as one of the main nonpharmaceutical interventions to overcome the spread of SARS-CoV-2. Different countries use this intervention with a wide range of time intervals from the date of the first confirmed case or death. This study aimed to investigate whether fast or late school closures affect the cumulative number of COVID-19 cases or deaths. A worldwide population-based observational study has been conducted and a range of attributes were weighted using 10 attribute weighting models against the normalized number of infected cases or death in the form of numeric, binominal and polynomial labels. Statistical analysis was performed for the most weighted and the most common attributes of all types of labels. By the end of March 2021, the school closure data of 198 countries with at least one COVID-19 case were available. The days before the first school closure were one of the most weighted factors in relation to the normalized number of infected cases and deaths in numeric, binomial, and quartile forms. The average of days before the first school closure in the lowest quartile to highest quartile of infected cases (Q1, Q2, Q3 and Q4) was -6.10 [95% CI, -26.5 to 14.2], 9.35 [95% CI, 2.16 to 16.53], 17.55 [95% CI, 5.95 to 29.15], and 16.00 [95% CI, 11.69 to 20.31], respectively. In addition, 188 countries reported at least one death from COVID-19. The average of the days before the first school closure in the lowest quartile of death to highest quartile (Q1, Q2, Q3 and Q4) was -49.4 [95% CI, -76.5 to -22.3], -10.34 [95% CI, -30.12 to 9.44], -18.74 [95% CI, -32.72 to -4.77], and -12.89 [95% CI, -27.84 to 2.06], respectively. Countries that closed schools faster, especially before the detection of any confirmed case or death, had fewer COVID-19 cases or deaths per million of the population on total days of involvement. It can be concluded that rapid prevention policies are the main determinants of the countries' success.

8.
World J Stem Cells ; 13(10): 1394-1416, 2021 Oct 26.
Article in English | MEDLINE | ID: mdl-34786151

ABSTRACT

Alternative ribonucleic acid (RNA) splicing can lead to the assembly of different protein isoforms with distinctive functions. The outcome of alternative splicing (AS) can result in a complete loss of function or the acquisition of new functions. There is a gap in knowledge of abnormal RNA splice variants promoting cancer stem cells (CSCs), and their prospective contribution in cancer progression. AS directly regulates the self-renewal features of stem cells (SCs) and stem-like cancer cells. Notably, octamer-binding transcription factor 4A spliced variant of octamer-binding transcription factor 4 contributes to maintaining stemness properties in both SCs and CSCs. The epithelial to mesenchymal transition pathway regulates the AS events in CSCs to maintain stemness. The alternative spliced variants of CSCs markers, including cluster of differentiation 44, aldehyde dehydrogenase, and doublecortin-like kinase, α6ß1 integrin, have pivotal roles in increasing self-renewal properties and maintaining the pluripotency of CSCs. Various splicing analysis tools are considered in this study. LeafCutter software can be considered as the best tool for differential splicing analysis and identification of the type of splicing events. Additionally, LeafCutter can be used for efficient mapping splicing quantitative trait loci. Altogether, the accumulating evidence re-enforces the fact that gene and protein expression need to be investigated in parallel with alternative splice variants.

9.
Cells ; 10(11)2021 11 12.
Article in English | MEDLINE | ID: mdl-34831362

ABSTRACT

Predicting cancer cells' response to a plant-derived agent is critical for the drug discovery process. Recently transcriptomes advancements have provided an opportunity to identify regulatory signatures to predict drug activity. Here in this study, a combination of meta-analysis and machine learning models have been used to determine regulatory signatures focusing on differentially expressed transcription factors (TFs) of herbal components on cancer cells. In order to increase the size of the dataset, six datasets were combined in a meta-analysis from studies that had evaluated the gene expression in cancer cell lines before and after herbal extract treatments. Then, categorical feature analysis based on the machine learning methods was applied to examine transcription factors in order to find the best signature/pattern capable of discriminating between control and treated groups. It was found that this integrative approach could recognize the combination of TFs as predictive biomarkers. It was observed that the random forest (RF) model produced the best combination rules, including AIP/TFE3/VGLL4/ID1 and AIP/ZNF7/DXO with the highest modulating capacity. As the RF algorithm combines the output of many trees to set up an ultimate model, its predictive rules are more accurate and reproducible than other trees. The discovered regulatory signature suggests an effective procedure to figure out the efficacy of investigational herbal compounds on particular cells in the drug discovery process.


Subject(s)
Gene Expression Profiling , Gene Expression Regulation, Neoplastic , Neoplasms/genetics , Phytochemicals/pharmacology , Algorithms , Cell Line, Tumor , Databases, Genetic , Gene Expression Regulation, Neoplastic/drug effects , Gene Ontology , Humans , Reproducibility of Results , Transcription Factors/metabolism
10.
Comput Biol Med ; 138: 104893, 2021 11.
Article in English | MEDLINE | ID: mdl-34598069

ABSTRACT

Understanding the underlying molecular mechanism of transporter activity is one of the major discussions in structural biology. A transporter can exclusively transport one ion (specific transporter) or multiple ions (general transporter). This study compared categorical and numerical features of general and specific calcium transporters using machine learning and attribute weighting models. To this end, 444 protein features, such as the frequency of dipeptides, organism, and subcellular location, were extracted for general (n = 103) and specific calcium transporters (n = 238). Aliphatic index, subcellular location, organism, Ile-Leu frequency, Glycine frequency, hydrophobic frequency, and specific dipeptides such as Ile-Leu, Phe-Val, and Tyr-Gln were the key features in differentiating general from specific calcium transporters. Calcium transporters in the cell outer membranes were specific, while the inner ones were general; additionally, when the hydrophobic frequency or Aliphatic index is increased, the calcium transporter act as a general transporter. Random Forest with accuracy criterion showed the highest accuracy (88.88% ±5.75%) and high AUC (0.964 ± 0.020), based on 5-fold cross-validation. Decision Tree with accuracy criterion was able to predict the specificity of calcium transporter irrespective of the organism and subcellular location. This study demonstrates the precise classification of transporter function based on sequence-derived physicochemical features.


Subject(s)
Machine Learning
11.
Animals (Basel) ; 11(6)2021 Jun 01.
Article in English | MEDLINE | ID: mdl-34205858

ABSTRACT

Subclinical mastitis, an economically challenging disease of dairy cattle, is associated with an increased use of antimicrobials which reduces milk quantity and quality. It is more common than clinical mastitis and far more difficult to detect. Recently, much attention has been paid to the development of machine-learning expert systems for early detection of subclinical mastitis from milking features. However, differences between animals within a farm as well as between farms, particularly across multiple years, are major obstacles to the generalisation of machine learning models. Here, for the first time, we integrated scaling by quartiling with classification based on associations in a multi-year study to deal with farm heterogeneity by discovery of multiple patterns towards mastitis. The data were obtained from one farm comprising Holstein Friesian cows in Ongaonga, New Zealand, using an electronic automated monitoring system. The data collection was repeated annually over 3 consecutive years. Some discovered rules, such as when the milking peak flow is low, electrical conductivity (EC) of milk is low, milk lactose is low, milk fat is high, and milk volume is low, the cow has subclinical mastitis, reached high confidence (>70%) in multiple years. On averages, over 3 years, low level of milk lactose and high value of milk EC were part of 93% and 83.8% of all subclinical mastitis detecting rules, offering a reproducible pattern of subclinical mastitis detection. The scaled year-independent combinational rules provide an easy-to-apply and cost-effective machine-learning expert system for early detection of hidden mastitis using milking parameters.

12.
Cells ; 10(2)2021 02 04.
Article in English | MEDLINE | ID: mdl-33557205

ABSTRACT

Our knowledge of the evolution and the role of untranslated region (UTR) in SARS-CoV-2 pathogenicity is very limited. Leader sequence, originated from UTR, is found at the 5' ends of all encoded SARS-CoV-2 transcripts, highlighting its importance. Here, evolution of leader sequence was compared between human pathogenic and non-pathogenic coronaviruses. Then, profiling of microRNAs that can inactivate the key UTR regions of coronaviruses was carried out. A distinguished pattern of evolution in leader sequence of SARS-CoV-2 was found. Mining all available microRNA families against leader sequences of coronaviruses resulted in discovery of 39 microRNAs with a stable thermodynamic binding energy. Notably, SARS-CoV-2 had a lower binding stability against microRNAs. hsa-MIR-5004-3p was the only human microRNA able to target the leader sequence of SARS and to a lesser extent, also SARS-CoV-2. However, its binding stability decreased remarkably in SARS-COV-2. We found some plant microRNAs with low and stable binding energy against SARS-COV-2. Meta-analysis documented a significant (p < 0.01) decline in the expression of MIR-5004-3p after SARS-COV-2 infection in trachea, lung biopsy, and bronchial organoids as well as lung-derived Calu-3 and A549 cells. The paucity of the innate human inhibitory microRNAs to bind to leader sequence of SARS-CoV-2 can contribute to its high replication in infected human cells.


Subject(s)
5' Untranslated Regions , COVID-19/virology , MicroRNAs/genetics , SARS-CoV-2/genetics , Virus Replication , Animals , Computational Biology , Evolution, Molecular , Genome, Viral , Humans , MicroRNAs/pharmacology , Nucleic Acid Conformation , RNA, Plant/pharmacology , SARS-CoV-2/physiology
13.
Vet Microbiol ; 245: 108685, 2020 Jun.
Article in English | MEDLINE | ID: mdl-32456818

ABSTRACT

Extraintestinal pathogenic Escherichia coli (ExPEC) can cause urinary tract and other types of infection in cats, but the relationship of cat ExPEC to human ExPEC remains equivocal. This study investigated the prevalence of ExPEC-associated sequence types (STs) from phylogenetic group B2 among fluoroquinolone-susceptible cat clinical isolates. For this, 323 fluoroquinolone-susceptible cat clinical E. coli isolates from Australia underwent PCR-based phylotyping and random amplified polymorphic DNA analysis to determine clonal relatedness. Of the 274 group B2 isolates, 53 underwent whole genome sequencing (WGS), whereas 221 underwent PCR-based screening for (group B2) sequence type complexes (STc) STc12, STc73, ST131, and STc372. Group B2 was the dominant phylogenetic group (274/323, 85 %), whereas within group B2 ST73 dominated, according to both WGS (43 % of 53; followed by ST127, ST12, and ST372 [4/53, 8 % each]) and ST-specific PCR (20 % of 221). In WGS-based comparisons of cat and reference human ST73 isolates, cat isolates had a relatively conserved virulence gene profile but were phylogenetically diverse. Although in the phylogram most cat and human ST73 isolates occupied host species-specific clusters within serotype-specific clades (O2:H1, O6:H1, O25:H1, O50/O2:H1), cat and human isolates were intermingled within two serotype-specific clades: O120:H31 (3 cat and 2 human isolates) and O22:H1 (3 cat and 5 human isolates). These findings confirm the importance of human-associated group B2 lineages as a cause of urinary tract infections in cats. The close genetic relationship of some cat and human ST73 strains suggests bi-directional transmission may be possible.


Subject(s)
Anti-Bacterial Agents/pharmacology , Cat Diseases/microbiology , Escherichia coli Infections/veterinary , Extraintestinal Pathogenic Escherichia coli/classification , Extraintestinal Pathogenic Escherichia coli/drug effects , Fluoroquinolones/pharmacology , Animals , Bacteremia/microbiology , Cats/microbiology , Genomics , Genotype , Humans , Phylogeny , Urinary Tract Infections/microbiology , Urinary Tract Infections/veterinary , Virulence/genetics , Whole Genome Sequencing
14.
Andrologia ; 52(1): e13453, 2020 Feb.
Article in English | MEDLINE | ID: mdl-31762071

ABSTRACT

miRNAs (MicroRNAs), known as noncoding and important endogenous factors regulating the expression protein-coding genes, are vital regulators in each biological process. Thus, this study aims to explore the key role of four microRNAs in regulating the spermatogenesis. To conduct this experiment, 55 infertile and fertile men provided the study with the sperm and testicular tissue samples. To study the spermatozoa in terms of the morphology, Diff-Quick was applied. Then, quantitative real-time polymerase chain reaction (RT-PCR) was conducted on samples. Our data indicated that in contrast to the miR-15b, significant increasing of miR-383 and miR-122 occurred in both severe oligoasthenoteratozoospermia (SOAT) and moderate oligoasthenoteratozoospermia (MOAT) compared to normal sperm group (N). In addition, it was observed that miR-15b and miR-122 increased in patients with nonobstructive azoospermia (NOA) compared with obstructive azoospermia (OA) group. Expression levels of target genes including P53, CASPASE-9 and CYCLIN D1 underwent principle changes according to miRNAs expression level. Our finding indicated that miRNAs had essential role in the regulation of spermatogenesis, and their expression altering was associated with sperm abnormalities. Thus, microRNAs can be introduced as useful biomarkers to determine male infertility reasons to choose the effective treatment.


Subject(s)
Azoospermia/diagnosis , Gene Regulatory Networks , MicroRNAs/metabolism , Oligospermia/diagnosis , Spermatogenesis/genetics , Adult , Azoospermia/genetics , Biomarkers/analysis , Biomarkers/metabolism , Caspase 9/genetics , Cyclin D1/genetics , Gene Expression Profiling , Gene Expression Regulation , Humans , Male , MicroRNAs/analysis , Oligospermia/genetics , Spermatozoa/metabolism , Tumor Suppressor Protein p53/genetics , Young Adult
15.
Comput Biol Med ; 114: 103456, 2019 11.
Article in English | MEDLINE | ID: mdl-31605926

ABSTRACT

Sub-clinical bovine mastitis decreases milk quality and production. Moreover, sub-clinical mastitis leads to the use of antibiotics with consequent increased risk of the emergence of antibiotic-resistant bacteria. Therefore, early detection of infected cows is of great importance. The Somatic Cell Count (SCC) day-test used for mastitis surveillance, gives data that fluctuate widely between days, creating questions about its reliability and early prediction power. The recent identification of risk parameters of sub-clinical mastitis based on milking parameters by machine learning models is emerging as a promising new tool to enhance early prediction of mastitis occurrence. To develop the optimal approach for early sub-clinical mastitis prediction, we implemented 2 steps: (1) Finding the best statistical models to accurately link patterns of risk factors to sub-clinical mastitis, and (2) Extending this application from the farms tested to new farms (method generalization). Herein, we applied various machine learning-based prediction systems on a big milking dataset to uncover the best predictive models of sub-clinical mastitis. Data from 364,249 milking instances were collected by an electronic automated in-line monitoring system where milk volume, lactose concentration, electrical conductivity (EC), protein concentration, peak flow and milking time for each sample were measured. To provide a platform for the application of the models developed to other farms, the Z transformation approach was employed. Following this, various prediction systems [Deep Learning (DL), Naïve Bayes, Generalized Liner Model, Logistic Regression, Decision Tree, Gradient-Boosted Tree (GBT) and Random Forest] were applied to the non-transformed milking dataset and to a Z-standardized dataset. ROC (Receiver Operating Characteristics Curve), AUC (Area Under The Curve), and high accuracy demonstrated the high sensitivity of GBT and DL in detecting sub-clinical mastitis. GBT was the most accurate model (accuracy of 84.9%) in prediction of sub-clinical bovine mastitis. These data demonstrate how these models could be applied for prediction of sub-clinical mastitis in multiple bovine herds regardless of the size and sampling techniques.


Subject(s)
Asymptomatic Infections , Deep Learning , Diagnosis, Computer-Assisted/methods , Mastitis, Bovine/diagnosis , Animals , Cattle , Decision Trees , Early Diagnosis , Female , Milk/chemistry
16.
Mol Carcinog ; 58(6): 862-874, 2019 06.
Article in English | MEDLINE | ID: mdl-30644608

ABSTRACT

A considerable number of deposited variants has provided new possibilities for knowledge discovery in different types of prostate cancer. Here, we analyzed variants located on 3'UTR, 5'UTR, CDs, Intergenic, and Intronic regions in castration-resistant prostate cancer (8496 variants), familial prostate cancer (3241 variants), metastatic castration-resistant prostate cancer (3693 variants), and prostate cancer (16599 variants). Chromosome regions 10p15-p14 and 2p13 were highly enriched (P < 0.00001) for variants located in 3'UTR, 5'UTR, CDs, intergenic, and intronic regions in castration-resistant prostate cancer. In contrast, 10p15-p14, 10q23.3, 12q13.11, 13q12.3, 1q25, and 8p22 regions were enriched (P < 0.001) in familial prostate cancer. In metastatic castration-resistant prostate cancer, 10p15-p14, 10q23.3, 11q22-q23, 14q21.1, and 14q32.13 were highly variant regions (P < 0.001). Chromosome 2 and chromosome 1 hosted many enriched variant regions. AKR1C3, BRCA1, BRCA2, CHGA, CYP19A1, HOXB13, KLK3, and PTEN contained the highest number of 3'UTR, 5'UTR, CDs, Intergenic, and Intronic variants. Network analysis showed that these genes are upstream of important functions including prostate gland development, tumor recurrence, prostate cancer-specific survival, tumor progression, cancer mortality, long-term survival, cancer recurrence, angiogenesis, and AR. Interestingly, all of EGFR, JAK2, NR3C1, PDZD2, and SEMA3C genes had single nucleotide polymorphisms (SNP) in castration-resistant prostate cancer, consistent with high selection pressure on these genes during drug treatment and consequent resistance. High occurrence of variants in 3'UTRs suggests the importance of regulatory variants in different types of prostate cancer; an area that has been neglected compared with coding variants. This study provides a comprehensive overview of genomic regions contributing to different types of prostate cancer.


Subject(s)
Chromosomes/genetics , Gene Regulatory Networks , Genetic Variation , Prostatic Neoplasms/diagnosis , 3' Untranslated Regions , 5' Untranslated Regions , Diagnosis, Differential , Humans , Male , Neoplasm Metastasis/diagnosis , Neoplasm Metastasis/genetics , Open Reading Frames , Prostatic Neoplasms/genetics , Prostatic Neoplasms, Castration-Resistant/diagnosis , Prostatic Neoplasms, Castration-Resistant/genetics
17.
Gene ; 691: 114-124, 2019 Apr 05.
Article in English | MEDLINE | ID: mdl-30620887

ABSTRACT

Biosynthesis of secondary metabolites in plant is a complex process, regulated by many genes and influenced by several factors. In recent years, the next-generation sequencing (NGS) technology and advanced statistical analysis such as meta-analysis and computational systems biology have provided novel opportunities to overcome biological complexity. Here, we performed a meta-analysis on publicly available transcriptome datasets of twelve economically significant medicinal plants to identify differentially expressed genes (DEGs) between shoot and root tissues and to find the key molecular features which may be effective in the biosynthesis of secondary metabolites. Meta-analysis identified a total of 880 genes with differential expression between two tissues. Functional enrichment and KEGG pathway analysis indicated that the functions of those DEGs are highly associated with the developmental process, starch metabolic process, response to stimulus, porphyrin and chlorophyll metabolism, biosynthesis of secondary metabolites and phenylalanine metabolism. In addition, systems biology analysis of the DEGs was applied to find protein-protein interaction network and discovery of significant modules. The detected modules were associated with hormone signal transduction, transcription repressor activity, response to light stimulus and epigenetic processes. Finally, analysis was extended to search for putative miRNAs that are associated with DEGs. A total of 31 miRNAs were detected which belonged to 16 conserved families. The present study provides a comprehensive view to better understand the tissue-specific expression of genes and mechanisms involved in secondary metabolites synthesis and may provide candidate genes for future researches to improve yield of secondary metabolites.


Subject(s)
Gene Expression Profiling/methods , Genetic Markers , Plant Proteins/genetics , Plants, Medicinal/genetics , Gene Expression Regulation, Plant , Gene Regulatory Networks , MicroRNAs/genetics , Organ Specificity , Protein Interaction Maps , Secondary Metabolism , Sequence Analysis, RNA , Systems Biology
18.
Front Plant Sci ; 9: 1550, 2018.
Article in English | MEDLINE | ID: mdl-30483277

ABSTRACT

Plant root symbiosis with Arbuscular mycorrhizal (AM) fungi improves uptake of water and mineral nutrients, improving plant development under stressful conditions. Unraveling the unified transcriptomic signature of a successful colonization provides a better understanding of symbiosis. We developed a framework for finding the transcriptomic signature of Arbuscular mycorrhiza colonization and its regulating transcription factors in roots of Medicago truncatula. Expression profiles of roots in response to AM species were collected from four separate studies and were combined by direct merging meta-analysis. Batch effect, the major concern in expression meta-analysis, was reduced by three normalization steps: Robust Multi-array Average algorithm, Z-standardization, and quartiling normalization. Then, expression profile of 33685 genes in 18 root samples of Medicago as numerical features, as well as study ID and Arbuscular mycorrhiza type as categorical features, were mined by seven models: RELIEF, UNCERTAINTY, GINI INDEX, Chi Squared, RULE, INFO GAIN, and INFO GAIN RATIO. In total, 73 genes selected by machine learning models were up-regulated in response to AM (Z-value difference > 0.5). Feature weighting models also documented that this signature is independent from study (batch) effect. The AM inoculation signature obtained was able to differentiate efficiently between AM inoculated and non-inoculated samples. The AP2 domain class transcription factor, GRAS family transcription factors, and cyclin-dependent kinase were among the highly expressed meta-genes identified in the signature. We found high correspondence between the AM colonization signature obtained in this study and independent RNA-seq experiments on AM colonization, validating the repeatability of the colonization signature. Promoter analysis of upregulated genes in the transcriptomic signature led to the key regulators of AM colonization, including the essential transcription factors for endosymbiosis establishment and development such as NF-YA factors. The approach developed in this study offers three distinct novel features: (I) it improves direct merging meta-analysis by integrating supervised machine learning models and normalization steps to reduce study-specific batch effects; (II) seven attribute weighting models assessed the suitability of each gene for the transcriptomic signature which contributes to robustness of the signature (III) the approach is justifiable, easy to apply, and useful in practice. Our integrative framework of meta-analysis, promoter analysis, and machine learning provides a foundation to reveal the transcriptomic signature and regulatory circuits governing Arbuscular mycorrhizal symbiosis and is transferable to the other biological settings.

19.
Gene ; 659: 29-36, 2018 Jun 15.
Article in English | MEDLINE | ID: mdl-29555200

ABSTRACT

Exponentially growing scientific knowledge in scientific publications has resulted in the emergence of a new interdisciplinary science of literature mining. In text mining, the machine reads the published literature and transfers the discovered knowledge to mathematical-like formulas. In an integrative approach in this study, we used text mining in combination with network discovery, pathway analysis, and enrichment analysis of genomic regions for better understanding of biomarkers in lung cancer. Particular attention was paid to non-coding biomarkers. In total, 60 MicroRNA biomarkers were reported for lung cancer, including some prognostic biomarkers. MIR21, MIR155, MALAT1, and MIR31 were the top non-coding RNA biomarkers of lung cancer. Text mining identified 447 proteins which have been studied as biomarkers in lung cancer. EGFR (receptor), TP53 (transcription factor), KRAS, CDKN2A, ENO2, KRT19, RASSF1, GRP (ligand), SHOX2 (transcription factor), and ERBB2 (receptor) were the most studied proteins. Within small molecules, thymosin-a1, oestrogen, and 8-OHdG have received more attention. We found some chromosomal bands, such as 7q32.2, 18q12.1, 6p12, 11p15.5, and 3p21.3 that are highly involved in deriving lung cancer biomarkers.


Subject(s)
Biomarkers, Tumor/genetics , Lung Neoplasms/genetics , Systems Biology/methods , Cell Line, Tumor , Data Mining , Gene Regulatory Networks , Humans , RNA, Untranslated/genetics
20.
BMC Genomics ; 17(1): 925, 2016 11 16.
Article in English | MEDLINE | ID: mdl-27852224

ABSTRACT

BACKGROUND: Recent (2013 and 2009) zoonotic transmission of avian or porcine influenza to humans highlights an increase in host range by evading species barriers. Gene reassortment or antigenic shift between viruses from two or more hosts can generate a new life-threatening virus when the new shuffled virus is no longer recognized by antibodies existing within human populations. There is no large scale study to help understand the underlying mechanisms of host transmission. Furthermore, there is no clear understanding of how different segments of the influenza genome contribute in the final determination of host range. METHODS: To obtain insight into the rules underpinning host range determination, various supervised machine learning algorithms were employed to mine reassortment changes in different viral segments in a range of hosts. Our multi-host dataset contained whole segments of 674 influenza strains organized into three host categories: avian, human, and swine. Some of the sequences were assigned to multiple hosts. In point of fact, the datasets are a form of multi-labeled dataset and we utilized a multi-label learning method to identify discriminative sequence sites. Then algorithms such as CBA, Ripper, and decision tree were applied to extract informative and descriptive association rules for each viral protein segment. RESULT: We found informative rules in all segments that are common within the same host class but varied between different hosts. For example, for infection of an avian host, HA14V and NS1230S were the most important discriminative and combinatorial positions. CONCLUSION: Host range identification is facilitated by high support combined rules in this study. Our major goal was to detect discriminative genomic positions that were able to identify multi host viruses, because such viruses are likely to cause pandemic or disastrous epidemics.


Subject(s)
Genome, Viral , Influenza A virus/genetics , Orthomyxoviridae Infections/transmission , Algorithms , Animals , Birds , Hemagglutinin Glycoproteins, Influenza Virus/genetics , Hemagglutinin Glycoproteins, Influenza Virus/metabolism , Host Specificity , Humans , Influenza A virus/isolation & purification , Influenza in Birds/genetics , Influenza in Birds/pathology , Influenza in Birds/transmission , Orthomyxoviridae Infections/pathology , Orthomyxoviridae Infections/virology , Swine , Viral Proteins/genetics , Viral Proteins/metabolism , Virus Internalization , Zoonoses/transmission
SELECTION OF CITATIONS
SEARCH DETAIL
...