Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 18 de 18
Filtrar
1.
Sci Rep ; 13(1): 22472, 2023 12 18.
Artigo em Inglês | MEDLINE | ID: mdl-38110715

RESUMO

Hepatocellular carcinoma (HCC) is a prevalent malignancy and there is a lack of effective biomarkers for HCC diagnosis. Living organisms are complex, and different omics molecules interact with each other to implement various biological functions. Genomics and metabolomics, which are the top and bottom of systems biology, play an important role in HCC clinical management. Fatty acid metabolism is associated with malignancy, prognosis, and immune phenotype in cancer, which is a potential hallmark in malignant tumors. In this study, the genes and metabolites related to fatty acid metabolism were thoroughly investigated by a dynamic network construction algorithm named EWS-DDA for the early diagnosis and prognosis of HCC. Three gene ratios and eight metabolite ratios were identified by EWS-DDA as potential biomarkers for HCC clinical management. Further analysis using biological analysis, statistical analysis and document validation in the discovery and validation sets suggested that the selected potential biomarkers had great clinical prognostic value and helped to achieve effective early diagnosis of HCC. Experimental results suggested that in-depth evaluation of fatty acid metabolism from different omics viewpoints can facilitate the further understanding of pathological alterations associated with HCC characteristics, improving the performance of early diagnosis and clinical prognosis.


Assuntos
Carcinoma Hepatocelular , Neoplasias Hepáticas , Humanos , Carcinoma Hepatocelular/diagnóstico , Carcinoma Hepatocelular/genética , Carcinoma Hepatocelular/metabolismo , Neoplasias Hepáticas/diagnóstico , Neoplasias Hepáticas/genética , Neoplasias Hepáticas/metabolismo , Multiômica , Biomarcadores Tumorais/genética , Prognóstico , Ácidos Graxos
2.
Brief Bioinform ; 25(1)2023 11 22.
Artigo em Inglês | MEDLINE | ID: mdl-38018913

RESUMO

MicroRNAs (miRNAs) play important roles in the occurrence and development of diseases. However, it is still challenging to identify the effective miRNA biomarkers for improving the disease diagnosis and prognosis. In this study, we proposed the miRNA data analysis method based on multi-view miRNA networks and reinforcement learning, miRMarker, to define the potential miRNA disease biomarkers. miRMarker constructs the cooperative regulation network and functional similarity network based on the expression data and known miRNA-disease relations, respectively. The cooperative regulation of miRNAs was evaluated by measuring the changes of relative expression. Natural language processing was introduced for calculating the miRNA functional similarity. Then, miRMarker integrates the multi-view miRNA networks and defines the informative miRNA modules through a reinforcement learning strategy. We compared miRMarker with eight efficient data analysis methods on nine transcriptomics datasets to show its superiority in disease sample discrimination. The comparison results suggested that miRMarker outperformed other data analysis methods in receiver operating characteristic analysis. Furthermore, the defined miRNA modules of miRMarker on colorectal cancer data not only show the excellent performance of cancer sample discrimination but also play significant roles in the cancer-related pathway disturbances. The experimental results indicate that miRMarker can build the robust miRNA interaction network by integrating the multi-view networks. Besides, exploring the miRNA interaction network using reinforcement learning favors defining the important miRNA modules. In summary, miRMarker can be a hopeful tool in biomarker identification for human diseases.


Assuntos
MicroRNAs , Neoplasias , Humanos , MicroRNAs/genética , MicroRNAs/metabolismo , Biologia Computacional/métodos , Biomarcadores , Perfilação da Expressão Gênica , Redes Reguladoras de Genes , Neoplasias/diagnóstico , Neoplasias/genética
3.
Comput Biol Med ; 164: 107252, 2023 09.
Artigo em Inglês | MEDLINE | ID: mdl-37454504

RESUMO

Effective biomarker identification and accurate sample label prediction are still challenging for complex diseases. Patient similarity network (PSN) analysis is a powerful tool in disease omics data analysis. The topology of PSN can reflect the discriminative ability of the corresponding feature space on which the sample network is built. In this study, a novel omics data analysis method based on the sample reference network (DA-SRN) is proposed to identify the potential biomarkers and predict the sample categories. DA-SRN defines the informative features and the sample reference network in optimizing the network structure by genetic algorithm. It labels the samples based on the graph neural network, the reference network and the selected informative features. DA-SRN was compared with nine efficient omics data analysis methods on the genomics, metabolomics and transcriptomics datasets to show its validation. The comparison results showed that it outperformed the other methods in area under receiver operating characteristic curve (AUROC), sensitivity, specificity and area under precision-recall curve (AUPRC) in most cases. Besides, the important metabolites identified by DA-SRN for the type 2 diabetes (T2D) metabolomics data were further examined. The pathway analysis revealed the close relationships between the identified metabolites and the critical metabolic pathways related to the occurrence and development of T2D. The experimental results illustrate that DA-SRN can extract the valuable information from the complex omics data by analyzing the sample relationship, and is promising in biomarker identification and sample discrimination for complex diseases.


Assuntos
Diabetes Mellitus Tipo 2 , Humanos , Diabetes Mellitus Tipo 2/genética , Metabolômica/métodos , Genômica , Biomarcadores/análise , Redes Neurais de Computação
4.
J Lipid Res ; 64(6): 100378, 2023 06.
Artigo em Inglês | MEDLINE | ID: mdl-37087100

RESUMO

Reliability, robustness, and interlaboratory comparability of quantitative measurements is critical for clinical lipidomics studies. Lipids' different ex vivo stability in blood bears the risk of misinterpretation of data. Clear recommendations for the process of blood sample collection are required. We studied by UHPLC-high resolution mass spectrometry, as part of the "Preanalytics interest group" of the International Lipidomics Society, the stability of 417 lipid species in EDTA whole blood after exposure to either 4°C, 21°C, or 30°C at six different time points (0.5 h-24 h) to cover common daily routine conditions in clinical settings. In total, >800 samples were analyzed. 325 and 288 robust lipid species resisted 24 h exposure of EDTA whole blood to 21°C or 30°C, respectively. Most significant instabilities were detected for FA, LPE, and LPC. Based on our data, we recommend cooling whole blood at once and permanent. Plasma should be separated within 4 h, unless the focus is solely on robust lipids. Lists are provided to check the ex vivo (in)stability of distinct lipids and potential biomarkers of interest in whole blood. To conclude, our results contribute to the international efforts towards reliable and comparable clinical lipidomics data paving the way to the proper diagnostic application of distinct lipid patterns or lipid profiles in the future.


Assuntos
Lipidômica , Lipídeos , Lipidômica/métodos , Lipídeos/química , Ácido Edético , Reprodutibilidade dos Testes , Espectrometria de Massas/métodos
5.
Artigo em Inglês | MEDLINE | ID: mdl-35594220

RESUMO

During the development of complex diseases, there is a critical transition from one status to another at a tipping point, which can be an early indicator of disease deterioration. To effectively enhance the performance of early risk identification, a novel dynamic network construction algorithm for identifying early warning signals based on a data-driven approach (EWS-DDA) was proposed. In EWS-DDA, the shrunken centroid was introduced to measure dynamic expression changes in assumed pathway reactions during the progression of complex disease for network construction and to define early warning signals by means of a data-driven approach. We applied EWS-DDA to perform a comprehensive analysis of gene expression profiles of gastric cancer (GC) from The Cancer Genome Atlas database and the Gene Expression Omnibus database. Six crucial genes were selected as potential biomarkers for the early diagnosis of GC. The experimental results of statistical analysis and biological analysis suggested that the six genes play important roles in GC occurrence and development. Then, EWS-DDA was compared with other state-of-the-art network methods to validate its performance. The theoretical analysis and comparison results suggested that EWS-DDA has great potential for a more complete presentation of disease deterioration and effective extraction of early warning information.


Assuntos
Neoplasias Gástricas , Humanos , Neoplasias Gástricas/diagnóstico , Neoplasias Gástricas/genética , Detecção Precoce de Câncer , Biomarcadores/metabolismo , Bases de Dados Factuais , Algoritmos
6.
J Bioinform Comput Biol ; 20(6): 2250027, 2022 12.
Artigo em Inglês | MEDLINE | ID: mdl-36573886

RESUMO

Lung adenocarcinoma (LUAD) seriously threatens human health and generally results from dysfunction of relevant module molecules, which dynamically change with time and conditions, rather than that of an individual molecule. In this study, a novel network construction algorithm for identifying early warning network signals (IEWNS) is proposed for improving the performance of LUAD early diagnosis. To this end, we theoretically derived a dynamic criterion, namely, the relationship of variation (RV), to construct dynamic networks. RV infers correlation [Formula: see text] statistics to measure dynamic changes in molecular relationships during the process of disease development. Based on the dynamic networks constructed by IEWNS, network warning signals used to represent the occurrence of LUAD deterioration can be defined without human intervention. IEWNS was employed to perform a comprehensive analysis of gene expression profiles of LUAD from The Cancer Genome Atlas (TCGA) database and the Gene Expression Omnibus (GEO) database. The experimental results suggest that the potential biomarkers selected by IEWNS can facilitate a better understanding of pathogenetic mechanisms and help to achieve effective early diagnosis of LUAD. In conclusion, IEWNS provides novel insight into the initiation and progression of LUAD and helps to define prospective biomarkers for assessing disease deterioration.


Assuntos
Adenocarcinoma de Pulmão , Neoplasias Pulmonares , Humanos , Neoplasias Pulmonares/diagnóstico , Neoplasias Pulmonares/genética , Detecção Precoce de Câncer , Adenocarcinoma de Pulmão/diagnóstico , Adenocarcinoma de Pulmão/genética , Adenocarcinoma de Pulmão/patologia , Biomarcadores , Transcriptoma , Regulação Neoplásica da Expressão Gênica , Biomarcadores Tumorais/genética
7.
J Biomed Inform ; 128: 104048, 2022 04.
Artigo em Inglês | MEDLINE | ID: mdl-35248795

RESUMO

The occurrence and development of diseases are related to the dysfunction of biomolecules (genes, metabolites, etc.) and the changes of molecule interactions. Identifying the key molecules related to the physiological and pathological changes of organisms from omics data is of great significance for disease diagnosis, early warning and drug-target prediction, etc. A novel feature selection algorithm based on the feature individual distinguishing ability and feature influence in the biological network (FS-DANI) is proposed for defining important biomolecules (features) to discriminate different disease conditions. The feature individual distinguishing ability is evaluated based on the overlapping area of the feature effective ranges in different classes. FS-DANI measures the feature network influence based on the module importance in the correlation network and the feature centrality in the modules. The feature comprehensive weight is obtained by combining the feature individual distinguishing ability and feature influence in the network. Then crucial feature subset is determined by the sequential forward search (SFS) on the feature list sorted according to the comprehensive weights of features. FS-DANI is compared with the six efficient feature selection methods on ten public omics datasets. The ablation experiment is also conducted. Experimental results show that FS-DANI is better than the compared algorithms in accuracy, sensitivity and specificity on the whole. On analyzing the gastric cancer miRNA expression data, FS-DANI identified two miRNAs (hsa-miR-18a* and hsa-miR-381), whose AUCs for distinguishing gastric cancer samples and normal samples are 0.959 and 0.879 in the discovery set and an independent validation set, respectively. Hence, evaluating biomolecules from the molecular level and network level is helpful for identifying the potential disease biomarkers of high performance.


Assuntos
Algoritmos , Área Sob a Curva
8.
Artigo em Inglês | MEDLINE | ID: mdl-32894721

RESUMO

Different biomarker patterns, such as those of molecular biomarkers and ratio biomarkers, have their own merits in clinical applications. In this study, a novel machine learning method used in biomedical data analysis for constructing classification models by combining different biomarker patterns (CDBP)is proposed. CDBP uses relative expression reversals to measure the discriminative ability of different biomarker patterns, and selects the pattern with the higher score for classifier construction. The decision boundary of CDBP can be characterized in simple and biologically meaningful manners. The CDBP method was compared with eight state-of-the-art methods on eight gene expression datasets to test its performance. CDBP, with fewer features or ratio features, had the highest classification performance. Subsequently, CDBP was employed to extract crucial diagnostic information from a rat hepatocarcinogenesis metabolomics dataset. The potential biomarkers selected by CDBP provided better classification of hepatocellular carcinoma (HCC)and non-HCC stages than previous works in the animal model. The statistical analyses of these potential biomarkers in an independent human dataset confirmed their discriminative abilities of different liver diseases. These experimental results highlight the potential of CDBP for biomarker identification from high-dimensional biomedical datasets and demonstrate that it can be a useful tool for disease classification.


Assuntos
Carcinoma Hepatocelular , Neoplasias Hepáticas , Animais , Biomarcadores , Carcinoma Hepatocelular/diagnóstico , Carcinoma Hepatocelular/genética , Neoplasias Hepáticas/diagnóstico , Neoplasias Hepáticas/genética , Aprendizado de Máquina , Metabolômica , Ratos
9.
Anal Bioanal Chem ; 414(1): 235-250, 2022 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-34951658

RESUMO

Omics mainly includes genomics, epigenomics, transcriptomics, proteomics and metabolomics. The rapid development of omics technology has opened up new ways to study disease diagnosis and prognosis and to define prospective information of complex diseases. Since omics data are usually large and complex, the method used to analyze the data and to define important information is crucial in omics study. In this review, we focus on advances in biomarker discovery methods based on omics data in the last decade, and categorize them as individual feature analysis, combinatorial feature analysis and network analysis. We also discuss the challenges and perspectives in this field.


Assuntos
Biomarcadores , Epigenômica/métodos , Genômica/métodos , Metabolômica/métodos , Proteômica/métodos , Humanos
10.
Comput Biol Chem ; 93: 107539, 2021 Aug.
Artigo em Inglês | MEDLINE | ID: mdl-34246891

RESUMO

BACKGROUND: Uremia is a worldwide epidemic disease and poses a serious threat to human health. Both maintenance hemodialysis (HD) and maintenance high flux hemodialysis (HFD) are common treatments for uremia and are generally used in clinical applications. In-depth exploration of patients' metabolic responses to different dialysis patterns can facilitate the understanding of pathological alterations associated with uremia and the effects of different dialysis methods on uremia, which may be used for future personalized therapy. However, due to variations of multiple factors (i.e., genetic, epigenetic and environment) in the process of disease treatments, identification of the similarities and differences in plasma metabolite changes in uremic patients in response to HD and HFD remains challenging. METHODS: In this study, a computational strategy for metabolic network construction based on the overlapping ratio (MNC-OR) was proposed for disease treatment effect research. In MNC-OR, the overlapping ratio was introduced to measure metabolic reactions and to construct metabolic networks for analysis of different treatment options. Then, MNC-OR was employed to analyze HD-pattern-dependent changes in plasma metabolites to explore the pathological alterations associated with uremia and the effectiveness of different dialysis patterns (i.e., HD and HFD) on uremia. Based on the networks constructed by MNC-OR, two network analysis techniques, namely, similarity analysis and difference analysis of network topology, were used to find the similarity and differences in metabolic signals in patients under treatment with either HD or HFD, which can facilitate the understanding of pathological alterations associated with uremia and provide the guidance for personalized dialysis therapy. RESULTS: Similarity analysis of network topology suggested that abnormal energy metabolism, gut metabolism and pyrimidine metabolism might occur in uremic patients, and maintenance of both HFD and HD therapies have beneficial effects on uremia. Then, difference analysis of network topology was employed to extract the crucial information related to HD-pattern-dependent changes in plasma metabolites. Experimental results indicated that the amino acid metabolism was closer to the normal status in HFD-treated patients; however, in HD-treated patients, the ability of antioxidation showed greater reduction, and the protein O-GlcNAcylation level was higher. Our findings demonstrate the potential of MNC-OR for explaining the metabolic similarities and differences of patients in response to different dialysis methods, thereby contributing to the guidance of personalized dialysis therapy.


Assuntos
Biologia Computacional , Redes e Vias Metabólicas , Diálise Renal , Uremia/metabolismo , Humanos , Uremia/diagnóstico
11.
J Biomed Inform ; 118: 103796, 2021 06.
Artigo em Inglês | MEDLINE | ID: mdl-33932596

RESUMO

Individual variation in genetic and environmental factors can cause the differences in metabolic phenotypes, which may have an effect on drug responses of patients. Deep exploration of patients' responses to therapeutic agents is a crucial and urgent event in the personalized treatment study. Using machine learning methods for the discovery of suitability evaluation biomarkers can provide deep insight into the mechanism of disease therapy and facilitate the development of personalized medicine. To find important metabolic network signals for the prediction of patients' drug responses, a novel method referred to as differential metabolic network construction (DMNC) was proposed. In DMNC, concentration changes in metabolite ratios between different pathological states are measured to construct differential metabolic networks, which can be used to advance clinical decision-making. In this study, DMNC was applied to characterize type 2 diabetes mellitus (T2DM) patients' responses against gliclazide modified-release (MR) therapy. Two T2DM metabolomics datasets from different batches of subjects treated by gliclazide MR were analyzed in depth. A network biomarker was defined to assess the patients' suitability for gliclazide MR. It can be effective in the prediction of significant responders from nonsignificant responders, achieving area under the curve values of 0.893 and 1.000 for the discovery and validation sets, respectively. Compared with the metabolites selected by the other methods, the network biomarker selected by DMNC was more stable and precise to reflect the metabolic responses in patients to gliclazide MR therapy, thereby contributing for the personalized medicine of T2DM patients. The better performance of DMNC validated its potential for the identification of network biomarkers to characterize the responses against therapeutic treatments and provide valuable information for personalized medicine.


Assuntos
Diabetes Mellitus Tipo 2 , Gliclazida , Diabetes Mellitus Tipo 2/tratamento farmacológico , Humanos , Hipoglicemiantes/uso terapêutico , Redes e Vias Metabólicas , Medicina de Precisão
12.
Anal Bioanal Chem ; 413(12): 3153-3165, 2021 May.
Artigo em Inglês | MEDLINE | ID: mdl-33796932

RESUMO

Comprehensive prognostic risk prediction of hepatocellular carcinoma (HCC) after surgical treatment is particularly important for guiding clinical decision-making and improving postoperative survival. Hence, we aimed to build prognostic models based on serum metabolomics data, and assess the prognostic risk of HCC within 5 years after surgical resection. A pseudotargeted gas chromatography-mass spectrometry (GC-MS)-based metabolomics method was applied to analyze serum profiling of 78 HCC patients. Important metabolic features with discriminant ability were identified by a novel network-based metabolic feature selection method based on combinational significance index (N-CSI). Subsequently, phenylalanine and galactose were further identified to be relevant with mortality by the Cox regression analysis, while galactose and tyrosine were associated with recurrence and metastasis. Two models to predict risk of mortality (risk score of overall survival, RSOS) and risk of recurrence and metastasis (risk score of disease-free survival, RSDFS) were generated based on two panels of metabolites, respectively, which present favorable ability to predict prognosis of HCC, especially when combined with clinical staging system. The performance of models was further validated in an external independent cohort from 91 HCC patients. This study demonstrated that metabolomics is a powerful tool for risk screening of HCC prognosis.


Assuntos
Carcinoma Hepatocelular/sangue , Cromatografia Gasosa-Espectrometria de Massas/métodos , Neoplasias Hepáticas/sangue , Metabolômica/métodos , Adulto , Carcinoma Hepatocelular/patologia , Carcinoma Hepatocelular/cirurgia , Feminino , Humanos , Neoplasias Hepáticas/patologia , Neoplasias Hepáticas/cirurgia , Masculino , Pessoa de Meia-Idade , Prognóstico , Reprodutibilidade dos Testes , Análise de Sobrevida
13.
Glob Chall ; 5(4): 2000088, 2021 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-33854788

RESUMO

In a Chinese prospective cohort, 500 patients with new-onset type 2 diabetes (T2D) within 4.61 years and 500 matched healthy participants are selected as case and control groups, and randomized into discovery and validation sets to discover the metabolite changes before T2D onset and the related diabetogenic loci. A serum metabolomics analysis reveals that 81 metabolites changed significantly before T2D onset. Based on binary logistic regression, eight metabolites are defined as a biomarker panel for T2D prediction. Pipecolinic acid, carnitine C14:0, epinephrine and phosphatidylethanolamine 34:2 are first found associated with future T2D. The addition of the biomarker panel to the clinical markers (BMI, triglycerides, and fasting glucose) significantly improves the predictive ability in the discovery and validation sets, respectively. By associating metabolomics with genomics, a significant correlation (p < 5.0 × 10-8) between eicosatetraenoic acid and the FADS1 (rs174559) gene is observed, and suggestive correlations (p < 5.0 × 10-6) between pipecolinic acid and CHRM3 (rs535514), and leucine/isoleucine and WWOX (rs72487966) are discovered. Elevated leucine/isoleucine levels increased the risk of T2D. In conclusion, multiple metabolic dysregulations are observed to occur before T2D onset, and the new biomarker panel can help to predict T2D risk.

14.
J Proteome Res ; 19(8): 3533-3541, 2020 08 07.
Artigo em Inglês | MEDLINE | ID: mdl-32618195

RESUMO

Assessment and prediction of prognostic risk in patients with hepatocellular carcinoma (HCC) would greatly benefit the optimal treatment selection. Here, we aimed to identify the critical metabolites associated with the outcomes and develop a risk score to assess the prognosis of HCC patients after curative resection. A total of 78 serum samples of HCC patients were analyzed by liquid chromatography-mass spectrometry to characterize the metabolic profiling. A novel network-based feature selection method (NFSM) was developed to define the critical metabolites with the most discriminant capacity to outcomes. The metabolites defined by NFSM was further reduced by Cox regression analysis to generate a prognostic metabolite panel-phenylalanine and choline. Furthermore, univariate and multivariate Cox regression analyses were applied to combine the metabolite panel with the presence of satellite nodes to generate a global prognostic index (GPI) score for overall survival assessment. Compared with the current clinical classification systems, including the Barcelona-clinic liver cancer stage, tumor-node-metastasis stage, and albumin-bilirubin grade, the GPI score presented comparable performance, according to the time-dependent receiver operating characteristic curves and was validated in an independent cohort, which suggested that metabolomics could serve as a helpful tool to stratify the HCC prognostic risk after operation.


Assuntos
Carcinoma Hepatocelular , Neoplasias Hepáticas , Carcinoma Hepatocelular/diagnóstico , Carcinoma Hepatocelular/cirurgia , Cromatografia Líquida , Humanos , Neoplasias Hepáticas/diagnóstico , Neoplasias Hepáticas/cirurgia , Espectrometria de Massas , Metabolômica , Prognóstico
15.
Anal Bioanal Chem ; 411(24): 6377-6386, 2019 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-31384984

RESUMO

Omics techniques develop quickly and have made a great contribution to disease study. Omics data are usually complex. How to analyze the data and mine important information has been a key part in omics research. To study the nature of disease mechanisms systematically, we propose a new data analysis method to define the network biomarkers based on horizontal comparison (DNB-HC). DNB-HC performs molecule horizontal relationships to characterize the physiological status and differential network analysis to screen the biomarkers. We applied DNB-HC to analyze a large-scale metabolomics data, which contained 550 samples from a nested case-control hepatocellular carcinoma (HCC) study. A network biomarker was defined, and its areas under curves (AUC) in the receiver-operating characteristic (ROC) analysis for HCC discrimination were larger than those defined by six efficient feature selection methods in most cases. The effectiveness was further corroborated by another nested HCC dataset. Besides, the performance of the defined biomarkers was better than that of α-fetoprotein (AFP), a commonly used clinical biomarker for distinguishing HCC from high-risk population of liver cirrhosis in other two independent metabolomics validation sets. All and 90.3% of the AFP false-negative patients with HCC were correctly diagnosed in these two sets, respectively. The experimental results illustrate that DNB-HC can mine more important information reflecting the nature of the research problems by studying the feature horizontal relationship systematically and identifying effective disease biomarkers in clinical practice. Graphical abstract.


Assuntos
Biomarcadores Tumorais/metabolismo , Carcinoma Hepatocelular/metabolismo , Neoplasias Hepáticas/metabolismo , Metabolômica , Adulto , Estudos de Casos e Controles , Cromatografia Líquida/métodos , Estudos de Coortes , Feminino , Humanos , Masculino , Pessoa de Meia-Idade , Reprodutibilidade dos Testes , Espectrometria de Massas em Tandem/métodos , alfa-Fetoproteínas/metabolismo
16.
Anal Chem ; 90(19): 11401-11408, 2018 10 02.
Artigo em Inglês | MEDLINE | ID: mdl-30148611

RESUMO

The pseudotargeted metabolomics method integrates advantages of nontargeted and targeted analysis because it can acquire data of metabolites in the multireaction monitoring (MRM) mode of mass spectrometry (MS) without needing standards. The key is the ion-pair information collection from samples to be analyzed. It is well-known that sequential windowed acquisition of all theoretical Fragment ion (SWATH) MS mode can acquire MS2 information to a maximum extent. To expediently acquire as many ion-pairs as possible with optimal collision energy (CE), an ion-pair selection approach based on SWATH MS acquisition with variable isolation windows was developed in this study. Initially, nontargeted acquisition of all metabolites information in plasma Standard Reference Material (SRM 1950) was performed by ultra high-performance liquid chromatography (UHPLC)-quadrupole time-of-flight (Q-TOF) MS platform with three CEs. With the help of software tool, the ion-pairs of unique metabolites were gained. Then they were validated in scheduled MRM coupled with UHPLC. After removing false positive, the ion-pairs with an optimal CE was integrated. A total of 1373 unique metabolite ion-pairs were obtained at positive ion mode. And repeatability of the established pseudotargeted approach was evaluated by intraday and interday precision. The results demonstrated the method was stable, reliable, and suitable for metabolomics study. As an application example, alterations of serum metabolites in Type 2 diabetes were investigated by using the established method. This work provides a pseudotargeted ion-pair selection method based on SWATH MS acquisition with the characters of increased metabolite coverage, suitable CE, and convenient processing.


Assuntos
Metabolômica/métodos , Soro/metabolismo , Adulto , Glicemia/análise , Cromatografia Líquida de Alta Pressão , Diabetes Mellitus Tipo 2/metabolismo , Diabetes Mellitus Tipo 2/patologia , Feminino , Humanos , Íons/química , Lipoproteínas HDL/sangue , Masculino , Pessoa de Meia-Idade , Espectrometria de Massas por Ionização por Electrospray
17.
J Pharm Biomed Anal ; 157: 20-26, 2018 Aug 05.
Artigo em Inglês | MEDLINE | ID: mdl-29754039

RESUMO

Feature relationships are complex and may contain important information. k top scoring pairs (k-TSP) studies feature relationships by the horizontal comparison. This study examines feature relationships and proposes vertical and horizontal k-TSP (VH-k-TSP) to identify the discriminative feature pairs by evaluating feature pairs based on the vertical and horizontal comparisons. Complexity is introduced to compute the discriminative abilities of feature pairs by means of these two comparisons. VH-k-TSP was compared with support vector machine-recursive feature elimination, relative simplicity-support vector machine, k-TSP and M-k-TSP on nine public genomics datasets. For multi-class problems, one-to-one method was used. The experiments showed that VH-k-TSP outperformed the four methods in most cases. Then, VH-k-TSP was applied to a metabolomics data of liver disease. An accuracy rate of 88.11 ±â€¯3.30% in discrimination between cirrhosis and hepatocellular carcinoma was obtained by VH-k-TSP, better than 77.39 ±â€¯4.10% and 79.28 ±â€¯3.73% obtained by k-TSP and M-k-TSP, respectively. Hence combining the vertical and horizontal comparisons could define more discriminative feature pairs.


Assuntos
Carcinoma Hepatocelular/genética , Carcinoma Hepatocelular/metabolismo , Hepatopatias/genética , Hepatopatias/metabolismo , Neoplasias Hepáticas/genética , Neoplasias Hepáticas/metabolismo , Genômica/métodos , Humanos , Metabolômica/métodos , Máquina de Vetores de Suporte
18.
Molecules ; 23(1)2017 Dec 26.
Artigo em Inglês | MEDLINE | ID: mdl-29278382

RESUMO

Feature selection is an important topic in bioinformatics. Defining informative features from complex high dimensional biological data is critical in disease study, drug development, etc. Support vector machine-recursive feature elimination (SVM-RFE) is an efficient feature selection technique that has shown its power in many applications. It ranks the features according to the recursive feature deletion sequence based on SVM. In this study, we propose a method, SVM-RFE-OA, which combines the classification accuracy rate and the average overlapping ratio of the samples to determine the number of features to be selected from the feature rank of SVM-RFE. Meanwhile, to measure the feature weights more accurately, we propose a modified SVM-RFE-OA (M-SVM-RFE-OA) algorithm that temporally screens out the samples lying in a heavy overlapping area in each iteration. The experiments on the eight public biological datasets show that the discriminative ability of the feature subset could be measured more accurately by combining the classification accuracy rate with the average overlapping degree of the samples compared with using the classification accuracy rate alone, and shielding the samples in the overlapping area made the calculation of the feature weights more stable and accurate. The methods proposed in this study can also be used with other RFE techniques to define potential biomarkers from big biological data.


Assuntos
Biomarcadores/metabolismo , Biologia Computacional/métodos , Doença/classificação , Máquina de Vetores de Suporte , Algoritmos , Humanos , Fenótipo
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...