ABSTRACT
Gene expression signatures of toxicity and clinical response benefit both safety assessment and clinical practice; however, difficulties in connecting signature genes with the predicted end points have limited their application. The Microarray Quality Control Consortium II (MAQCII) project generated 262 signatures for ten clinical and three toxicological end points from six gene expression data sets, an unprecedented collection of diverse signatures that has permitted a wide-ranging analysis on the nature of such predictive models. A comprehensive analysis of the genes of these signatures and their nonredundant unions using ontology enrichment, biological network building and interactome connectivity analyses demonstrated the link between gene signatures and the biological basis of their predictive power. Different signatures for a given end point were more similar at the level of biological properties and transcriptional control than at the gene level. Signatures tended to be enriched in function and pathway in an end point and model-specific manner, and showed a topological bias for incoming interactions. Importantly, the level of biological similarity between different signatures for a given end point correlated positively with the accuracy of the signature predictions. These findings will aid the understanding, and application of predictive genomic signatures, and support their broader application in predictive medicine.
Subject(s)
Algorithms , Gene Expression Profiling , Genomics/statistics & numerical data , Databases, Genetic , Endpoint Determination/statistics & numerical data , Humans , Neural Networks, Computer , Oligonucleotide Array Sequence Analysis , Phenotype , Predictive Value of Tests , Proteins/classification , Proteins/genetics , Quality ControlABSTRACT
The authors have previously applied two integrated platforms, MetaCore and MetaDrug, for the assembly and analysis of human biological networks as a useful method for the integration and functional interpretation of high-throughput experimental data. The present study demonstrates in detail the specific algorithms that are used in both software platforms. Using a standard set of genes as input, namely CYP3A4 (an enzyme), PXR (a nuclear hormone receptor), MDR1 (a transporter) and hERG (an ion channel) related to the absorption, distribution, metabolism, excretion and toxicity (ADME/Tox) of xenobiotics, we have now generated networks with each algorithm. The relative advantages and disadvantages of these algorithms are explained using these examples as well as appropriate instances of utility to illustrate further the particular circumstances for their use. In addition, the benefits of the different network algorithms are identified when compared with algorithms available in other products, where this information is available.