Search | VHL Regional Portal

Greedy Methods, Randomization Approaches, and Multiarm Bandit Algorithms for Efficient Sparsity-Constrained Optimization.

Rakotomamonjy, Alain; Koco, Sokol; Ralaivola, Liva.

IEEE Trans Neural Netw Learn Syst ; 28(11): 2789-2802, 2017 11.

Article in English | MEDLINE | ID: mdl-28113680

ABSTRACT

Several sparsity-constrained algorithms, such as orthogonal matching pursuit (OMP) or the Frank-Wolfe (FW) algorithm, with sparsity constraints work by iteratively selecting a novel atom to add to the current nonzero set of variables. This selection step is usually performed by computing the gradient and then by looking for the gradient component with maximal absolute entry. This step can be computationally expensive especially for large-scale and high-dimensional data. In this paper, we aim at accelerating these sparsity-constrained optimization algorithms by exploiting the key observation that, for these algorithms to work, one only needs the coordinate of the gradient's top entry. Hence, we introduce algorithms based on greedy methods and randomization approaches that aim at cheaply estimating the gradient and its top entry. Another of our contribution is to cast the problem of finding the best gradient entry as a best-arm identification in a multiarmed bandit problem. Owing to this novel insight, we are able to provide a bandit-based algorithm that directly estimates the top entry in a very efficient way. Theoretical observations stating that the resulting inexact FW or OMP algorithms act, with high probability, similar to their exact versions are also given. We have carried out several experiments showing that the greedy deterministic and the bandit approaches we propose can achieve an acceleration of an order of magnitude while being as efficient as the exact gradient when used in algorithms, such as OMP, FW, or CoSaMP.

Graph-based inter-subject pattern analysis of FMRI data.

Takerkart, Sylvain; Auzias, Guillaume; Thirion, Bertrand; Ralaivola, Liva.

PLoS One ; 9(8): e104586, 2014.

Article in English | MEDLINE | ID: mdl-25127129

ABSTRACT

In brain imaging, solving learning problems in multi-subjects settings is difficult because of the differences that exist across individuals. Here we introduce a novel classification framework based on group-invariant graphical representations, allowing to overcome the inter-subject variability present in functional magnetic resonance imaging (fMRI) data and to perform multivariate pattern analysis across subjects. Our contribution is twofold: first, we propose an unsupervised representation learning scheme that encodes all relevant characteristics of distributed fMRI patterns into attributed graphs; second, we introduce a custom-designed graph kernel that exploits all these characteristics and makes it possible to perform supervised learning (here, classification) directly in graph space. The well-foundedness of our technique and the robustness of the performance to the parameter setting are demonstrated through inter-subject classification experiments conducted on both artificial data and a real fMRI experiment aimed at characterizing local cortical representations. Our results show that our framework produces accurate inter-subject predictions and that it outperforms a wide range of state-of-the-art vector- and parcel-based classification methods. Moreover, the genericity of our method makes it is easily adaptable to a wide range of potential applications. The dataset used in this study and an implementation of our framework are available at http://dx.doi.org/10.6084/m9.figshare.1086317.

Subject(s)

Brain Mapping/methods , Brain/diagnostic imaging , Image Interpretation, Computer-Assisted/methods , Neuroimaging/methods , Algorithms , Humans , Magnetic Resonance Imaging/methods , Pattern Recognition, Automated/methods , Radiography

One- to four-dimensional kernels for virtual screening and the prediction of physical, chemical, and biological properties.

Azencott, Chloé-Agathe; Ksikes, Alexandre; Swamidass, S Joshua; Chen, Jonathan H; Ralaivola, Liva; Baldi, Pierre.

J Chem Inf Model ; 47(3): 965-74, 2007.

Article in English | MEDLINE | ID: mdl-17338509

ABSTRACT

Many chemoinformatics applications, including high-throughput virtual screening, benefit from being able to rapidly predict the physical, chemical, and biological properties of small molecules to screen large repositories and identify suitable candidates. When training sets are available, machine learning methods provide an effective alternative to ab initio methods for these predictions. Here, we leverage rich molecular representations including 1D SMILES strings, 2D graphs of bonds, and 3D coordinates to derive efficient machine learning kernels to address regression problems. We further expand the library of available spectral kernels for small molecules developed for classification problems to include 2.5D surface and 3D kernels using Delaunay tetrahedrization and other techniques from computational geometry, 3D pharmacophore kernels, and 3.5D or 4D kernels capable of taking into account multiple molecular configurations, such as conformers. The kernels are comprehensively tested using cross-validation and redundancy-reduction methods on regression problems using several available data sets to predict boiling points, melting points, aqueous solubility, octanol/water partition coefficients, and biological activity with state-of-the art results. When sufficient training data are available, 2D spectral kernels in general tend to yield the best and most robust results, better than state-of-the art. On data sets containing thousands of molecules, the kernels achieve a squared correlation coefficient of 0.91 for aqueous solubility prediction and 0.94 for octanol/water partition coefficient prediction. Averaging over conformations improves the performance of kernels based on the three-dimensional structure of molecules, especially on challenging data sets. Kernel predictors for aqueous solubility (kSOL), LogP (kLOGP), and melting point (kMELT) are available over the Web through: http://cdb.ics.uci.edu.

Subject(s)

Drug Evaluation, Preclinical/methods , Informatics/methods , Pharmaceutical Preparations/chemistry , Alkanes/chemistry , Benzodiazepines/chemistry , Benzodiazepines/pharmacology , Solubility , Transition Temperature

The pharmacophore kernel for virtual screening with support vector machines.

Mahé, Pierre; Ralaivola, Liva; Stoven, Véronique; Vert, Jean-Philippe.

J Chem Inf Model ; 46(5): 2003-14, 2006.

Article in English | MEDLINE | ID: mdl-16995731

ABSTRACT

We introduce a family of positive definite kernels specifically optimized for the manipulation of 3D structures of molecules with kernel methods. The kernels are based on the comparison of the three-point pharmacophores present in the 3D structures of molecules, a set of molecular features known to be particularly relevant for virtual screening applications. We present a computationally demanding exact implementation of these kernels, as well as fast approximations related to the classical fingerprint-based approaches. Experimental results suggest that this new approach is competitive with state-of-the-art algorithms based on the 2D structure of molecules for the detection of inhibitors of several drug targets.

Subject(s)

Pharmaceutical Preparations , Algorithms

Graph kernels for chemical informatics.

Ralaivola, Liva; Swamidass, Sanjay J; Saigo, Hiroto; Baldi, Pierre.

Neural Netw ; 18(8): 1093-110, 2005 Oct.

Article in English | MEDLINE | ID: mdl-16157471

ABSTRACT

Increased availability of large repositories of chemical compounds is creating new challenges and opportunities for the application of machine learning methods to problems in computational chemistry and chemical informatics. Because chemical compounds are often represented by the graph of their covalent bonds, machine learning methods in this domain must be capable of processing graphical structures with variable size. Here, we first briefly review the literature on graph kernels and then introduce three new kernels (Tanimoto, MinMax, Hybrid) based on the idea of molecular fingerprints and counting labeled paths of depth up to d using depth-first search from each possible vertex. The kernels are applied to three classification problems to predict mutagenicity, toxicity, and anti-cancer activity on three publicly available data sets. The kernels achieve performances at least comparable, and most often superior, to those previously reported in the literature reaching accuracies of 91.5% on the Mutag dataset, 65-67% on the PTC (Predictive Toxicology Challenge) dataset, and 72% on the NCI (National Cancer Institute) dataset. Properties and tradeoffs of these kernels, as well as other proposed kernels that leverage 1D or 3D representations of molecules, are briefly discussed.

Subject(s)

Artificial Intelligence , Computer Graphics , Information Storage and Retrieval , Models, Chemical , Pattern Recognition, Automated , Sequence Analysis, Protein , Anticarcinogenic Agents , Databases, Genetic , Drug-Related Side Effects and Adverse Reactions , National Practitioner Data Bank , Structure-Activity Relationship , United States

Kernels for small molecules and the prediction of mutagenicity, toxicity and anti-cancer activity.

Swamidass, S Joshua; Chen, Jonathan; Bruand, Jocelyne; Phung, Peter; Ralaivola, Liva; Baldi, Pierre.

Bioinformatics ; 21 Suppl 1: i359-68, 2005 Jun.

Article in English | MEDLINE | ID: mdl-15961479

ABSTRACT

MOTIVATION: Small molecules play a fundamental role in organic chemistry and biology. They can be used to probe biological systems and to discover new drugs and other useful compounds. As increasing numbers of large datasets of small molecules become available, it is necessary to develop computational methods that can deal with molecules of variable size and structure and predict their physical, chemical and biological properties. RESULTS: Here we develop several new classes of kernels for small molecules using their 1D, 2D and 3D representations. In 1D, we consider string kernels based on SMILES strings. In 2D, we introduce several similarity kernels based on conventional or generalized fingerprints. Generalized fingerprints are derived by counting in different ways subpaths contained in the graph of bonds, using depth-first searches. In 3D, we consider similarity measures between histograms of pairwise distances between atom classes. These kernels can be computed efficiently and are applied to problems of classification and prediction of mutagenicity, toxicity and anti-cancer activity on three publicly available datasets. The results derived using cross-validation methods are state-of-the-art. Tradeoffs between various kernels are briefly discussed. AVAILABILITY: Datasets available from http://www.igb.uci.edu/servers/servers.html

Subject(s)

Antineoplastic Agents/pharmacology , Computational Biology/methods , Neoplasms/pathology , Animals , Computer Simulation , Databases, Factual , Drug-Related Side Effects and Adverse Reactions , Female , Male , Mice , Models, Molecular , Models, Statistical , Mutagens , Pattern Recognition, Automated , ROC Curve , Rats

Gene networks inference using dynamic Bayesian networks.

Perrin, Bruno-Edouard; Ralaivola, Liva; Mazurie, Aurélien; Bottani, Samuele; Mallet, Jacques; d'Alché-Buc, Florence.

Bioinformatics ; 19 Suppl 2: ii138-48, 2003 Oct.

Article in English | MEDLINE | ID: mdl-14534183

ABSTRACT

This article deals with the identification of gene regulatory networks from experimental data using a statistical machine learning approach. A stochastic model of gene interactions capable of handling missing variables is proposed. It can be described as a dynamic Bayesian network particularly well suited to tackle the stochastic nature of gene regulation and gene expression measurement. Parameters of the model are learned through a penalized likelihood maximization implemented through an extended version of EM algorithm. Our approach is tested against experimental data relative to the S.O.S. DNA Repair network of the Escherichia coli bacterium. It appears to be able to extract the main regulations between the genes involved in this network. An added missing variable is found to model the main protein of the network. Good prediction abilities on unlearned data are observed. These first results are very promising: they show the power of the learning algorithm and the ability of the model to capture gene interactions.

Subject(s)

DNA Repair/physiology , Escherichia coli Proteins/metabolism , Escherichia coli/physiology , Gene Expression Profiling/methods , Gene Expression Regulation/physiology , Models, Biological , Signal Transduction/physiology , Algorithms , Artificial Intelligence , Bayes Theorem , Computer Simulation , Data Interpretation, Statistical , Pattern Recognition, Automated

ABSTRACT

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

SEND TO:

SELECTION OF CITATIONS

SEARCH DETAIL