Search | VHL Regional Portal

A novel artificial intelligence-based approach for identification of deoxynucleotide aptamers.

Heredia, Frances L; Roche-Lima, Abiel; Parés-Matos, Elsie I.

PLoS Comput Biol ; 17(8): e1009247, 2021 08.

Article in English | MEDLINE | ID: mdl-34343165

ABSTRACT

The selection of a DNA aptamer through the Systematic Evolution of Ligands by EXponential enrichment (SELEX) method involves multiple binding steps, in which a target and a library of randomized DNA sequences are mixed for selection of a single, nucleotide-specific molecule. Usually, 10 to 20 steps are required for SELEX to be completed. Throughout this process it is necessary to discriminate between true DNA aptamers and unspecified DNA-binding sequences. Thus, a novel machine learning-based approach was developed to support and simplify the early steps of the SELEX process by helping to discriminate DNA aptamers from unspecified DNA-binding sequences. An Artificial Intelligence (AI) approach to identify aptamers was implemented based on Natural Language Processing (NLP) and Machine Learning (ML). An NLP method (CountVectorizer) was used to extract information from the nucleotide sequences. Four ML algorithms (Logistic Regression, Decision Tree, Gaussian Naïve Bayes, Support Vector Machines) were trained using data from the NLP method along with sequence information. The best-performing model was Support Vector Machines because it had the best ability to discriminate between positive and negative classes. This model achieved an Accuracy (A) of 0.995 (the fraction of samples that the model correctly classified) and an Area Under the Receiver Operating Characteristic curve (AUROC) of 0.998 (the degree to which a model is capable of distinguishing between classes). The developed AI approach is useful for identifying potential DNA aptamers and so reducing the number of rounds in a SELEX selection. This new approach could be applied in the design of DNA libraries and result in a more efficient and faster process for choosing DNA aptamers during SELEX.
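Illustrative note: the abstract names CountVectorizer, Accuracy, and AUROC without showing what they compute. The sketch below is a minimal pure-Python stand-in, not the authors' pipeline: it assumes that sequences are tokenized into overlapping k-mers (the abstract does not state the tokenization), and the function names `kmer_counts`, `vectorize`, `accuracy`, and `auroc` are hypothetical. The toy sequence is made up for the example.

```python
from collections import Counter
from itertools import product


def kmer_counts(sequence, k=3):
    """Count overlapping k-mers, treating each k-mer as a 'word' (assumed tokenization)."""
    return Counter(sequence[i:i + k] for i in range(len(sequence) - k + 1))


def vectorize(sequences, k=3):
    """Turn sequences into count vectors over a fixed A/C/G/T k-mer vocabulary."""
    vocab = ["".join(p) for p in product("ACGT", repeat=k)]
    rows = []
    for seq in sequences:
        counts = kmer_counts(seq, k)
        rows.append([counts[word] for word in vocab])
    return vocab, rows


def accuracy(y_true, y_pred):
    """Fraction of samples whose predicted label matches the true label."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)


def auroc(y_true, scores):
    """Probability that a random positive outscores a random negative (ties count 0.5)."""
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Toy usage with a made-up sequence and made-up labels/scores.
vocab, rows = vectorize(["ACGTACGT"], k=2)
toy_accuracy = accuracy([1, 0, 1, 1], [1, 0, 0, 1])
toy_auroc = auroc([1, 1, 0, 0], [0.9, 0.4, 0.5, 0.1])
```

The count vectors in `rows` are the kind of feature matrix a classifier such as a support vector machine would be trained on; the pairwise-ranking form of AUROC is used here because it needs no threshold sweep.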

Subject(s)

Aptamers, Nucleotide/metabolism , Artificial Intelligence , SELEX Aptamer Technique/methods , Algorithms , Aptamers, Nucleotide/chemistry , Bayes Theorem , Computational Biology , Decision Trees , Gene Library , Humans , Ligands , Logistic Models , Machine Learning , Natural Language Processing , Protein Binding , SELEX Aptamer Technique/statistics & numerical data , Support Vector Machine

Controlling uncertainty in aptamer selection.

Spill, Fabian; Weinstein, Zohar B; Irani Shemirani, Atena; Ho, Nga; Desai, Darash; Zaman, Muhammad H.

Proc Natl Acad Sci U S A ; 113(43): 12076-12081, 2016 10 25.

Article in English | MEDLINE | ID: mdl-27790993

ABSTRACT

The search for high-affinity aptamers for targets such as proteins, small molecules, or cancer cells remains a formidable endeavor. Systematic Evolution of Ligands by EXponential Enrichment (SELEX) offers an iterative process to discover these aptamers through evolutionary selection of high-affinity candidates from a highly diverse random pool. This randomness dictates an unknown population distribution of fitness parameters, encoded by the binding affinities, toward SELEX targets. Adding to this uncertainty, repeating SELEX under identical conditions may lead to variable outcomes. These uncertainties pose a challenge when tuning selection pressures to isolate high-affinity ligands. Here, we present a stochastic hybrid model that describes the evolutionary selection of aptamers to explore the impact of these unknowns. To our surprise, we find that even single copies of high-affinity ligands in a pool of billions can strongly influence population dynamics, yet their survival is highly dependent on chance. We perform Monte Carlo simulations to explore the impact of environmental parameters, such as the target concentration, on selection efficiency in SELEX and identify strategies to control these uncertainties to ultimately improve the outcome and speed of this time- and resource-intensive process.
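Illustrative note: the claim that a single high-affinity copy can be lost by chance can be made concrete with a toy Monte Carlo experiment. This is not the authors' stochastic hybrid model. It assumes that each copy is retained independently in one partition step with the equilibrium binding probability [T] / ([T] + Kd), with target in excess. The function names and numbers are hypothetical.

```python
import random


def bound_probability(target_conc, kd):
    """Equilibrium probability that one ligand copy is bound (target assumed in excess)."""
    return target_conc / (target_conc + kd)


def retention_rate(copies, target_conc, kd, trials=20000, seed=0):
    """Monte Carlo estimate of how often at least one of `copies` survives one partition."""
    rng = random.Random(seed)
    p = bound_probability(target_conc, kd)
    kept = 0
    for _ in range(trials):
        # Each copy binds (and is retained) independently with probability p.
        if any(rng.random() < p for _ in range(copies)):
            kept += 1
    return kept / trials


# Toy usage: concentrations and Kd share the same (arbitrary) units.
single_copy = retention_rate(copies=1, target_conc=10.0, kd=10.0)
three_copies = retention_rate(copies=3, target_conc=10.0, kd=10.0)
```

Under these assumptions the estimates should approach 1 - (1 - p)^copies, so raising the target concentration or starting with more copies both reduce the chance of losing the sequence in an early round.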

Subject(s)

Aptamers, Nucleotide/chemistry , Nucleic Acids/chemistry , Proteins/chemistry , SELEX Aptamer Technique/statistics & numerical data , Small Molecule Libraries/chemistry , Binding Sites , Binding, Competitive , Humans , Kinetics , Ligands , Monte Carlo Method , Stochastic Processes , Uncertainty

Revealing protein-lncRNA interaction.

Ferrè, Fabrizio; Colantoni, Alessio; Helmer-Citterich, Manuela.

Brief Bioinform ; 17(1): 106-16, 2016 Jan.

Article in English | MEDLINE | ID: mdl-26041786

ABSTRACT

Long non-coding RNAs (lncRNAs) are associated with a plethora of cellular functions, most of which require interaction with one or more RNA-binding proteins (RBPs); similarly, RBPs are often able to bind a large number of different RNAs. The currently available knowledge is already drawing an intricate network of interactions, whose deregulation is frequently associated with pathological states. Several different techniques were developed in the past years to obtain protein-RNA binding data in a high-throughput fashion. In parallel, in silico inference methods were developed for the accurate computational prediction of the interaction of RBP-lncRNA pairs. The field is growing rapidly, and it is foreseeable that in the near future the known protein-lncRNA interaction network will expand, offering essential clues for a better understanding of lncRNA cellular mechanisms and their disease-associated perturbations.

Subject(s)

RNA, Long Noncoding/metabolism , RNA-Binding Proteins/metabolism , Computational Biology/methods , Computer Simulation , High-Throughput Nucleotide Sequencing/statistics & numerical data , Humans , Models, Molecular , Nucleic Acid Conformation , Protein Conformation , Protein Interaction Maps/genetics , RNA, Long Noncoding/chemistry , RNA, Long Noncoding/genetics , RNA-Binding Proteins/chemistry , RNA-Binding Proteins/genetics , SELEX Aptamer Technique/statistics & numerical data

An integrated approach to blood-based cancer diagnosis and biomarker discovery.

Min, Martin Renqiang; Chowdhury, Salim; Qi, Yanjun; Stewart, Alex; Ostroff, Rachel.

Pac Symp Biocomput ; : 87-98, 2014.

Article in English | MEDLINE | ID: mdl-24297536

ABSTRACT

Disrupted or abnormal biological processes responsible for cancers often quantitatively manifest as disrupted additive and multiplicative interactions of gene/protein expressions correlating with cancer progression. However, the examination of all possible combinatorial interactions between gene features in most case-control studies with limited training data is computationally infeasible. In this paper, we propose a practically feasible data integration approach, QUIRE (QUadratic Interactions among infoRmative fEatures), to identify discriminative complex interactions among informative gene features for cancer diagnosis and biomarker discovery directly based on patient blood samples. QUIRE works in two stages, where it first identifies functionally relevant gene groups for the disease with the help of gene functional annotations and available physical protein interactions, then it explores the combinatorial relationships among the genes from the selected informative groups. Based on our private experimentally generated data from patient blood samples using a novel SOMAmer (Slow Off-rate Modified Aptamer) technology, we apply QUIRE to cancer diagnosis and biomarker discovery for Renal Cell Carcinoma (RCC) and Ovarian Cancer (OVC). To further demonstrate the general applicability of our approach, we also apply QUIRE to a publicly available Colorectal Cancer (CRC) dataset that can be used to prioritize our SOMAmer design. Our experimental results show that QUIRE identifies gene-gene interactions that can better identify the different cancer stages of samples, as compared to other state-of-the-art feature selection methods. A literature survey shows that many of the interactions identified by QUIRE play important roles in the development of cancer.

Subject(s)

Biomarkers/blood , Neoplasms/blood , Neoplasms/diagnosis , Artificial Intelligence , Carcinoma, Renal Cell/blood , Carcinoma, Renal Cell/diagnosis , Carcinoma, Renal Cell/genetics , Colorectal Neoplasms/blood , Colorectal Neoplasms/diagnosis , Colorectal Neoplasms/genetics , Computational Biology , Disease Progression , Epistasis, Genetic , Female , Genetic Markers , Genome-Wide Association Study/statistics & numerical data , Humans , Kidney Neoplasms/blood , Kidney Neoplasms/diagnosis , Kidney Neoplasms/genetics , Models, Genetic , Neoplasms/genetics , Ovarian Neoplasms/blood , Ovarian Neoplasms/diagnosis , Ovarian Neoplasms/genetics , SELEX Aptamer Technique/statistics & numerical data

Theoretical modeling of masking DNA application in aptamer-facilitated biomarker discovery.

Cherney, Leonid T; Obrecht, Natalia M; Krylov, Sergey N.

Anal Chem ; 85(8): 4157-64, 2013 Apr 16.

Article in English | MEDLINE | ID: mdl-23480390

ABSTRACT

In aptamer-facilitated biomarker discovery (AptaBiD), aptamers are selected from a library of random DNA (or RNA) sequences for their ability to specifically bind cell-surface biomarkers. The library is incubated with intact cells, and cell-bound DNA molecules are separated from those unbound and amplified by the polymerase chain reaction (PCR). The partitioning/amplification cycle is repeated multiple times while alternating target cells and control cells. Efficient aptamer selection in AptaBiD relies on the inclusion of masking DNA within the cell and library mixture. Masking DNA lacks primer regions for PCR amplification and is typically taken in excess to the library. The role of masking DNA within the selection mixture is to outcompete any nonspecific binding sequences within the initial library, thus allowing specific DNA sequences (i.e., aptamers) to be selected more efficiently. Efficient AptaBiD requires an optimum ratio of masking DNA to library DNA, at which aptamers still bind specific binding sites but nonaptamers within the library do not bind nonspecific binding sites. Here, we have developed a mathematical model that describes the binding processes taking place within the equilibrium mixture of masking DNA, library DNA, and target cells. An obtained mathematical solution allows one to estimate the concentration of masking DNA that is required to outcompete the library DNA at a desirable ratio of bound masking DNA to bound library DNA. The required concentration depends on concentrations of the library and cells as well as on unknown cell characteristics. These characteristics include the concentration of total binding sites on the cell surface, N, and equilibrium dissociation constants, K(nsL) and K(nsM), for nonspecific binding of the library DNA and masking DNA, respectively. We developed a theory that allows the determination of N, K(nsL), and K(nsM) based on measurements of EC50 values for cells mixed separately with the library and masking DNA (EC50 is the concentration of fluorescently labeled DNA at which half of the maximum fluorescence signal from DNA-bound cells is reached). We also obtained expressions for signals from bound DNA (measured by flow cytometry) in terms of N, K(nsL), and K(nsM). These expressions can be used for the verification of N, K(nsL), and K(nsM) values found from EC50 measurements. The developed procedure was applied to MCF-7 breast cancer cells, and corresponding values of N, K(nsL), and K(nsM) were established for the first time. The concentration of masking DNA required for AptaBiD with MCF-7 breast cancer cells was also estimated.
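Illustrative note: the abstract does not reproduce the authors' solution, which also depends on the cell concentration and on N. As a rough sketch of the underlying idea, the code below uses the textbook competitive-binding relation for two ligands sharing one class of nonspecific sites, under the simplifying assumption that both DNAs are in large excess over sites (free concentration approximately equal to total). The function names and numbers are hypothetical.

```python
def occupancy(lib_conc, mask_conc, k_ns_lib, k_ns_mask):
    """Fractions of nonspecific sites occupied by library DNA and by masking DNA.

    Simple competitive binding; assumes free concentrations ~ total concentrations.
    """
    a = lib_conc / k_ns_lib      # library concentration scaled by K(nsL)
    b = mask_conc / k_ns_mask    # masking concentration scaled by K(nsM)
    denom = 1.0 + a + b
    return a / denom, b / denom


def masking_conc_for_ratio(ratio, lib_conc, k_ns_lib, k_ns_mask):
    """Masking DNA concentration at which bound masking / bound library equals `ratio`."""
    return ratio * lib_conc * k_ns_mask / k_ns_lib


# Toy usage: all concentrations and constants in the same arbitrary units.
needed = masking_conc_for_ratio(ratio=10, lib_conc=100.0, k_ns_lib=50.0, k_ns_mask=50.0)
theta_lib, theta_mask = occupancy(100.0, needed, 50.0, 50.0)
```

In this simplified picture the required masking concentration scales linearly with the library concentration and with K(nsM)/K(nsL), which is why both dissociation constants have to be measured before the ratio can be chosen.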

Subject(s)

Aptamers, Nucleotide/genetics , DNA, Neoplasm/analysis , Flow Cytometry/statistics & numerical data , Models, Chemical , SELEX Aptamer Technique/statistics & numerical data , Binding Sites , Binding, Competitive , Biomarkers/analysis , Cell Line, Tumor , DNA Primers/genetics , DNA, Neoplasm/genetics , Female , Gene Library , Humans , Kinetics , Polymerase Chain Reaction , SELEX Aptamer Technique/methods

A highly sensitive aptasensor towards Plasmodium lactate dehydrogenase for the diagnosis of malaria.

Lee, Seonghwan; Song, Kyung-Mi; Jeon, Weejeong; Jo, Hunho; Shim, Yoon-Bo; Ban, Changill.

Biosens Bioelectron ; 35(1): 291-296, 2012 May 15.

Article in English | MEDLINE | ID: mdl-22459583

ABSTRACT

Finding a highly sensitive diagnostic technique for malaria has challenged scientists for the last century. In the present study, we identified versatile single-strand DNA aptamers for Plasmodium lactate dehydrogenase (pLDH), a biomarker for malaria, via the Systematic Evolution of Ligands by EXponential enrichment (SELEX). The pLDH aptamers selectively bound to the target proteins with high sensitivity (K(d)=16.8-49.6 nM). The selected aptamers were characterized using an electrophoretic mobility shift assay, a quartz crystal microbalance, a fluorescence assay, and circular dichroism spectroscopy. We also designed a simple aptasensor using electrochemical impedance spectroscopy; both Plasmodium vivax LDH and Plasmodium falciparum LDH were selectively detected with a detection limit of 1 pM. Furthermore, the pLDH aptasensor clearly distinguished between malaria-positive blood samples of two major species (P. vivax and P. falciparum) and a negative control, indicating that it may be a useful tool for the diagnosis, monitoring, and surveillance of malaria.

Subject(s)

Aptamers, Nucleotide , Biosensing Techniques/methods , L-Lactate Dehydrogenase/blood , Malaria/diagnosis , Plasmodium/enzymology , SELEX Aptamer Technique/methods , Aptamers, Nucleotide/chemistry , Base Sequence , Biomarkers/blood , Biosensing Techniques/statistics & numerical data , Circular Dichroism , Dielectric Spectroscopy , Electrophoretic Mobility Shift Assay , Humans , Limit of Detection , Malaria/enzymology , Malaria/parasitology , Malaria, Falciparum/diagnosis , Malaria, Vivax/diagnosis , Nucleic Acid Conformation , Quartz Crystal Microbalance Techniques , SELEX Aptamer Technique/statistics & numerical data

Selection of thrombin-binding aptamers by using computational approach for aptasensor application.

Bini, Alessandra; Mascini, Marcello; Mascini, Marco; Turner, Anthony P F.

Biosens Bioelectron ; 26(11): 4411-6, 2011 Jul 15.

Article in English | MEDLINE | ID: mdl-21636260

ABSTRACT

The possibility of introducing a computationally assisted method to study aptamer-protein interaction was evaluated with the aim of streamlining the screening and selection of new aptamers. Starting from information on the 15-mer (5'-GGTTGGTGTGGTTGG-3') thrombin binding aptamer (TBA), a library of mutated DNA sequences (994 elements) was generated and screened using Shapegauss, a shape-based scoring function from OpenEye software, to generate computationally derived binding scores. The TBA and three other mutated oligonucleotides, selected on the basis of their binding score (best, medium, worst), were incorporated into surface plasmon resonance (SPR) biosensors. By reducing the ionic strength (binding buffer, 50 mM Tris-HCl pH 7.4, 140 mM NaCl, 1 mM MgCl2, diluted 1:50) in order to match the simulated condition, the analytical performance of the four oligonucleotide sequences was compared using signal amplitude, sensitivity (slope), linearity (R²) and reproducibility (CVav %). The experimental results were in agreement with the simulation findings.
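Illustrative note: the abstract does not say how the 994 mutated sequences were generated. The sketch below shows one possible way to enumerate a mutant library from the TBA 15-mer, assuming plain base substitutions at one or two positions; this assumption gives 990 sequences, not 994, so it should not be read as the authors' scheme. The function name `substitution_mutants` is hypothetical.

```python
from itertools import combinations, product

TBA = "GGTTGGTGTGGTTGG"  # 15-mer thrombin binding aptamer quoted in the abstract


def substitution_mutants(parent, max_mutations=2, alphabet="ACGT"):
    """Enumerate every sequence differing from `parent` at 1..max_mutations positions."""
    mutants = set()
    for n in range(1, max_mutations + 1):
        for positions in combinations(range(len(parent)), n):
            # At each chosen position, allow only bases different from the parent's.
            choices = [[b for b in alphabet if b != parent[i]] for i in positions]
            for bases in product(*choices):
                seq = list(parent)
                for i, b in zip(positions, bases):
                    seq[i] = b
                mutants.add("".join(seq))
    return sorted(mutants)


single = substitution_mutants(TBA, max_mutations=1)
single_and_double = substitution_mutants(TBA, max_mutations=2)
```

Each sequence in such a library would then be scored against the target by a docking or shape-based scoring function, which is outside the scope of this sketch.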

Subject(s)

Aptamers, Nucleotide , Biosensing Techniques/methods , Aptamers, Nucleotide/chemistry , Aptamers, Nucleotide/genetics , Base Sequence , Binding Sites , Biosensing Techniques/statistics & numerical data , Computational Biology , Gene Library , Morpholines , Nuclear Magnetic Resonance, Biomolecular , Nucleic Acid Conformation , Protein Conformation , SELEX Aptamer Technique/statistics & numerical data , Surface Plasmon Resonance , Thrombin/chemistry

An aptamer-based chromatographic strip assay for sensitive toxin semi-quantitative detection.

Wang, Libing; Ma, Wenwei; Chen, Wei; Liu, Liqiang; Ma, Wei; Zhu, Yingyue; Xu, Liguang; Kuang, Hua; Xu, Chuanlai.

Biosens Bioelectron ; 26(6): 3059-62, 2011 Feb 15.

Article in English | MEDLINE | ID: mdl-21167704

ABSTRACT

An aptamer-based chromatographic strip assay method for rapid toxin detection was developed. The assay was based on the competition for the aptamer between ochratoxin A and DNA probes. The sensing results indicated that the sensitivity of the aptamer-based strip was better than that of conventional antibody-based strips. The visual limit of detection (LOD) of the strip for qualitative detection was 1 ng/mL, while the LOD for semi-quantitative detection could reach as low as 0.18 ng/mL using a scanning reader. The recoveries of test samples ranged from 96% to 110%. All detections could be achieved in less than 10 min, indicating that the aptamer-based strip could be a potentially useful tool for rapid on-site detection.

Subject(s)

Aptamers, Nucleotide , Biosensing Techniques/instrumentation , Biosensing Techniques/methods , SELEX Aptamer Technique/instrumentation , SELEX Aptamer Technique/methods , Toxins, Biological/analysis , Aptamers, Nucleotide/genetics , Base Sequence , Biosensing Techniques/statistics & numerical data , DNA Probes/genetics , Food Contamination/analysis , Gold , Limit of Detection , Metal Nanoparticles , Microtechnology , Nanotechnology , Ochratoxins/analysis , SELEX Aptamer Technique/statistics & numerical data , Wine/analysis

Subtractive SELEX against two heterogeneous target samples: numerical simulations and analysis.

Chen, Chi-Kan; Kuo, Tzy-Ling; Chan, Po-Chou; Lin, Lung-Ying.

Comput Biol Med ; 37(6): 750-9, 2007 Jun.

Article in English | MEDLINE | ID: mdl-16920093

ABSTRACT

Systematic evolution of ligands by exponential enrichment (SELEX) is a revolutionary technology that integrates combinatorial chemistry with high-throughput screening to generate high-affinity nucleic acid ligands (aptamers) for targets of interest from synthesized nucleic acid ligand libraries. Recently, SELEX experiments have advanced from screening ligand libraries against a single purified target to screening against multiple heterogeneous target samples. Although it has the potential to bring enormous technical and economic advantages to drug discovery, the new application suffers from unpredictable performance. To gain insight into the new method, we develop a computer model to numerically analyze subtractive SELEX applied alternately against two distinct heterogeneous samples of unknown targets. The model features the discretization of the ligand library, the ligand-target binding equilibrium equations, and the separation efficiency of bound and unbound ligands in experiments. By computer simulations, we investigate how aptamers for desired targets embedded in undefined target mixtures are generated under different experimental conditions. We find that the iterative screening scheme is fundamentally capable of developing desired aptamers. On the other hand, target sample configuration and separation efficiency together may significantly diversify the screening dynamics and results.
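Illustrative note: the three model ingredients named in the abstract (a discretized library, binding equilibrium, and separation efficiency) can be sketched in a few lines. This is a toy model under stated assumptions, not the authors' simulation: each sample is reduced to a single effective target concentration, binding follows [T] / ([T] + Kd) with target in excess, amplification is modeled as rescaling the pool, and all names and numbers are hypothetical.

```python
def selex_round(pool, target_conc, kds, keep_bound, efficiency):
    """One partition step on a discretized pool.

    pool: {ligand_class: copy number}; kds: {ligand_class: Kd against this sample}.
    keep_bound=True keeps binders (selection); False keeps non-binders (subtraction).
    efficiency: fraction of the unwanted species actually removed (1.0 = perfect).
    """
    new_pool = {}
    for cls, n in pool.items():
        bound = n * target_conc / (target_conc + kds[cls])
        unbound = n - bound
        wanted, unwanted = (bound, unbound) if keep_bound else (unbound, bound)
        new_pool[cls] = wanted + (1.0 - efficiency) * unwanted
    return new_pool


def normalize(pool, total):
    """Model amplification as rescaling the pool back to `total` copies."""
    s = sum(pool.values())
    return {cls: n * total / s for cls, n in pool.items()}


def subtractive_selex(pool, conc, kd_target, kd_control, rounds, efficiency):
    """Alternate selection on the target sample with subtraction on the control sample."""
    total = sum(pool.values())
    for _ in range(rounds):
        pool = selex_round(pool, conc, kd_target, keep_bound=True, efficiency=efficiency)
        pool = selex_round(pool, conc, kd_control, keep_bound=False, efficiency=efficiency)
        pool = normalize(pool, total)
    return pool


# Toy usage: "aptamer" binds only the target sample, "shared" binds both samples.
start = {"aptamer": 1.0, "shared": 1.0, "background": 1e4}
kd_target = {"aptamer": 1.0, "shared": 1.0, "background": 1000.0}
kd_control = {"aptamer": 1000.0, "shared": 1.0, "background": 1000.0}
final = subtractive_selex(start, 10.0, kd_target, kd_control, rounds=6, efficiency=0.9)
```

Lowering `efficiency` or changing the Kd values against the control sample changes how quickly, and whether, the desired class comes to dominate, which is the kind of sensitivity the abstract reports.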

Subject(s)

SELEX Aptamer Technique/statistics & numerical data , Aptamers, Nucleotide/chemical synthesis , Aptamers, Nucleotide/metabolism , Computer Simulation , Kinetics , Ligands , Models, Statistical
