Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 20
Filter
1.
Mol Ther ; 32(2): 372-383, 2024 Feb 07.
Article in English | MEDLINE | ID: mdl-38053334

ABSTRACT

Epidermolysis bullosa simplex (EBS) is a rare skin disease inherited mostly in an autosomal dominant manner. Patients display a skin fragility that leads to blisters and erosions caused by minor mechanical trauma. EBS phenotypic and genotypic variants are caused by genetic defects in intracellular proteins whose function is to provide the attachment of basal keratinocytes to the basement membrane zone and most EBS cases display mutations in keratin 5 (KRT5) and keratin 14 (KRT14) genes. Besides palliative treatments, there is still no long-lasting effective cure to correct the mutant gene and abolish the dominant negative effect of the pathogenic protein over its wild-type counterpart. Here, we propose a molecular strategy for EBS01 patient's keratinocytes carrying a monoallelic c.475/495del21 mutation in KRT14 exon 1. Through the CRISPR-Cas9 system, we perform a specific cleavage only on the mutant allele and restore a normal cellular phenotype and a correct intermediate filament network, without affecting the epidermal stem cell, referred to as holoclones, which play a crucial role in epidermal regeneration.


Subject(s)
Epidermolysis Bullosa Simplex , Humans , Epidermolysis Bullosa Simplex/genetics , Epidermolysis Bullosa Simplex/therapy , Epidermolysis Bullosa Simplex/metabolism , Alleles , CRISPR-Cas Systems , Keratinocytes/metabolism , Mutation , Stem Cells/metabolism
2.
Oncoimmunology ; 12(1): 2170095, 2023.
Article in English | MEDLINE | ID: mdl-36733497

ABSTRACT

Indoleamine 2,3 dioxygenase 1 (IDO1), a leader tryptophan-degrading enzyme, represents a recognized immune checkpoint molecule. In neoplasia, IDO1 is often highly expressed in dendritic cells infiltrating the tumor and/or in tumor cells themselves, particularly in human melanoma. In dendritic cells, IDO1 does not merely metabolize tryptophan into kynurenine but, after phosphorylation of critical tyrosine residues in the non-catalytic small domain, it triggers a signaling pathway prolonging its immunoregulatory effects by a feed-forward mechanism. We here investigated whether the non-enzymatic function of IDO1 could also play a role in tumor cells by using B16-F10 mouse melanoma cells transfected with either the wild-type Ido1 gene (Ido1WT ) or a mutated variant lacking the catalytic, but not signaling activity (Ido1H350A ). As compared to the Ido1WT -transfected counterpart (B16WT), B16-F10 cells expressing Ido1H350A (B16H350A) were characterized by an in vitro accelerated growth mediated by increased Ras and Erk activities. Faster growth and malignant progression of B16H350A cells, also detectable in vivo, were found to be accompanied by a reduction in tumor-infiltrating CD8+ T cells and an increase in Foxp3+ regulatory T cells. Our data, therefore, suggest that the IDO1 signaling function can also occur in tumor cells and that alternative therapeutic approach strategies should be undertaken to effectively tackle this important immune checkpoint molecule.


Subject(s)
Melanoma, Experimental , Tryptophan , Mice , Humans , Animals , CD8-Positive T-Lymphocytes/metabolism , Indoleamine-Pyrrole 2,3,-Dioxygenase/genetics , Indoleamine-Pyrrole 2,3,-Dioxygenase/metabolism , Immune Checkpoint Proteins , Melanoma, Experimental/genetics , Signal Transduction
3.
Breast Cancer Res Treat ; 176(2): 271-289, 2019 Jul.
Article in English | MEDLINE | ID: mdl-31006104

ABSTRACT

PURPOSE: Primary breast and prostate cancers can be cured, but metastatic disease cannot. Identifying cell factors that predict metastatic potential could guide both prognosis and treatment. METHODS: We used Cell-SELEX to screen an RNA aptamer library for differential binding to prostate cancer cell lines with high vs. low metastatic potential. Mass spectroscopy, immunoblot, and immunohistochemistry were used to identify and validate aptamer targets. Aptamer properties were tested in vitro, in xenograft models, and in clinical biopsies. Gene expression datasets were queried for target associations in cancer. RESULTS: We identified a novel aptamer (Apt63) that binds to the beta subunit of F1Fo ATP synthase (ATP5B), present on the plasma membrane of certain normal and cancer cells. Apt63 bound to plasma membranes of multiple aggressive breast and prostate cell lines, but not to normal breast and prostate epithelial cells, and weakly or not at all to non-metastasizing cancer cells; binding led to rapid cell death. A single intravenous injection of Apt63 induced rapid, tumor cell-selective binding and cytotoxicity in MDA-MB-231 xenograft tumors, associated with endonuclease G nuclear translocation and DNA fragmentation. Apt63 was not toxic to non-transformed epithelial cells in vitro or adjacent normal tissue in vivo. In breast cancer tissue arrays, plasma membrane staining with Apt63 correlated with tumor stage (p < 0.0001, n = 416) and was independent of other cancer markers. Across multiple datasets, ATP5B expression was significantly increased relative to normal tissue, and negatively correlated with metastasis-free (p = 0.0063, 0.00039, respectively) and overall (p = 0.050, 0.0198) survival. CONCLUSION: Ecto-ATP5B binding by Apt63 may disrupt an essential survival mechanism in a subset of tumors with high metastatic potential, and defines a novel category of cancers with potential vulnerability to ATP5B-targeted therapy. Apt63 is a unique tool for elucidating the function of surface ATP synthase, and potentially for predicting and treating metastatic breast and prostate cancer.


Subject(s)
Aptamers, Nucleotide/administration & dosage , Breast Neoplasms/pathology , Cell Membrane/metabolism , Mitochondrial Proton-Translocating ATPases/genetics , Mitochondrial Proton-Translocating ATPases/metabolism , Prostatic Neoplasms/pathology , Administration, Intravenous , Animals , Aptamers, Nucleotide/pharmacology , Biomarkers, Tumor/genetics , Biomarkers, Tumor/metabolism , Breast Neoplasms/drug therapy , Breast Neoplasms/genetics , Breast Neoplasms/metabolism , Cell Line, Tumor , Cell Proliferation/drug effects , Cell Survival/drug effects , Early Detection of Cancer , Female , Gene Expression Regulation, Neoplastic/drug effects , Humans , MCF-7 Cells , Male , Mice , Mitochondrial Proton-Translocating ATPases/antagonists & inhibitors , Neoplasm Staging , Prostatic Neoplasms/drug therapy , Prostatic Neoplasms/genetics , Prostatic Neoplasms/metabolism , SELEX Aptamer Technique , Treatment Outcome , Up-Regulation , Xenograft Model Antitumor Assays
4.
Blood Cancer J ; 6(9): e468, 2016 09 09.
Article in English | MEDLINE | ID: mdl-27611921

ABSTRACT

Long non-coding RNAs (lncRNAs) represent a novel class of functional RNA molecules with an important emerging role in cancer. To elucidate their potential pathogenetic role in chronic lymphocytic leukemia (CLL), a biologically and clinically heterogeneous neoplasia, we investigated lncRNAs expression in a prospective series of 217 early-stage Binet A CLL patients and 26 different subpopulations of normal B-cells, through a custom annotation pipeline of microarray data. Our study identified a 24-lncRNA-signature specifically deregulated in CLL compared with the normal B-cell counterpart. Importantly, this classifier was validated on an independent data set of CLL samples. Belonging to the lncRNA signature characterizing distinct molecular CLL subgroups, we identified lncRNAs recurrently associated with adverse prognostic markers, such as unmutated IGHV status, CD38 expression, 11q and 17p deletions, and NOTCH1 mutations. In addition, correlation analyses predicted a putative lncRNAs interplay with genes and miRNAs expression. Finally, we generated a 2-lncRNA independent risk model, based on lnc-IRF2-3 and lnc-KIAA1755-4 expression, able to distinguish three different prognostic groups in our series of early-stage patients. Overall, our study provides an important resource for future studies on the functions of lncRNAs in CLL, and contributes to the discovery of novel molecular markers with clinical relevance associated with the disease.


Subject(s)
Gene Expression Profiling , Leukemia, Lymphocytic, Chronic, B-Cell/genetics , Leukemia, Lymphocytic, Chronic, B-Cell/mortality , RNA, Long Noncoding , Transcriptome , B-Lymphocytes/metabolism , B-Lymphocytes/pathology , Cluster Analysis , Disease Progression , Gene Expression Regulation, Leukemic , Humans , Kaplan-Meier Estimate , Leukemia, Lymphocytic, Chronic, B-Cell/pathology , MicroRNAs/genetics , Neoplasm Staging , Prognosis , RNA Interference
5.
Bioinformatics ; 32(2): 161-4, 2016 Jan 15.
Article in English | MEDLINE | ID: mdl-26395772

ABSTRACT

MOTIVATION: Aptamers are synthetic nucleic acid molecules that can bind biological targets in virtue of both their sequence and three-dimensional structure. Aptamers are selected using SELEX, Systematic Evolution of Ligands by EXponential enrichment, a technique that exploits aptamer-target binding affinity. The SELEX procedure, coupled with high-throughput sequencing (HT-SELEX), creates billions of random sequences capable of binding different epitopes on specific targets. Since this technique produces enormous amounts of data, computational analysis represents a critical step to screen and select the most biologically relevant sequences. RESULTS: Here, we present APTANI, a computational tool to identify target-specific aptamers from HT-SELEX data and secondary structure information. APTANI builds on AptaMotif algorithm, originally implemented to analyze SELEX data; extends the applicability of AptaMotif to HT-SELEX data and introduces new functionalities, as the possibility to identify binding motifs, to cluster aptamer families or to compare output results from different HT-SELEX cycles. Tabular and graphical representations facilitate the downstream biological interpretation of results. AVAILABILITY AND IMPLEMENTATION: APTANI is available at http://aptani.unimore.it. CONTACT: silvio.bicciato@unimore.it SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
Aptamers, Nucleotide/chemistry , High-Throughput Nucleotide Sequencing , SELEX Aptamer Technique/methods , Software , Algorithms , Nucleotide Motifs
6.
Cell Death Dis ; 1: e28, 2010.
Article in English | MEDLINE | ID: mdl-21364636

ABSTRACT

Hematopoiesis entails a series of hierarchically organized events that proceed throughout cell specification and terminates with cell differentiation. Commitment needs the transcription factors' effort, which, in concert with microRNAs, drives cell fate and responds to promiscuous patterns of gene expression by turning on lineage-specific genes and repressing alternate lineage transcripts. We obtained microRNA profiles from human CD34+ hematopoietic progenitor cells and in vitro differentiated erythroblasts, megakaryoblasts, monoblasts and myeloblast precursors that we analyzed together with their gene expression profiles. The integrated analysis of microRNA-mRNA expression levels highlighted an inverse correlation between microRNAs specifically upregulated in one single-cell progeny and their putative target genes, which resulted in downregulation. Among the upregulated lineage-enriched microRNAs, hsa-miR-299-5p emerged as having a role in controlling CD34+ progenitor fate, grown in multilineage culture conditions. Gain- and loss-of-function experiments revealed that hsa-miR-299-5p participates in the regulation of hematopoietic progenitor fate, modulating megakaryocytic-granulocytic versus erythroid-monocytic differentiation.


Subject(s)
Antigens, CD34/metabolism , Cell Lineage/genetics , Gene Expression Profiling , Hematopoietic Stem Cells/metabolism , MicroRNAs/metabolism , Myelopoiesis/genetics , Cell Differentiation/genetics , Cell Line , Gene Expression Regulation , Hematopoietic Stem Cells/cytology , Humans , MicroRNAs/genetics , RNA, Messenger/genetics , RNA, Messenger/metabolism , Transfection
9.
Br J Dermatol ; 156(1): 62-71, 2007 Jan.
Article in English | MEDLINE | ID: mdl-17199568

ABSTRACT

BACKGROUND: It is generally accepted that sunlight may contribute to the development of melanoma. OBJECTIVES: To analyse gene expression of melanocytes obtained from clinically unaffected skin of patients with melanoma and healthy controls before and after exposure to ultraviolet B radiation. METHODS: Using GeneChip array technology, the gene expression of melanocytes obtained from the two donor groups was profiled, in order to identify transcriptional differences affecting susceptibility to melanoma. RESULTS: The data collected did not show any difference between the expression profiles of melanocytes purified from normal donors and from patients with melanoma that was able to give a statistically significant class separation. However, by means of unsupervised clustering our data could be divided into two main classes. The first class included the transcriptome profiles of melanocytes obtained from skin samples of patients with a vertical growth phase (VGP) melanoma, while the second class included the transcriptome profiles of melanocytes obtained from skin samples of patients with a radial growth phase (RGP) melanoma. CONCLUSIONS: These data suggest that melanocytes in patients with VGP and RGP melanomas show significant differences in gene expression profiles, which allow us to classify patients with melanoma also from clinically unaffected skin.


Subject(s)
Melanocytes , Melanoma/genetics , Skin Neoplasms/genetics , Ultraviolet Therapy/adverse effects , Adult , Aged , Case-Control Studies , Cell Growth Processes , Child , Child, Preschool , Female , Gene Expression Regulation, Neoplastic , Humans , Male , Melanoma/pathology , Melanoma/radiotherapy , Middle Aged , Neoplasm Metastasis/diagnosis , Skin Neoplasms/pathology , Skin Neoplasms/radiotherapy , Sunlight/adverse effects , Transcription, Genetic , Tumor Cells, Cultured
10.
Bioinformatics ; 22(21): 2658-66, 2006 Nov 01.
Article in English | MEDLINE | ID: mdl-16951291

ABSTRACT

MOTIVATION: The systematic integration of expression profiles and other types of gene information, such as chromosomal localization, ontological annotations and sequence characteristics, still represents a challenge in the gene expression arena. In particular, the analysis of transcriptional data in context of the physical location of genes in a genome appears promising in detecting chromosomal regions with transcriptional imbalances often characterizing cancer. RESULTS: A computational tool named locally adaptive statistical procedure (LAP), which incorporates transcriptional data and structural information for the identification of differentially expressed chromosomal regions, is described. LAP accounts for variations in the distance between genes and in gene density by smoothing standard statistics on gene position before testing the significance of their differential levels of gene expression. The procedure smooths parameters and computes p-values locally to account for the complex structure of the genome and to more precisely estimate the differential expression of chromosomal regions. The application of LAP to three independent sets of raw expression data allowed identifying differentially expressed regions that are directly involved in known chromosomal aberrations characteristic of tumors. AVAILABILITY: Functions in R for implementing the LAP method are available at http://www.dpci.unipd.it/Bioeng/Publications/LAP.htm


Subject(s)
Algorithms , Chromosome Mapping/methods , Gene Expression Profiling/methods , Gene Expression/genetics , Transcription Factors/genetics , Computer Simulation , Data Interpretation, Statistical , Models, Genetic , Models, Statistical
11.
Leukemia ; 20(10): 1751-8, 2006 Oct.
Article in English | MEDLINE | ID: mdl-16932344

ABSTRACT

Acute myeloid leukemia (AML) blasts are immature committed myeloid cells unable to spontaneously undergo terminal maturation, and characterized by heterogeneous sensitivity to natural differentiation inducers. Here, we show a molecular signature predicting the resistance or sensitivity of six myeloid cell lines to differentiation induced in vitro with retinoic acid or vitamin D. The identified signature was further validated by TaqMan assay for the prediction of response to an in vitro differentiation assay performed on 28 freshly isolated AML blast populations. The TaqMan assay successfully predicts the in vitro resistance or responsiveness of AML blasts to differentiation inducers. Furthermore, performing a meta-analysis of publicly available microarray data sets, we also show the accuracy of our prediction on known phenotypes and suggest that our signature could become useful for the identification of patients eligible for new therapeutic strategies.


Subject(s)
Antineoplastic Agents/pharmacology , Drug Resistance, Neoplasm/genetics , Leukemia, Myeloid/drug therapy , Leukemia, Myeloid/genetics , Tretinoin/pharmacology , Acute Disease , Cell Differentiation/drug effects , Cell Line, Tumor , Cluster Analysis , Databases, Factual , Gene Expression Regulation, Leukemic/drug effects , Humans , Leukemia, Myeloid/pathology , Meta-Analysis as Topic , Oligonucleotide Array Sequence Analysis , Predictive Value of Tests , Reverse Transcriptase Polymerase Chain Reaction , Vitamin D/pharmacology , Vitamins/pharmacology
12.
Nucleic Acids Res ; 34(7): e56, 2006 Apr 14.
Article in English | MEDLINE | ID: mdl-16617143

ABSTRACT

Single nucleotide polymorphisms (SNPs) are often determined using TaqMan real-time PCR assays (Applied Biosystems) and commercial software that assigns genotypes based on reporter probe signals at the end of amplification. Limitations to the large-scale application of this approach include the need for positive controls or operator intervention to set signal thresholds when one allele is rare. In the interest of optimizing real-time PCR genotyping, we developed an algorithm for automatic genotype calling based on the full course of real-time PCR data. Best cycle genotyping algorithm (BCGA), written in the open source language R, is based on the assumptions that classification depends on the time (cycle) of amplification and that it is possible to identify a best discriminating cycle for each SNP assay. The algorithm is unique in that it classifies samples according to the behavior of blanks (no DNA samples), which cluster with heterozygous samples. This method of classification eliminates the need for positive controls and permits accurate genotyping even in the absence of a genotype class, for example when one allele is rare. Here, we describe the algorithm and test its validity, compared to the standard end-point method and to DNA sequencing.


Subject(s)
Algorithms , Polymerase Chain Reaction/methods , Polymorphism, Single Nucleotide , Genotype , Humans
14.
Methods Inf Med ; 43(1): 4-8, 2004.
Article in English | MEDLINE | ID: mdl-15026826

ABSTRACT

OBJECTIVES: High-throughput technologies are radically boosting the understanding of living systems, thus creating enormous opportunities to elucidate the biological processes of cells in different physiological states. In particular, the application of DNA micro-arrays to monitor expression profiles from tumor cells is improving cancer analysis to levels that classical methods have been unable to reach. However, molecular diagnostics based on expression profiling requires addressing computational issues as the overwhelming number of variables and the complex, multi-class nature of tumor samples. Thus, the objective of the present research has been the development of a computational procedure for feature extraction and classification of gene expression data. METHODS: The Soft Independent Modeling of Class Analogy (SIMCA) approach has been implemented in a data mining scheme, which allows the identification of those genes that are most likely to confer robust and accurate classification of samples from multiple tumor types. RESULTS: The proposed method has been tested on two different microarray data sets, namely Golub's analysis of acute human leukemia and the small round blue cell tumors study presented by Khan et al.. The identified features represent a rational and dimensionally reduced base for understanding the biology of diseases, defining targets of therapeutic intervention, and developing diagnostic tools for classification of pathological states. CONCLUSIONS: The analysis of the SIMCA model residuals allows the identification of specific phenotype markers. At the same time, the class analogy approach provides the assignment to multiple classes, such as different pathological conditions or tissue samples, for previously unseen instances.


Subject(s)
Biomarkers, Tumor/physiology , Databases, Genetic , Gene Expression Profiling/methods , Leukemia/classification , Leukemia/genetics , Oligonucleotide Array Sequence Analysis/classification , Pattern Recognition, Automated , Principal Component Analysis , Biomarkers, Tumor/genetics , Computational Biology , DNA, Neoplasm/classification , DNA, Neoplasm/physiology , Data Interpretation, Statistical , Gene Expression Profiling/statistics & numerical data , Humans , Phenotype , Sequence Analysis, DNA
15.
Leukemia ; 17(8): 1557-65, 2003 Aug.
Article in English | MEDLINE | ID: mdl-12886243

ABSTRACT

Precursor B-acute lymphoblastic leukemia (pB-ALL) is a heterogeneous disease and multiparameter flow cytometry, molecular genetics, and cytogenetic studies have all contributed to classification of subgroups with prognostic significance. Recently, gene expression microarray technology has been used to investigate lymphoblastic leukemias, demonstrating that known and novel pB-ALL subclasses can be separated on the basis of gene expression profiles. The strength of microarray technique lays in part in the multivariate nature of the expression data. We propose a parallel multiparametric approach based on immunophenotypic flow-cytometry expression data for the analysis of leukemia patients. Specifically, we tested the potential of this approach on a data set of 145 samples of pediatric pB-ALL that included 46 samples positive for mixed lineage leukemia (MLL) translocations (MLL+) and 99 control pB-ALLs, negative for this translocation (MLL-). The expression levels of 16 marker proteins have been monitored by four-color flow cytometry using a standardized diagnostic panel of antibodies. The protein expression database has been then analyzed using those univariate and multivariate computational techniques normally applied to mine and model large microarray data sets. Marker protein expression profiling not only allowed separating pB-ALL cases with an MLL rearrangement from other ALLs, but also demonstrates that MLL+ leukemias constitute a heterogeneous group in which MLL/AF4 leukemias represent a homogenous subclass described by a specific expression fingerprint.


Subject(s)
Computational Biology/methods , DNA-Binding Proteins/analysis , Immunophenotyping/methods , Nuclear Proteins/analysis , Precursor Cell Lymphoblastic Leukemia-Lymphoma/diagnosis , Proto-Oncogenes , Transcription Factors , Adolescent , Analysis of Variance , Antigens, CD/analysis , Biomarkers/analysis , Child , Child, Preschool , Diagnosis, Differential , Flow Cytometry , Histone-Lysine N-Methyltransferase , Humans , Infant , Myeloid-Lymphoid Leukemia Protein , Precursor B-Cell Lymphoblastic Leukemia-Lymphoma/classification , Precursor B-Cell Lymphoblastic Leukemia-Lymphoma/diagnosis , Precursor B-Cell Lymphoblastic Leukemia-Lymphoma/pathology , Precursor Cell Lymphoblastic Leukemia-Lymphoma/classification , Precursor Cell Lymphoblastic Leukemia-Lymphoma/pathology , Retrospective Studies , Sensitivity and Specificity , Transcriptional Elongation Factors , Translocation, Genetic
16.
Bioinformatics ; 19(5): 571-8, 2003 Mar 22.
Article in English | MEDLINE | ID: mdl-12651714

ABSTRACT

MOTIVATION: Microarray expression profiling appears particularly promising for a deeper understanding of cancer biology and to identify molecular signatures supporting the histological classification schemes of neoplastic specimens. However, molecular diagnostics based on microarray data presents major challenges due to the overwhelming number of variables and the complex, multiclass nature of tumor samples. Thus, the development of marker selection methods, that allow the identification of those genes that are most likely to confer high classification accuracy of multiple tumor types, and of multiclass classification schemes is of paramount importance. RESULTS: A computational procedure for marker identification and for classification of multiclass gene expression data through the application of disjoint principal component models is described. The identified features represent a rational and dimensionally reduced base for understanding the basic biology of diseases, defining targets for therapeutic intervention, and developing diagnostic tools for the identification and classification of multiple pathological states. The method has been tested on different microarray data sets obtained from various human tumor samples. The results demonstrate that this procedure allows the identification of specific phenotype markers and can classify previously unseen instances in the presence of multiple classes.


Subject(s)
Biomarkers, Tumor/genetics , Gene Expression Profiling/methods , Models, Genetic , Models, Statistical , Neoplasms/genetics , Oligonucleotide Array Sequence Analysis/methods , Principal Component Analysis/methods , Acute Disease , Algorithms , Child , Child, Preschool , Gene Expression Regulation, Neoplastic/genetics , Humans , Infant , Infant, Newborn , Leukemia, Myeloid/classification , Leukemia, Myeloid/genetics , Lymphoma, Non-Hodgkin/classification , Lymphoma, Non-Hodgkin/genetics , Neoplasms/classification , Neuroblastoma/classification , Neuroblastoma/genetics , Precursor Cell Lymphoblastic Leukemia-Lymphoma/classification , Precursor Cell Lymphoblastic Leukemia-Lymphoma/genetics , Rhabdomyosarcoma/classification , Rhabdomyosarcoma/genetics , Sarcoma, Ewing/classification , Sarcoma, Ewing/genetics
17.
Metab Eng ; 2(3): 218-27, 2000 Jul.
Article in English | MEDLINE | ID: mdl-11056064

ABSTRACT

Large volumes of data are routinely collected during bioprocess operations and, more recently, in basic biological research using genomics-based technologies. While these data often lack sufficient detail to be used for mechanism identification, it is possible that the underlying mechanisms affecting cell phenotype or process outcome are reflected as specific patterns in the overall or temporal sensor logs. This raises the possibility of identifying outcome-specific fingerprints that can be used for process or phenotype classification and the identification of discriminating characteristics, such as specific genes or process variables. The aim of this work is to provide a systematic approach to identifying and modeling patterns in historical records and using this information for process classification. This approach differs from others in that emphasis is placed on analyzing the data structure first and thereby extracting potentially relevant features prior to model creation. The initial step in this overall approach is to first identify the discriminating features of the relevant measurements and time windows, which can then be subsequently used to discriminate among different classes of process behavior. This is achieved via a mean hypothesis testing algorithm. Next, the homogeneity of the multivariate data in each class is explored via a novel cluster analysis technique called PC1 Time Series Clustering to ensure that the data subsets used accurately reflect the variability displayed in the historical records. This will be the topic of the second paper in this series. We present here the method for identifying discriminating features in data via mean hypothesis testing along with results from the analysis of case studies from industrial fermentations


Subject(s)
Computational Biology , Data Interpretation, Statistical , Algorithms , Biomedical Engineering , Cluster Analysis , Decision Trees , Fermentation , Models, Biological , Models, Statistical
18.
Metab Eng ; 2(3): 228-38, 2000 Jul.
Article in English | MEDLINE | ID: mdl-11056065

ABSTRACT

An important step in data analysis is class assignment which is usually done on the basis of a macroscopic phenotypic or bioprocess characteristic, such as high vs low growth, healthy vs diseased state, or high vs. low productivity. Unfortunately, such an assignment may lump together samples, which when derived from a more detailed phenotypic or bioprocess description are dissimilar, giving rise to models of lower quality and predictive power. In this paper we present a clustering algorithm for data preprocessing which involves the identification of fundamentally similar lots on the basis of the extent of similarity among the system variables. The algorithm combines aspects of cluster analysis and principal component analysis by applying agglomerative clustering methods to the first principal component of the system data matrix. As part of a rational strategy for developing empirical models, this technique selects lots (samples) which are most appropriate for inclusion in a training set by analyzing multivariate data homogeneity. Samples with similar data structures are identified and grouped together into distinct clusters. This knowledge is used in the formation of potential training sets. Additionally, this technique can identify atypical lots, i.e., samples that are not simply outliers but exhibit the general properties of one class but have been given the assignment of the other. The method is presented along with examples from its application to fermentation data sets.


Subject(s)
Computational Biology , Data Interpretation, Statistical , Algorithms , Biomedical Engineering , Cluster Analysis , Fermentation , Models, Biological , Models, Statistical
19.
J Pept Res ; 50(3): 231-7, 1997 Sep.
Article in English | MEDLINE | ID: mdl-9309587

ABSTRACT

This paper describes a new method for the evaluation of conductimetric data collected during the in-line monitoring of the coupling step in solid-phase peptide synthesis. The control scheme relies on a feed-forward artificial neural network algorithm which can predict the final yield of the reaction within its initial 5 min by analyzing the conductivity signal profile. The yield values predicted by the artificial neural network algorithm result in good accordance with the data obtained by the commonly used ninhydrin test.


Subject(s)
Algorithms , Peptides/chemical synthesis , Electric Conductivity , Neural Networks, Computer , Ninhydrin , Peptides/analysis , Photometry , Signal Processing, Computer-Assisted
20.
J Pept Res ; 49(1): 103-11, 1997 Jan.
Article in English | MEDLINE | ID: mdl-9128106

ABSTRACT

The H-Ala-Arg-(Ala)6-Lys-OH sequence is a biologically interesting 'difficult sequence' presenting N alpha-Fmoc deprotection and coupling problems. Different chemical conditions and synthetic strategies have been tested in order to overcome the problems due to sequence-dependent interactions. In particular, it was confirmed that different solvents in the deprotection step did not provide any significant improvement, but the use of a more efficient base in the deprotection mixture avoided insufficient unblocking of N alpha-protecting group; problems due to partial coupling in the last steps of the synthesis were solved by double coupling techniques. Moreover, the synthesis of the model peptide was carried out using both "continuous flow' and "batch' techniques. The present results demonstrate that on-line monitoring of the deprotection step by absorbance measurements represents a very effective tool to detect the onset of internal aggregations during the synthesis.


Subject(s)
Oligopeptides/chemical synthesis , Chromatography, High Pressure Liquid , Methods , Oligopeptides/chemistry
SELECTION OF CITATIONS
SEARCH DETAIL
...