Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 12 de 12
Filter
1.
Front Immunol ; 5: 597, 2014.
Article in English | MEDLINE | ID: mdl-25505899

ABSTRACT

Human leukocyte antigens (HLA) are important biomarkers because multiple diseases, drug toxicity, and vaccine responses reveal strong HLA associations. Current clinical HLA typing is an elimination process requiring serial testing. We present an alternative in situ synthesized DNA-based microarray method that contains hundreds of thousands of probes representing a complete overlapping set covering 1,610 clinically relevant HLA class I alleles accompanied by computational tools for assigning HLA type to 4-digit resolution. Our proof-of-concept experiment included 21 blood samples, 18 cell lines, and multiple controls. The method is accurate, robust, and amenable to automation. Typing errors were restricted to homozygous samples or those with very closely related alleles from the same locus, but readily resolved by targeted DNA sequencing validation of flagged samples. High-throughput HLA typing technologies that are effective, yet inexpensive, can be used to analyze the world's populations, benefiting both global public health and personalized health care.

2.
Chinese Medical Journal ; (24): 3891-3896, 2013.
Article in English | WPRIM (Western Pacific) | ID: wpr-236143

ABSTRACT

<p><b>BACKGROUND</b>Olfactory ensheathing cell (OEC) transplantation is a promising or potential therapy for spinal cord injury (SCI). However, the effects of injecting OECs directly into SCI site have been limited and unsatisfied due to the complexity of SCI. To improve the outcome, proper biomaterials are thought to be helpful since these materials would allow the cells to grow three-dimensionally and guide cell migration.</p><p><b>METHODS</b>In this study, we made a new peptide hydrogel scaffold named GRGDSPmx by mixing the pure RADA16 and designer peptide RADA16-GRGDSP solution, and we examined the molecular integration of the mixed nanofiber scaffolds using atomic force microscopy. In addition, we have studied the behavior of OECs in GRGDSPmx condition as well as on RADA16 scaffold by analyzing their phenotypes including cell proliferation, apoptosis, survival, and morphology.</p><p><b>RESULTS</b>The experimental results showed that GRGDSPmx could be self-assembled to form a hydrogel. Inverted optical microscopic and scanning electron microscopic analyses showed that OECs are viable and they proliferate within the nanostructured environment of the scaffold. Thiazolyl blue (MTT) assay demonstrated that OEC proliferation rate was increased on GRGDSPmx scaffold compared with the pure RADA16 scaffold. In addition, OECs on GRGDSPmx scaffolds also showed less apoptosis and maintained the original spindle-shaped morphology. Calcein-AM/PI fluorescence staining revealed that OECs cultured on GRGDSPmx grew well and the viable cell count was 95%.</p><p><b>CONCLUSION</b>These results suggested that this new hydrogel scaffold provided an ideal substrate for OEC three-dimensional culture and suggested its further application for SCI repair.</p>


Subject(s)
Animals , Male , Rats , Cell Proliferation , Cells, Cultured , Hydrogel, Polyethylene Glycol Dimethacrylate , Chemistry , Immunohistochemistry , Microscopy, Atomic Force , Microscopy, Confocal , Olfactory Bulb , Cell Biology , Peptides , Chemistry , Rats, Sprague-Dawley , Tissue Engineering , Methods , Tissue Scaffolds , Chemistry
3.
PLoS One ; 7(6): e38979, 2012.
Article in English | MEDLINE | ID: mdl-22723914

ABSTRACT

Multi-functional enzymes are enzymes that perform multiple physiological functions. Characterization and identification of multi-functional enzymes are critical for communication and cooperation between different functions and pathways within a complex cellular system or between cells. In present study, we collected literature-reported 6,799 multi-functional enzymes and systematically characterized them in structural, functional, and evolutionary aspects. It was found that four physiochemical properties, that is, charge, polarizability, hydrophobicity, and solvent accessibility, are important for characterization of multi-functional enzymes. Accordingly, a combinational model of support vector machine and random forest model was constructed, based on which 6,956 potential novel multi-functional enzymes were successfully identified from the ENZYME database. Moreover, it was observed that multi-functional enzymes are non-evenly distributed in species, and that Bacteria have relatively more multi-functional enzymes than Archaebacteria and Eukaryota. Comparative analysis indicated that the multi-functional enzymes experienced a fluctuation of gene gain and loss during the evolution from S. cerevisiae to H. sapiens. Further pathway analyses indicated that a majority of multi-functional enzymes were well preserved in catalyzing several essential cellular processes, for example, metabolisms of carbohydrates, nucleotides, and amino acids. What's more, a database of known multi-functional enzymes and a server for novel multi-functional enzyme prediction were also constructed for free access at http://bioinf.xmu.edu.cn/databases/MFEs/index.htm.


Subject(s)
Enzymes/metabolism , Support Vector Machine , Databases, Protein , Enzymes/chemistry , Models, Biological
4.
J Immunol Methods ; 374(1-2): 18-25, 2011 Nov 30.
Article in English | MEDLINE | ID: mdl-21782820

ABSTRACT

The immune system is characterized by high combinatorial complexity that necessitates the use of specialized computational tools for analysis of immunological data. Machine learning (ML) algorithms are used in combination with classical experimentation for the selection of vaccine targets and in computational simulations that reduce the number of necessary experiments. The development of ML algorithms requires standardized data sets, consistent measurement methods, and uniform scales. To bridge the gap between the immunology community and the ML community, we designed a repository for machine learning in immunology named Dana-Farber Repository for Machine Learning in Immunology (DFRMLI). This repository provides standardized data sets of HLA-binding peptides with all binding affinities mapped onto a common scale. It also provides a list of experimentally validated naturally processed T cell epitopes derived from tumor or virus antigens. The DFRMLI data were preprocessed and ensure consistency, comparability, detailed descriptions, and statistically meaningful sample sizes for peptides that bind to various HLA molecules. The repository is accessible at http://bio.dfci.harvard.edu/DFRMLI/.


Subject(s)
Allergy and Immunology/statistics & numerical data , Artificial Intelligence , Academies and Institutes , Algorithms , Boston , Databases, Factual , Epitope Mapping/methods , Epitopes, T-Lymphocyte/metabolism , HLA Antigens/metabolism , Humans , Peptides/metabolism , Protein Binding
5.
Bioinform Biol Insights ; 1: 19-47, 2009 Nov 24.
Article in English | MEDLINE | ID: mdl-20066123

ABSTRACT

Various computational methods have been used for the prediction of protein and peptide function based on their sequences. A particular challenge is to derive functional properties from sequences that show low or no homology to proteins of known function. Recently, a machine learning method, support vector machines (SVM), have been explored for predicting functional class of proteins and peptides from amino acid sequence derived properties independent of sequence similarity, which have shown promising potential for a wide spectrum of protein and peptide classes including some of the low- and non-homologous proteins. This method can thus be explored as a potential tool to complement alignment-based, clustering-based, and structure-based methods for predicting protein function. This article reviews the strategies, current progresses, and underlying difficulties in using SVM for predicting the functional class of proteins. The relevant software and web-servers are described. The reported prediction performances in the application of these methods are also presented.

6.
BMC Bioinformatics ; 9 Suppl 12: S22, 2008 Dec 12.
Article in English | MEDLINE | ID: mdl-19091022

ABSTRACT

BACKGROUND: Initiation and regulation of immune responses in humans involves recognition of peptides presented by human leukocyte antigen class II (HLA-II) molecules. These peptides (HLA-II T-cell epitopes) are increasingly important as research targets for the development of vaccines and immunotherapies. HLA-II peptide binding studies involve multiple overlapping peptides spanning individual antigens, as well as complete viral proteomes. Antigen variation in pathogens and tumor antigens, and extensive polymorphism of HLA molecules increase the number of targets for screening studies. Experimental screening methods are expensive and time consuming and reagents are not readily available for many of the HLA class II molecules. Computational prediction methods complement experimental studies, minimize the number of validation experiments, and significantly speed up the epitope mapping process. We collected test data from four independent studies that involved 721 peptide binding assays. Full overlapping studies of four antigens identified binding affinity of 103 peptides to seven common HLA-DR molecules (DRB1*0101, 0301, 0401, 0701, 1101, 1301, and 1501). We used these data to analyze performance of 21 HLA-II binding prediction servers accessible through the WWW. RESULTS: Because not all servers have predictors for all tested HLA-II molecules, we assessed a total of 113 predictors. The length of test peptides ranged from 15 to 19 amino acids. We tried three prediction strategies - the best 9-mer within the longer peptide, the average of best three 9-mer predictions, and the average of all 9-mer predictions within the longer peptide. The best strategy was the identification of a single best 9-mer within the longer peptide. Overall, measured by the receiver operating characteristic method (AROC), 17 predictors showed good (AROC > 0.8), 41 showed marginal (AROC > 0.7), and 55 showed poor performance (AROC < 0.7). Good performance predictors included HLA-DRB1*0101 (seven), 1101 (six), 0401 (three), and 0701 (one). The best individual predictor was NETMHCIIPAN, closely followed by PROPRED, IEDB (Consensus), and MULTIPRED (SVM). None of the individual predictors was shown to be suitable for prediction of promiscuous peptides. Current predictive capabilities allow prediction of only 50% of actual T-cell epitopes using practical thresholds. CONCLUSION: The available HLA-II servers do not match prediction capabilities of HLA-I predictors. Currently available HLA-II prediction servers offer only a limited prediction accuracy and the development of improved predictors is needed for large-scale studies, such as proteome-wide epitope mapping. The requirements for accuracy of HLA-II binding predictions are stringent because of the substantial effect of false positives.


Subject(s)
Computational Biology/methods , Peptides/chemistry , Vaccines/chemistry , Algorithms , Antigens/chemistry , Binding Sites , Epitope Mapping , Epitopes/chemistry , Epitopes, T-Lymphocyte/chemistry , False Positive Reactions , Humans , Markov Chains , Models, Theoretical , Protein Binding , ROC Curve
7.
BMC Immunol ; 9: 8, 2008 Mar 16.
Article in English | MEDLINE | ID: mdl-18366636

ABSTRACT

BACKGROUND: Protein antigens and their specific epitopes are formulation targets for epitope-based vaccines. A number of prediction servers are available for identification of peptides that bind major histocompatibility complex class I (MHC-I) molecules. The lack of standardized methodology and large number of human MHC-I molecules make the selection of appropriate prediction servers difficult. This study reports a comparative evaluation of thirty prediction servers for seven human MHC-I molecules. RESULTS: Of 147 individual predictors 39 have shown excellent, 47 good, 33 marginal, and 28 poor ability to classify binders from non-binders. The classifiers for HLA-A*0201, A*0301, A*1101, B*0702, B*0801, and B*1501 have excellent, and for A*2402 moderate classification accuracy. Sixteen prediction servers predict peptide binding affinity to MHC-I molecules with high accuracy; correlation coefficients ranging from r = 0.55 (B*0801) to r = 0.87 (A*0201). CONCLUSION: Non-linear predictors outperform matrix-based predictors. Most predictors can be improved by non-linear transformations of their raw prediction scores. The best predictors of peptide binding are also best in prediction of T-cell epitopes. We propose a new standard for MHC-I binding prediction - a common scale for normalization of prediction scores, applicable to both experimental and predicted data. The results of this study provide assistance to researchers in selection of most adequate prediction tools and selection criteria that suit the needs of their projects.


Subject(s)
Computational Biology/standards , Databases, Protein/standards , Histocompatibility Antigens Class I/immunology , Internet/standards , Vaccines , Animals , Epitopes, T-Lymphocyte/immunology , Humans , Peptides , Protein Binding
8.
Cancer Res ; 67(20): 9996-10003, 2007 Oct 15.
Article in English | MEDLINE | ID: mdl-17942933

ABSTRACT

Microarrays have been explored for deriving molecular signatures to determine disease outcomes, mechanisms, targets, and treatment strategies. Although exhibiting good predictive performance, some derived signatures are unstable due to noises arising from measurement variability and biological differences. Improvements in measurement, annotation, and signature selection methods have been proposed. We explored a new signature selection method that incorporates consensus scoring of multiple random sampling and multistep evaluation of gene-ranking consistency for maximally avoiding erroneous elimination of predictor genes. This method was tested by using a well-studied 62-sample colon cancer data set and two other cancer data sets (86-sample lung adenocarcinoma and 60-sample hepatocellular carcinoma). For the colon cancer data set, the derived signatures of 20 sampling sets, composed of 10,000 training test sets, are fairly stable with 80% of top 50 and 69% to 93% of all predictor genes shared by all 20 signatures. These shared predictor genes include 48 cancer-related and 16 cancer-implicated genes, as well as 50% of the previously derived predictor genes. The derived signatures outperform all previously derived signatures in predicting colon cancer outcomes from an independent data set collected from the Stanford Microarray Database. Our method showed similar performance for the other two data sets, suggesting its usefulness in deriving stable signatures for biomarker and target discovery.


Subject(s)
Colonic Neoplasms/genetics , Gene Expression Profiling/methods , Oligonucleotide Array Sequence Analysis/methods , Biometry/methods , Carcinoma, Hepatocellular/genetics , Humans , Liver Neoplasms/genetics , Lung Neoplasms/genetics , Pattern Recognition, Automated/methods , Reproducibility of Results
9.
BMC Bioinformatics ; 8: 300, 2007 Aug 17.
Article in English | MEDLINE | ID: mdl-17705863

ABSTRACT

BACKGROUND: Sequence-derived structural and physicochemical descriptors have frequently been used in machine learning prediction of protein functional families, thus there is a need to comparatively evaluate the effectiveness of these descriptor-sets by using the same method and parameter optimization algorithm, and to examine whether the combined use of these descriptor-sets help to improve predictive performance. Six individual descriptor-sets and four combination-sets were evaluated in support vector machines (SVM) prediction of six protein functional families. RESULTS: The performance of these descriptor-sets were ranked by Matthews correlation coefficient (MCC), and categorized into two groups based on their performance. While there is no overwhelmingly favourable choice of descriptor-sets, certain trends were found. The combination-sets tend to give slightly but consistently higher MCC values and thus overall best performance such that three out of four combination-sets show slightly better performance compared to one out of six individual descriptor-sets. CONCLUSION: Our study suggests that currently used descriptor-sets are generally useful for classifying proteins and the prediction performance may be enhanced by exploring combinations of descriptors.


Subject(s)
Algorithms , Proteins/chemistry , Proteins/classification , Sequence Alignment/methods , Sequence Analysis, Protein/methods , Amino Acid Sequence , Molecular Sequence Data , Proteins/metabolism , Structure-Activity Relationship
10.
Drug Discov Today ; 12(7-8): 304-13, 2007 Apr.
Article in English | MEDLINE | ID: mdl-17395090

ABSTRACT

Identification and validation of viable targets is an important first step in drug discovery and new methods, and integrated approaches are continuously explored to improve the discovery rate and exploration of new drug targets. An in silico machine learning method, support vector machines, has been explored as a new method for predicting druggable proteins from amino acid sequence independent of sequence similarity, thereby facilitating the prediction of druggable proteins that exhibit no or low homology to known targets.


Subject(s)
Databases, Protein , Drug Design , Sequence Alignment/methods , Algorithms , Computer Simulation , Humans , Models, Theoretical
11.
Immunogenetics ; 58(8): 607-13, 2006 Aug.
Article in English | MEDLINE | ID: mdl-16832638

ABSTRACT

Major histocompatibility complex (MHC)-binding peptides are essential for antigen recognition by T-cell receptors and are being explored for vaccine design. Computational methods have been developed for predicting MHC-binding peptides of fixed lengths, based on the training of relatively few non-binders. It is desirable to introduce methods applicable for peptides of flexible lengths and trained by using more diverse sets of non-binders. MHC-BPS is a web-based MHC-binder prediction server that uses support vector machines for predicting peptide binders of flexible lengths for 18 MHC class I and 12 class II alleles from sequence-derived physicochemical properties, which were trained by using 4,208 approximately 3,252 binders and 234,333 approximately 168,793 non-binders, and evaluated by an independent set of 545 approximately 476 binders and 110,564 approximately 84,430 non-binders. The binder prediction accuracies are 86 approximately 99% for 25 and 70 approximately 80% for five alleles, and the non-binder accuracies are 96 approximately 99% for 30 alleles. A screening of HIV-1 genome identifies 0.01 approximately 5% and 5 approximately 8% of the constituent peptides as binders for 24 and 6 alleles, respectively, including 75 approximately 100% of the known epitopes. This method correctly predicts 73.3% of the 15 newly published epitopes in the last 4 months of 2005. MHC-BPS is available at http://bidd.cz3.nus.edu.sg/mhc/ .


Subject(s)
Computational Biology , Databases, Protein , Epitopes/chemistry , Histocompatibility Antigens Class II/chemistry , Histocompatibility Antigens Class I/chemistry , Oligopeptides/chemistry , Peptide Fragments/chemistry , Algorithms , Alleles , Amino Acid Sequence , Animals , Epitopes/genetics , Histocompatibility Antigens Class I/genetics , Histocompatibility Antigens Class II/genetics , Humans , Molecular Sequence Data , Oligopeptides/genetics , Protein Binding
12.
Drug Discov Today ; 10(7): 521-9, 2005 Apr 01.
Article in English | MEDLINE | ID: mdl-15809198

ABSTRACT

Drug resistance is of increasing concern in the treatment of infectious diseases and cancer. Mutation in drug-interacting disease proteins is one of the primary causes for resistance particularly against anti-infectious drugs. Prediction of resistance mutations in these proteins is valuable both for the molecular dissection of drug resistance mechanisms and for predicting features that guide the design of new agents to counter resistant strains. Several protein structure- and sequence-based computer methods have been explored for mechanistic study and prediction of resistance mutations. These methods and their usefulness are reviewed here.


Subject(s)
Computer Simulation , Drug Resistance/genetics , Mutation , Proteins/genetics , Computing Methodologies , Models, Molecular , Neural Networks, Computer , Protein Conformation
SELECTION OF CITATIONS
SEARCH DETAIL
...