Search | VHL Regional Portal

C³: Consensus Cancer Driver Gene Caller.

Zhu, Chen-Yu; Zhou, Chi; Chen, Yun-Qin; Shen, Ai-Zong; Guo, Zong-Ming; Yang, Zhao-Yi; Ye, Xiang-Yun; Qu, Shen; Wei, Jia; Liu, Qi.

Genomics Proteomics Bioinformatics ; 17(3): 311-318, 2019 06.

Article in English | MEDLINE | ID: mdl-31465854

ABSTRACT

Next-generation sequencing has allowed identification of millions of somatic mutations in human cancer cells. A key challenge in interpreting cancer genomes is to distinguish drivers of cancer development among available genetic mutations. To address this issue, we present the first web-based application, consensus cancer driver gene caller (C3), to identify the consensus driver genes using six different complementary strategies, i.e., frequency-based, machine learning-based, functional bias-based, clustering-based, statistics model-based, and network-based strategies. This application allows users to specify customized operations when calling driver genes, and provides solid statistical evaluations and interpretable visualizations on the integration results. C3 is implemented in Python and is freely available for public use at http://drivergene.rwebox.com/c3.
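
The abstract does not say how C3 merges the output of its six strategies. As a minimal sketch of the general idea of consensus calling, the Python below uses simple majority-style voting; this is an assumption, not C3's integration method. The function name `consensus_drivers`, the `min_votes` parameter, and the toy gene lists are all hypothetical and for illustration only.

```python
from collections import Counter


def consensus_drivers(calls, min_votes=2):
    """Return (gene, votes) pairs for genes called by at least `min_votes` strategies.

    `calls` maps a strategy name to the collection of genes it reported.
    Results are ordered by descending vote count, then alphabetically.
    """
    # Each strategy contributes at most one vote per gene.
    votes = Counter(gene for genes in calls.values() for gene in set(genes))
    ranked = sorted(votes.items(), key=lambda kv: (-kv[1], kv[0]))
    return [(gene, n) for gene, n in ranked if n >= min_votes]


# Toy input (made up for illustration, not taken from the paper).
calls = {
    "frequency": {"TP53", "KRAS", "TTN"},
    "machine_learning": {"TP53", "PIK3CA"},
    "network": {"TP53", "KRAS", "PIK3CA"},
}
result = consensus_drivers(calls, min_votes=2)
print(result)
```

Raising `min_votes` trades sensitivity for agreement: a gene reported by only one strategy (here `TTN`) drops out as soon as two votes are required.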

Subject(s)

Algorithms , Neoplasms/genetics , Cluster Analysis , Humans , Internet , Machine Learning

Robust authentication for paper-based text documents based on text watermarking technology.

Qi, Wen Fa; Guo, Wei; Zhang, Tong; Liu, Yu Xin; Guo, Zong Ming; Fang, Xi Feng.

Math Biosci Eng ; 16(4): 2233-2249, 2019 03 15.

Article in English | MEDLINE | ID: mdl-31137209

ABSTRACT

To address the ease of tampering with paper text documents and the difficulty of authenticating their integrity, this paper proposes a robust content authentication method for printed documents based on a text watermarking scheme that resists print-and-scan attacks. First, an authentication watermark signal sequence related to the content of the text document is generated based on the Logistic chaotic map model; then, the sequence is embedded into the printed paper document using a robust text watermarking scheme; finally, the watermark information is extracted from the scanned image of the paper document and compared with the authentication watermark information calculated in real time from the document text obtained by OCR, thereby authenticating the content integrity of the paper text document. Experimental results show that the method achieves robust content integrity authentication of paper text documents and can also accurately locate the tampered position. In addition, the document retains a good visual appearance after the watermark is embedded, and the text watermarking scheme has a large information capacity.
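
The first step in the abstract, deriving a content-dependent sequence from the logistic map x(n+1) = r·x(n)·(1 − x(n)), can be sketched as follows. The abstract does not describe how the document content seeds the map or how values become watermark bits, so the SHA-256 seeding, the 0.5 threshold, and the parameters `r`, `burn_in`, and `n_bits` are assumptions chosen for illustration, not the authors' scheme.

```python
import hashlib


def logistic_bits(text, n_bits=32, r=3.99, burn_in=100):
    """Derive a reproducible bit sequence from `text` using the logistic map."""
    digest = hashlib.sha256(text.encode("utf-8")).digest()
    # Assumption: map the first 8 digest bytes to a seed in [0.1, 0.9],
    # safely away from the fixed point at 0 and the boundary at 1.
    x = 0.1 + 0.8 * (int.from_bytes(digest[:8], "big") / 2**64)
    # Discard early iterations so the output depends less visibly on the seed.
    for _ in range(burn_in):
        x = r * x * (1.0 - x)
    bits = []
    for _ in range(n_bits):
        x = r * x * (1.0 - x)
        bits.append(1 if x >= 0.5 else 0)  # threshold each value to one bit
    return bits


embedded = logistic_bits("Original paragraph text.")
recomputed = logistic_bits("Original paragraph text.")
print(embedded == recomputed)
```

In an authentication setting, the sequence recomputed from the OCR text would be compared with the sequence extracted from the scanned watermark; a mismatch would indicate that the text has changed.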

Subject(s)

Computer Security , Medical Informatics/instrumentation , Algorithms , Computer Graphics/standards , Data Compression/methods , Language , Medical Informatics/methods , Nonlinear Dynamics , Pattern Recognition, Automated/methods , Software

[Expression of CD66c (CEACAM6) in adult acute leukemia and its significance].

Chen, Bao-guo; Yan, Wei-hua; Meng, Zhe-feng; Guo, Zong-ming; Zhu, Min; Li, Bo-li.

Zhonghua Xue Ye Xue Za Zhi ; 27(6): 370-3, 2006 Jun.

Article in Chinese | MEDLINE | ID: mdl-17147224

ABSTRACT

OBJECTIVE: To explore the expression of CD66c (CEACAM6) in adult acute leukemia and its significance. METHODS: Acute leukemia cell lines HL-60, K562, LCL721.221 and Jurkat were cultured in vitro. RT-PCR and multi-parameter flow cytometry were used to analyze CD66c mRNA and protein expression, respectively, in the cell lines and in patients' bone marrow leukemic cells. Cytogenetic analysis of 199 bone marrow samples from leukemia patients and minimal residual disease (MRD) detection in 25 CD66c-positive B-lineage ALL cases were performed. RESULTS: (1) CD66c expression, both on the cell surface and in plasma, was negative in all the cell lines. (2) Four of 127 AML cases (3.15%) (mainly M2 and M4) and 28 of 79 ALL cases (35.44%) (all B-lineage ALL) were CD66c positive; the ALL subtypes were common B-ALL (20/54) and pre-B-ALL (8/11), including 8 Ph+ B-lineage ALL. (3) The six-month relapse rate differed significantly between MRD-positive and MRD-negative patients. (4) CD66c mRNA was strongly expressed in B-lineage ALL. Among the cell lines, only HL-60 cells weakly expressed CD66c mRNA. CONCLUSION: CD66c expression could be a useful biomarker for MRD analysis in ALL, and is closely associated with its transcription level.

Subject(s)

Antigens, CD/biosynthesis , Carcinoembryonic Antigen/biosynthesis , Cell Adhesion Molecules/biosynthesis , Leukemia, Myeloid, Acute/metabolism , Precursor Cell Lymphoblastic Leukemia-Lymphoma/metabolism , Adolescent , Adult , Aged , Carcinoembryonic Antigen/genetics , GPI-Linked Proteins , HL-60 Cells , Humans , K562 Cells , Male , Middle Aged , Neoplasm, Residual/metabolism , RNA, Messenger/biosynthesis

Systematic analysis of head-to-head gene organization: evolutionary conservation and potential biological relevance.

Li, Yuan-Yuan; Yu, Hui; Guo, Zong-Ming; Guo, Ting-Qing; Tu, Kang; Li, Yi-Xue.

PLoS Comput Biol ; 2(7): e74, 2006 Jul 07.

Article in English | MEDLINE | ID: mdl-16839196

ABSTRACT

Several "head-to-head" (or "bidirectional") gene pairs have been studied in individual experiments, but genome-wide analysis of this gene organization, especially in terms of transcriptional correlation and functional association, is still insufficient. We conducted a systematic investigation of head-to-head gene organization focusing on structural features, evolutionary conservation, expression correlation and functional association. Of the 1,262, 1,071, and 491 head-to-head pairs identified in the human, mouse, and rat genomes, respectively, pairs with a distance of 1 to 400 base pairs between transcription start sites form the majority (62.36%, 64.15%, and 55.19% for human, mouse, and rat, respectively) of each dataset, and the largest group is always the one with a transcription start site distance of 101 to 200 base pairs. The phylogenetic analysis among Fugu, chicken, and human indicates a negative selection on the separation of head-to-head genes across vertebrate evolution, and thus the ancestral existence of this gene organization. The expression analysis shows that most of the human head-to-head genes are significantly correlated, and the correlation could be positive, negative, or alternative depending on the experimental conditions. Finally, head-to-head genes statistically tend to perform similar functions, and gene pairs associated with the significant cofunctions seem to have stronger expression correlations. The findings indicate that the head-to-head gene organization is ancient and conserved, which subjects functionally related genes to correlated transcriptional regulation and thus provides an exquisite mechanism of transcriptional regulation based on gene organization. These results have significantly expanded the knowledge about head-to-head gene organization. Supplementary materials for this study are available at http://www.scbit.org/h2h.
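
To make the notion of a head-to-head pair concrete, the sketch below flags two genes on the same chromosome and opposite strands whose transcription start sites (TSSs) lie close together and point away from each other. The abstract does not give the distance cutoff used to define a pair, so `max_distance=1000` is an assumed default; the `Gene` record, the function name, and the toy coordinates are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Gene:
    name: str
    chrom: str
    strand: str  # "+" or "-"
    tss: int     # transcription start site coordinate


def head_to_head_pairs(genes, max_distance=1000):
    """Return (minus_gene, plus_gene, tss_distance) for divergent neighbouring genes."""
    minus = [g for g in genes if g.strand == "-"]
    plus = [g for g in genes if g.strand == "+"]
    pairs = []
    # Quadratic scan for clarity; a genome-scale version would sort by position.
    for m in minus:
        for p in plus:
            if m.chrom != p.chrom:
                continue
            # Divergent layout: the minus-strand TSS sits upstream of the plus-strand TSS.
            distance = p.tss - m.tss
            if 0 < distance <= max_distance:
                pairs.append((m.name, p.name, distance))
    return sorted(pairs, key=lambda t: (t[2], t[0], t[1]))


# Toy coordinates, made up for illustration.
genes = [
    Gene("geneA", "chr1", "-", 1000),
    Gene("geneB", "chr1", "+", 1150),
    Gene("geneC", "chr1", "+", 5000),
    Gene("geneD", "chr2", "+", 1100),
    Gene("geneE", "chr1", "-", 1300),
]
pairs = head_to_head_pairs(genes)
print(pairs)
```

Here only `geneA`/`geneB` qualify: `geneD` is on another chromosome, `geneC` is too far away, and `geneE` lies downstream of `geneB`, so those two are transcribed toward each other rather than apart.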

Subject(s)

Computational Biology/methods , Evolution, Molecular , Animals , Chickens , Chromosome Mapping , Databases, Genetic , Genetic Linkage , Genome , Humans , Mice , Models, Biological , Open Reading Frames , Phylogeny , Rats , Species Specificity , Systems Biology , Transcription, Genetic

In silico discovery of human natural antisense transcripts.

Li, Yuan-Yuan; Qin, Lei; Guo, Zong-Ming; Liu, Lei; Xu, Hao; Hao, Pei; Su, Jiong; Shi, Yixiang; He, Wei-Zhong; Li, Yi-Xue.

BMC Bioinformatics ; 7: 18, 2006 Jan 13.

Article in English | MEDLINE | ID: mdl-16409644

ABSTRACT

BACKGROUND: Several high-throughput searches for potential natural antisense transcripts (NATs) have been performed recently, but most of the reports focused on the cis type. A thorough in silico analysis of human transcripts will help expand our knowledge of NATs. RESULTS: We have identified 568 NATs from human RefSeq RNA sequences. Among them, 403 NATs are reported for the first time, and at least 157 novel NATs are trans type. According to the pairing region of a sense and antisense RNA pair, hNATs are divided into 6 classes, of which about 87% involve 5' or 3' UTR sequences, supporting the regulatory role of UTRs. Among a total of 535 NAT pairs related to splice variants, 77.4% (414/535) have their pairing regions affected or completely eliminated by alternative splicing, suggesting a significant relationship between alternative splicing and antisense-directed regulation. The extensive occurrence of splice variants in hNATs and other multiple pairing patterns results in a one-to-many relationship, allowing the formation of complex regulation networks. Based on microarray data from the Stanford Microarray Database, two hNAT pairs were found to display significant inverse expression patterns before and after insulin injection. CONCLUSION: NATs might carry out more extensive and complex functions than previously thought. Combined with endogenous microRNAs, hNATs could be regarded as a special group of transcripts contributing to the complex regulation networks.

Subject(s)

Algorithms , Chromosome Mapping/methods , Proteome/genetics , RNA, Antisense/genetics , Sequence Alignment/methods , Sequence Analysis, RNA/methods , Transcription Factors/genetics , Base Sequence , Databases, Protein , Humans , Molecular Sequence Data

Cross-host evolution of severe acute respiratory syndrome coronavirus in palm civet and human.

Song, Huai-Dong; Tu, Chang-Chun; Zhang, Guo-Wei; Wang, Sheng-Yue; Zheng, Kui; Lei, Lian-Cheng; Chen, Qiu-Xia; Gao, Yu-Wei; Zhou, Hui-Qiong; Xiang, Hua; Zheng, Hua-Jun; Chern, Shur-Wern Wang; Cheng, Feng; Pan, Chun-Ming; Xuan, Hua; Chen, Sai-Juan; Luo, Hui-Ming; Zhou, Duan-Hua; Liu, Yu-Fei; He, Jian-Feng; Qin, Peng-Zhe; Li, Ling-Hui; Ren, Yu-Qi; Liang, Wen-Jia; Yu, Ye-Dong; Anderson, Larry; Wang, Ming; Xu, Rui-Heng; Wu, Xin-Wei; Zheng, Huan-Ying; Chen, Jin-Ding; Liang, Guodong; Gao, Yang; Liao, Ming; Fang, Ling; Jiang, Li-Yun; Li, Hui; Chen, Fang; Di, Biao; He, Li-Juan; Lin, Jin-Yan; Tong, Suxiang; Kong, Xiangang; Du, Lin; Hao, Pei; Tang, Hua; Bernini, Andrea; Yu, Xiao-Jing; Spiga, Ottavia; Guo, Zong-Ming.

Proc Natl Acad Sci U S A ; 102(7): 2430-5, 2005 Feb 15.

Article in English | MEDLINE | ID: mdl-15695582

ABSTRACT

The genomic sequences of severe acute respiratory syndrome coronaviruses from human and palm civet of the 2003/2004 outbreak in the city of Guangzhou, China, were nearly identical. Phylogenetic analysis suggested an independent viral invasion from animal to human in this new episode. Combining all existing data but excluding singletons, we identified 202 single-nucleotide variations. Among them, 17 are polymorphic in palm civets only. The ratio of nonsynonymous/synonymous nucleotide substitution in palm civets collected 1 yr apart from different geographic locations is very high, suggesting a rapid evolving process of viral proteins in civet as well, much like their adaptation in the human host in the early 2002-2003 epidemic. Major genetic variations in some critical genes, particularly the Spike gene, seemed essential for the transition from animal-to-human transmission to human-to-human transmission, which eventually caused the first severe acute respiratory syndrome outbreak of 2002/2003.

Subject(s)

Evolution, Molecular , Severe Acute Respiratory Syndrome/virology , Severe acute respiratory syndrome-related coronavirus/genetics , Viverridae/virology , Amino Acid Substitution , Animals , China/epidemiology , Disease Outbreaks , Genes, Viral , Humans , Membrane Glycoproteins/genetics , Phylogeny , Polymorphism, Single Nucleotide , Severe acute respiratory syndrome-related coronavirus/isolation & purification , Severe acute respiratory syndrome-related coronavirus/pathogenicity , Severe acute respiratory syndrome-related coronavirus/physiology , Severe Acute Respiratory Syndrome/epidemiology , Severe Acute Respiratory Syndrome/transmission , Species Specificity , Spike Glycoprotein, Coronavirus , Viral Envelope Proteins/genetics , Zoonoses/epidemiology , Zoonoses/transmission , Zoonoses/virology

Application of pseudo amino acid composition for predicting protein subcellular location: stochastic signal processing approach.

Pan, Yu-Xi; Zhang, Zhi-Zhou; Guo, Zong-Ming; Feng, Guo-Yin; Huang, Zhen-De; He, Lin.

J Protein Chem ; 22(4): 395-402, 2003 May.

Article in English | MEDLINE | ID: mdl-13678304

ABSTRACT

The function of a protein is closely correlated with its subcellular location. With the success of the Human Genome Project and the rapid increase in the number of newly found protein sequences entering data banks, it is highly desirable to develop an automated method for predicting the subcellular location of proteins. Such a predictor would expedite the functional determination of newly found proteins and the process of prioritizing genes and proteins identified by genomics efforts as potential molecular targets for drug design. Based on the concept of pseudo amino acid composition originally proposed by K. C. Chou (Proteins: Struct. Funct. Genet. 43: 246-255, 2001), the digital signal processing approach has been introduced to partially incorporate the sequence order effect. One of the remarkable merits of doing so is that many existing tools in mathematics and engineering can be straightforwardly used in predicting protein subcellular location. The results thus obtained are quite encouraging. It is anticipated that digital signal processing may serve as a useful vehicle for many other areas of protein science as well.
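
For readers unfamiliar with pseudo amino acid composition, the sketch below builds a (20 + λ)-dimensional vector: 20 normalized residue frequencies followed by λ sequence-order correlation factors, weighted by `w`. It illustrates only the underlying concept, not the signal processing method of this paper. As a simplification, it uses a single property, the Kyte-Doolittle hydropathy scale, in place of the several physicochemical properties of the original formulation; the function name and the defaults `lam=3` and `w=0.05` are assumptions.

```python
from statistics import mean, pstdev

# Kyte-Doolittle hydropathy values (swapped in here as the only property).
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}
AMINO_ACIDS = sorted(KD)

# Standardize the property to zero mean and unit variance over the 20 residues.
_MU, _SD = mean(KD.values()), pstdev(KD.values())
H = {aa: (v - _MU) / _SD for aa, v in KD.items()}


def pse_aac(seq, lam=3, w=0.05):
    """Return 20 composition terms followed by `lam` sequence-order terms."""
    seq = seq.upper()
    if any(aa not in KD for aa in seq):
        raise ValueError("sequence contains a non-standard residue")
    if not 0 < lam < len(seq):
        raise ValueError("lam must be positive and smaller than the sequence length")
    n = len(seq)
    freq = [seq.count(aa) / n for aa in AMINO_ACIDS]
    # theta[k-1]: mean squared property difference between residues k positions apart.
    theta = [
        mean((H[seq[i]] - H[seq[i + k]]) ** 2 for i in range(n - k))
        for k in range(1, lam + 1)
    ]
    denom = sum(freq) + w * sum(theta)
    return [f / denom for f in freq] + [w * t / denom for t in theta]


vector = pse_aac("MKTAYIAKQR", lam=3)
print(len(vector))
```

The first 20 components reduce to the ordinary amino acid composition when `w` is 0; the extra λ components are what carry partial sequence-order information.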

Subject(s)

Amino Acids/analysis , Cells/metabolism , Computational Biology/methods , Proteins/chemistry , Proteins/metabolism , Algorithms , Cells/cytology , Humans , Protein Transport , Stochastic Processes , Subcellular Fractions/chemistry , Subcellular Fractions/metabolism

Putative hAPN receptor binding sites in SARS_CoV spike protein.

Yu, Xiao-Jing; Luo, Cheng; Lin, Jian-Cheng; Hao, Pei; He, You-Yu; Guo, Zong-Ming; Qin, Lei; Su, Jiong; Liu, Bo-Shu; Huang, Yin; Nan, Peng; Li, Chuan-Song; Xiong, Bin; Luo, Xiao-Min; Zhao, Guo-Ping; Pei, Gang; Chen, Kai-Xian; Shen, Xu; Shen, Jian-Hua; Zou, Jian-Ping; He, Wei-Zhong; Shi, Tie-Liu; Zhong, Yang; Jiang, Hua-Liang; Li, Yi-Xue.

Acta Pharmacol Sin ; 24(6): 481-8, 2003 Jun.

Article in English | MEDLINE | ID: mdl-12791172

ABSTRACT

AIM: To obtain information on ligand-receptor binding between the S protein of SARS-CoV and CD13, identify the possible interacting domains or motifs related to binding sites, and provide clues for studying the functions of SARS proteins and designing anti-SARS drugs and vaccines. METHODS: On the basis of comparative genomics, homology search, phylogenetic analyses, and multi-sequence alignment were used to predict CD13-related interacting domains and binding sites in the S protein of SARS-CoV. Molecular modeling and docking simulation methods were employed to characterize the interaction between CD13 and the S protein of SARS-CoV and to validate the bioinformatics predictions. RESULTS: Possible binding sites for CD13 in the SARS-CoV S protein have been mapped out using bioinformatics analysis tools. The binding of one protein-protein interaction pair (the D757-R761 motif of the SARS-CoV S protein to the P585-A653 domain of CD13) has been simulated by molecular modeling and docking simulation methods. CONCLUSION: CD13 may be a possible receptor of the SARS-CoV S protein, which may be associated with SARS infection. This study also provides a possible strategy for mapping the possible binding receptors of the proteins in a genome.

Subject(s)

CD13 Antigens/metabolism , Membrane Glycoproteins/metabolism , Severe Acute Respiratory Syndrome/virology , Severe acute respiratory syndrome-related coronavirus/chemistry , Viral Envelope Proteins/metabolism , Amino Acid Sequence , Binding Sites , CD13 Antigens/chemistry , CD13 Antigens/genetics , Catalytic Domain , Computational Biology , Humans , Membrane Glycoproteins/chemistry , Membrane Glycoproteins/genetics , Molecular Sequence Data , Protein Binding , Protein Interaction Mapping , Protein Structure, Tertiary , Severe acute respiratory syndrome-related coronavirus/genetics , Sequence Alignment , Spike Glycoprotein, Coronavirus , Viral Envelope Proteins/chemistry , Viral Envelope Proteins/genetics

Identification of probable genomic packaging signal sequence from SARS-CoV genome by bioinformatics analysis.

Qin, Lei; Xiong, Bin; Luo, Cheng; Guo, Zong-Ming; Hao, Pei; Su, Jiong; Nan, Peng; Feng, Ying; Shi, Yi-Xiang; Yu, Xiao-Jing; Luo, Xiao-Min; Chen, Kai-Xian; Shen, Xu; Shen, Jian-Hua; Zou, Jian-Ping; Zhao, Guo-Ping; Shi, Tie-Liu; He, Wei-Zhong; Zhong, Yang; Jiang, Hua-Liang; Li, Yi-Xue.

Acta Pharmacol Sin ; 24(6): 489-96, 2003 Jun.

Article in English | MEDLINE | ID: mdl-12791173

ABSTRACT

AIM: To predict the probable genomic packaging signal of SARS-CoV by bioinformatics analysis. The derived packaging signal may be used to design antisense RNA and RNA interference (RNAi) drugs for treating SARS. METHODS: Based on studies of the genomic packaging signals of MHV and BCoV, especially the information about primary and secondary structures, the putative genomic packaging signal of SARS-CoV was analyzed using bioinformatic tools. Multi-alignment of the genomic sequences was performed among SARS-CoV, MHV, BCoV, PEDV and HCoV 229E. Secondary structures of RNA sequences were also predicted to identify the possible genomic packaging signals. Meanwhile, the N and M proteins of all five viruses were analyzed to study the evolutionary relationship with genomic packaging signals. RESULTS: The putative genomic packaging signal of SARS-CoV is located at the 3' end of ORF1b, near that of MHV and BCoV, which is the most variable region of this gene. The RNA secondary structure of the SARS-CoV genomic packaging signal is very similar to that of MHV and BCoV. The same result was also obtained when studying the genomic packaging signals of PEDV and HCoV 229E. Furthermore, the genomic sequence multi-alignment indicated that the locations of the packaging signals of SARS-CoV, PEDV, and HCoV overlapped each other. It seems that the mutation rate of the packaging signal sequences is much higher than that of the N protein, whereas the M protein shows only subtle variations. CONCLUSIONS: The probable genomic packaging signal of SARS-CoV is analogous to that of MHV and BCoV, with the corresponding RNA secondary structure located in a similar region of ORF1b. The positions where genomic packaging signals exist have undergone rounds of mutations, which may consequently influence the primary structures of the N and M proteins.

Subject(s)

Nucleocapsid Proteins/genetics , Protein Sorting Signals/genetics , Severe Acute Respiratory Syndrome/virology , Severe acute respiratory syndrome-related coronavirus/genetics , Viral Matrix Proteins/genetics , Amino Acid Sequence , Base Sequence , Computational Biology , Coronavirus 229E, Human/genetics , Coronavirus, Bovine/genetics , Genome, Viral , Humans , Molecular Sequence Data , Murine hepatitis virus/genetics , Protein Structure, Secondary , RNA Interference , RNA, Antisense/genetics , RNA, Viral/genetics , Severe acute respiratory syndrome-related coronavirus/isolation & purification , Sequence Alignment , Sequence Homology, Amino Acid
