Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 1 de 1
Filter
Add more filters










Database
Language
Publication year range
1.
Genome Inform ; 17(1): 124-33, 2006.
Article in English | MEDLINE | ID: mdl-17503362

ABSTRACT

DNA motifs, or cis-elements, are short nucleotide sequence patterns recognized by various transcription factors (TFs). In promoters, these TFs bind in a complex combinatorial manner in order to regulate the expression of a downstream gene. The combinatorial space is frequently large and difficult to manage since vertebrates have thousands of transcription factors and more than 20,000 genes. We introduce a computer program called CAYCE (Combinatorial AnalYsis of Cis-Elements) that systematically detects statistically overrepresented DNA motif association rules independent of Microarray information. CAYCE is an adaptation of the apriori algorithm traditionally used for association rule mining, but offers three significant advancements. (1) It analyzes multiple occurrences of an item, corresponding to multiple TF binding sites, (2) It compares results with a biologically relevant background, and (3), it provides p-values for straightforward statistical interpretation. CAYCE can be easily applied to any item-set data where the investigator is also interested in multiple occurrences of a single item, and/or overrepresentation of association rules compared with a background. Applying CAYCE to human promoters in 1% of the human genome, we discover that motif clusters containing five repetitions of SP1 are the most statistically significant.


Subject(s)
Combinatorial Chemistry Techniques/statistics & numerical data , DNA/genetics , Promoter Regions, Genetic , Sequence Analysis, DNA , Transcription Factors/genetics , Algorithms , Amino Acid Motifs/genetics , Binding Sites/genetics , Cell Line , DNA/metabolism , GA-Binding Protein Transcription Factor/genetics , GA-Binding Protein Transcription Factor/metabolism , Humans , NF-E2-Related Factor 1/genetics , NF-E2-Related Factor 1/metabolism , Random Allocation , Sp1 Transcription Factor/genetics , Sp1 Transcription Factor/metabolism , Transcription Factors/chemistry , Transcription Factors/metabolism
SELECTION OF CITATIONS
SEARCH DETAIL
...