RESUMO
Aims: To establish the common rules of exon combinatorics during RNA splicing. Study Design: Inferring a plausible statistical model of exon combinatorics from the annotated models of human genes during RNA splicing. Place and Duration of Study: Department of Genetics (Belarusian State University), Proteome and Genome Research Unit (Luxembourg Institute of Health), Department of Genetics (Lomonosov Moscow State University) and Moscow Center of Experimental Embryology and Reproductive Biotechnologies, between January 2017 and July 2019. Methodology: We used human mRNA and EST sequences from GenBank (1093522 unique records in total) and linear models of the human genes from Ensembl (58051 genes), AceView (72384 genes), ECgene (57172 genes), NCBI RefSeq (54262 genes), UCSC Genome Browser (58037 genes) and VEGA (54950 genes) to calculate a combinatorial index of human exons. We inferred the most plausible statistical model describing the distribution of combinatorial index of human exons using Clauset’s mathematical formalism. Predictors of the combinatorial index values and functional outcomes of the predefined behavior of exons during splicing were also determined. Results: Power-law is the most plausible statistical model describing the combinatorics of exons during RNA splicing. The combinatorial index of human exons is defined by more than 90% by the 138 features that have different importance. The most important of these features are the abundance of exon in transcripts, the strength of splice sites, the rank of exon in transcripts and the type of exon. Analysis of the marginal effects shows that different values of the same feature have unequal influence on the combinatorial index of human exons. Power-law behavior of exons during RNA splicing pre-determines structural diversity of transcripts, low sensitivity of splicing process to random perturbations and its high vulnerability to manipulation with highly combinative exons. Conclusion: Exons widely involved in alternative splicing are a part of the common power-law phenomenon in human cells. The power-law behavior of exons during RNA splicing gives the unique characteristics to human genes.