ABSTRACT
A whole-exome capture and next-generation sequencing was applied to an 11 y/o patient with a clinical history of congenital hypotonia, generalized motor and cognitive neurodevelopmental delay, and severe cognitive deficit, and without any identifiable Syndromic pattern, and to her parents, we disclosed a de novo heterozygous pathogenic mutation, c.697_699del p.Phe233del (rs786204835)(ACMG classification PS2, PM1, PM2, PP5), harbored in the PURA gene (MIM*600473) (5q31.3), associated with Autosomal Dominant Mental Retardation 31 (MIM # 616158). We used the significant improvement in the accuracy of protein structure prediction recently implemented in AlphaFold that incorporates novel neural network architectures and training procedures based on the evolutionary, physical, and geometric constraints of protein structures. The wild-type (WT) sequence and the mutated sequence, missing the Phe233, were reconstructed. The predicted local Distance Difference Test (lDDT) for the PURAwt and the PURA-Phe233del showed that the occurrence of the Phe233del affects between 220-320 amino acids. The distortion in the PURA structural conformation in the ~5 Å surrounding area after the p.Phe233del produces a conspicuous disruption of the repeat III, where the DNA and RNA helix unwinding capability occurs. PURA Protein-DNA docking corroborated these results in an in silico analysis that showed a loss of the contact of the PURA-Phe233del III repeat domain model with the DNA. Together, (i) the energetic and stereochemical, (ii) the hydropathic indexes and polarity surfaces, and (iii) the hybrid Quantum Mechanics-Molecular Mechanics (QM-MM) analyses of the PURA molecular models demarcate, at the atomic resolution, the specific surrounding region affected by these mutations and pave the way for future cell-based functional analysis. To the best of our knowledge, this is the first report of a de novo mutation underpinning a PURA syndrome in a Latin American patient and highlights the importance of predicting the molecular effects in protein structure using artificial intelligence algorithms and molecular and atomic resolution stereochemical analyses.