Search | VHL Regional Portal

Experiences with a training DSW knowledge model for early-stage researchers.

Devignes, Marie-Dominique; Smaïl-Tabbone, Malika; Dhondge, Hrishikesh; Dolcemascolo, Roswitha; Gavaldá-García, Jose; Higuera-Rodriguez, R Anahí; Kravchenko, Anna; Roca Martínez, Joel; Messini, Niki; Pérez-Ràfols, Anna; Pérez Ropero, Guillermo; Sperotto, Luca; Chauvot de Beauchêne, Isaure; Vranken, Wim.

Open Res Eur ; 3: 97, 2023.

Article in English | MEDLINE | ID: mdl-37645489

ABSTRACT

Background: Data management is fast becoming an essential part of scientific practice, driven by open science and FAIR (findable, accessible, interoperable, and reusable) data sharing requirements. Whilst data management plans (DMPs) are clear to data management experts and data stewards, understandings of their purpose and creation are often obscure to the producers of the data, which in academic environments are often PhD students. Methods: Within the RNAct EU Horizon 2020 ITN project, we engaged the 10 RNAct early-stage researchers (ESRs) in a training project aimed at formulating a DMP. To do so, we used the Data Stewardship Wizard (DSW) framework and modified the existing Life Sciences Knowledge Model into a simplified version aimed at training young scientists, with computational or experimental backgrounds, in core data management principles. We collected feedback from the ESRs during this exercise. Results: Here, we introduce our new life-sciences training DMP template for young scientists. We report and discuss our experiences as principal investigators (PIs) and ESRs during this project and address the typical difficulties that are encountered in developing and understanding a DMP. Conclusions: We found that the DS-wizard can also be an appropriate tool for DMP training, to get terminology and concepts across to researchers. A full training in addition requires an upstream step to present basic DMP concepts and a downstream step to publish a dataset in a (public) repository. Overall, the DS-Wizard tool was essential for our DMP training and we hope our efforts can be used in other projects.

CroMaSt: a workflow for assessing protein domain classification by cross-mapping of structural instances between domain databases and structural alignment.

Dhondge, Hrishikesh; Chauvot de Beauchêne, Isaure; Devignes, Marie-Dominique.

Bioinform Adv ; 3(1): vbad081, 2023.

Article in English | MEDLINE | ID: mdl-37431435

ABSTRACT

Motivation: Protein domains can be viewed as building blocks, essential for understanding structure-function relationships in proteins. However, each domain database classifies protein domains using its own methodology. Thus, in many cases, domain models and boundaries differ from one domain database to the other, raising the question of domain definition and enumeration of true domain instances. Results: We propose an automated iterative workflow to assess protein domain classification by cross-mapping domain structural instances between domain databases and by evaluating structural alignments. CroMaSt (for Cross-Mapper of domain Structural instances) will classify all experimental structural instances of a given domain type into four different categories ('Core', 'True', 'Domain-like' and 'Failed'). CroMast is developed in Common Workflow Language and takes advantage of two well-known domain databases with wide coverage: Pfam and CATH. It uses the Kpax structural alignment tool with expert-adjusted parameters. CroMaSt was tested with the RNA Recognition Motif domain type and identifies 962 'True' and 541 'Domain-like' structural instances for this domain type. This method solves a crucial issue in domain-centric research and can generate essential information that could be used for synthetic biology and machine-learning approaches of protein domain engineering. Availability and implementation: The workflow and the Results archive for the CroMaSt runs presented in this article are available from WorkflowHub (doi: 10.48546/workflowhub.workflow.390.2). Supplementary information: Supplementary data are available at Bioinformatics Advances online.

Deciphering the RRM-RNA recognition code: A computational analysis.

Roca-Martínez, Joel; Dhondge, Hrishikesh; Sattler, Michael; Vranken, Wim F.

PLoS Comput Biol ; 19(1): e1010859, 2023 01.

Article in English | MEDLINE | ID: mdl-36689472

ABSTRACT

RNA recognition motifs (RRM) are the most prevalent class of RNA binding domains in eucaryotes. Their RNA binding preferences have been investigated for almost two decades, and even though some RRM domains are now very well described, their RNA recognition code has remained elusive. An increasing number of experimental structures of RRM-RNA complexes has become available in recent years. Here, we perform an in-depth computational analysis to derive an RNA recognition code for canonical RRMs. We present and validate a computational scoring method to estimate the binding between an RRM and a single stranded RNA, based on structural data from a carefully curated multiple sequence alignment, which can predict RRM binding RNA sequence motifs based on the RRM protein sequence. Given the importance and prevalence of RRMs in humans and other species, this tool could help design RNA binding motifs with uses in medical or synthetic biology applications, leading towards the de novo design of RRMs with specific RNA recognition.

Subject(s)

RNA Recognition Motif , RNA , Humans , RNA/chemistry , Amino Acid Sequence , Sequence Alignment , Nucleotide Motifs/genetics , Protein Binding , Binding Sites

ABSTRACT

ABSTRACT

ABSTRACT

Subject(s)

SEND TO:

SELECTION OF CITATIONS

SEARCH DETAIL