Your browser doesn't support javascript.
FAIR data pipeline: provenance-driven data management for traceable scientific workflows.
Mitchell, Sonia Natalie; Lahiff, Andrew; Cummings, Nathan; Hollocombe, Jonathan; Boskamp, Bram; Field, Ryan; Reddyhoff, Dennis; Zarebski, Kristian; Wilson, Antony; Viola, Bruno; Burke, Martin; Archibald, Blair; Bessell, Paul; Blackwell, Richard; Boden, Lisa A; Brett, Alys; Brett, Sam; Dundas, Ruth; Enright, Jessica; Gonzalez-Beltran, Alejandra N; Harris, Claire; Hinder, Ian; David Hughes, Christopher; Knight, Martin; Mano, Vino; McMonagle, Ciaran; Mellor, Dominic; Mohr, Sibylle; Marion, Glenn; Matthews, Louise; McKendrick, Iain J; Mark Pooley, Christopher; Porphyre, Thibaud; Reeves, Aaron; Townsend, Edward; Turner, Robert; Walton, Jeremy; Reeve, Richard.
  • Mitchell SN; Institute of Biodiversity, Animal Health and Comparative Medicine, College of Medical, Veterinary and Life Sciences, University of Glasgow, Glasgow, G12 8QQ, UK.
  • Lahiff A; Boyd Orr Centre for Population and Ecosystem Health, University of Glasgow, Glasgow, G12 8QQ, UK.
  • Cummings N; United Kingdom Atomic Energy Authority, Didcot OX14 3DB, UK.
  • Hollocombe J; United Kingdom Atomic Energy Authority, Didcot OX14 3DB, UK.
  • Boskamp B; United Kingdom Atomic Energy Authority, Didcot OX14 3DB, UK.
  • Field R; Biomathematics and Statistics Scotland (BioSS), James Clerk Maxwell Building, Peter Guthrie Tait Road, The King's Buildings, Edinburgh EH9 3FD, UK.
  • Reddyhoff D; MRC/CSO Social and Public Health Sciences Unit, Institute of Health and Wellbeing, College of Medical, Veterinary and Life Sciences, University of Glasgow, Glasgow, G12 8QQ, UK.
  • Zarebski K; Department of Computer Science, University of Sheffield, Regent Court, Sheffield S1 4DP, UK.
  • Wilson A; United Kingdom Atomic Energy Authority, Didcot OX14 3DB, UK.
  • Viola B; Science and Technology Facilities Council, Harwell Campus, Harwell OX11, UK.
  • Burke M; United Kingdom Atomic Energy Authority, Didcot OX14 3DB, UK.
  • Archibald B; Biomathematics and Statistics Scotland (BioSS), James Clerk Maxwell Building, Peter Guthrie Tait Road, The King's Buildings, Edinburgh EH9 3FD, UK.
  • Bessell P; School of Computing Science, College of Science and Engineering, University of Glasgow, Glasgow, G12 8QQ, UK.
  • Blackwell R; Roslin Institute, University of Edinburgh, Edinburgh EH8 9YL, UK.
  • Boden LA; Man Group plc, Riverbank House, 2 Swan Lane, London EC4R 3AD, UK.
  • Brett A; Roslin Institute, University of Edinburgh, Edinburgh EH8 9YL, UK.
  • Brett S; United Kingdom Atomic Energy Authority, Didcot OX14 3DB, UK.
  • Enright J; MRC/CSO Social and Public Health Sciences Unit, Institute of Health and Wellbeing, College of Medical, Veterinary and Life Sciences, University of Glasgow, Glasgow, G12 8QQ, UK.
  • Gonzalez-Beltran AN; Boyd Orr Centre for Population and Ecosystem Health, University of Glasgow, Glasgow, G12 8QQ, UK.
  • Harris C; School of Computing Science, College of Science and Engineering, University of Glasgow, Glasgow, G12 8QQ, UK.
  • Hinder I; Science and Technology Facilities Council, Harwell Campus, Harwell OX11, UK.
  • David Hughes C; Boyd Orr Centre for Population and Ecosystem Health, University of Glasgow, Glasgow, G12 8QQ, UK.
  • Knight M; Biomathematics and Statistics Scotland (BioSS), James Clerk Maxwell Building, Peter Guthrie Tait Road, The King's Buildings, Edinburgh EH9 3FD, UK.
  • Mano V; The University of Manchester, Research IT, Manchester M1 3BU, UK.
  • McMonagle C; Man Group plc, Riverbank House, 2 Swan Lane, London EC4R 3AD, UK.
  • Mellor D; Biomathematics and Statistics Scotland (BioSS), James Clerk Maxwell Building, Peter Guthrie Tait Road, The King's Buildings, Edinburgh EH9 3FD, UK.
  • Mohr S; Man Group plc, Riverbank House, 2 Swan Lane, London EC4R 3AD, UK.
  • Marion G; Boyd Orr Centre for Population and Ecosystem Health, University of Glasgow, Glasgow, G12 8QQ, UK.
  • Matthews L; MRC/CSO Social and Public Health Sciences Unit, Institute of Health and Wellbeing, College of Medical, Veterinary and Life Sciences, University of Glasgow, Glasgow, G12 8QQ, UK.
  • McKendrick IJ; Boyd Orr Centre for Population and Ecosystem Health, University of Glasgow, Glasgow, G12 8QQ, UK.
  • Mark Pooley C; School of Veterinary Medicine, College of Medical, Veterinary and Life Sciences, University of Glasgow, Glasgow G61 1QH, UK.
  • Porphyre T; Institute of Biodiversity, Animal Health and Comparative Medicine, College of Medical, Veterinary and Life Sciences, University of Glasgow, Glasgow, G12 8QQ, UK.
  • Reeves A; Boyd Orr Centre for Population and Ecosystem Health, University of Glasgow, Glasgow, G12 8QQ, UK.
  • Townsend E; Boyd Orr Centre for Population and Ecosystem Health, University of Glasgow, Glasgow, G12 8QQ, UK.
  • Turner R; Biomathematics and Statistics Scotland (BioSS), James Clerk Maxwell Building, Peter Guthrie Tait Road, The King's Buildings, Edinburgh EH9 3FD, UK.
  • Walton J; Institute of Biodiversity, Animal Health and Comparative Medicine, College of Medical, Veterinary and Life Sciences, University of Glasgow, Glasgow, G12 8QQ, UK.
  • Reeve R; Boyd Orr Centre for Population and Ecosystem Health, University of Glasgow, Glasgow, G12 8QQ, UK.
Philos Trans A Math Phys Eng Sci ; 380(2233): 20210300, 2022 Oct 03.
Article in English | MEDLINE | ID: covidwho-1992458
ABSTRACT
Modern epidemiological analyses to understand and combat the spread of disease depend critically on access to, and use of, data. Rapidly evolving data, such as data streams changing during a disease outbreak, are particularly challenging. Data management is further complicated by data being imprecisely identified when used. Public trust in policy decisions resulting from such analyses is easily damaged and is often low, with cynicism arising where claims of 'following the science' are made without accompanying evidence. Tracing the provenance of such decisions back through open software to primary data would clarify this evidence, enhancing the transparency of the decision-making process. Here, we demonstrate a Findable, Accessible, Interoperable and Reusable (FAIR) data pipeline. Although developed during the COVID-19 pandemic, it allows easy annotation of any data as they are consumed by analyses, or conversely traces the provenance of scientific outputs back through the analytical or modelling source code to primary data. Such a tool provides a mechanism for the public, and fellow scientists, to better assess scientific evidence by inspecting its provenance, while allowing scientists to support policymakers in openly justifying their decisions. We believe that such tools should be promoted for use across all areas of policy-facing research. This article is part of the theme issue 'Technical challenges of modelling real-life epidemics and examples of overcoming these'.
Subject(s)
Keywords

Full text: Available Collection: International databases Database: MEDLINE Main subject: Data Management / COVID-19 Limits: Humans Language: English Journal: Philos Trans A Math Phys Eng Sci Journal subject: Biophysics / Biomedical Engineering Year: 2022 Document Type: Article Affiliation country: Rsta.2021.0300

Similar

MEDLINE

...
LILACS

LIS


Full text: Available Collection: International databases Database: MEDLINE Main subject: Data Management / COVID-19 Limits: Humans Language: English Journal: Philos Trans A Math Phys Eng Sci Journal subject: Biophysics / Biomedical Engineering Year: 2022 Document Type: Article Affiliation country: Rsta.2021.0300