Journal article 505 views 32 downloads

Improving Science That Uses Code

Harold Thimbleby Orcid Logo

The Computer Journal, Volume: 67, Issue: 4, Pages: 1381 - 1404

Swansea University Author: Harold Thimbleby Orcid Logo

  • 63873.VoR.pdf

    PDF | Version of Record

    ©The Author(s) 2023. This is an Open Access article distributed under the terms of the Creative Commons Attribution License.

    Download (1.11MB)

Check full text

DOI (Published version): 10.1093/comjnl/bxad067

Abstract

As code is now an inextricable part of science it should be supported by competent Software Engineering, analogously to statistical claims being properly supported by competent statistics.If and when code avoids adequate scrutiny, science becomes unreliable and unverifiable because results — text, d...

Full description

Published in: The Computer Journal
ISSN: 0010-4620 1460-2067
Published: Oxford University Press (OUP) 2024
Online Access: Check full text

URI: https://cronfa.swan.ac.uk/Record/cronfa63873
first_indexed 2023-09-05T15:03:31Z
last_indexed 2024-11-25T14:13:02Z
id cronfa63873
recordtype SURis
fullrecord <?xml version="1.0"?><rfc1807><datestamp>2024-08-20T13:18:37.6988063</datestamp><bib-version>v2</bib-version><id>63873</id><entry>2023-07-12</entry><title>Improving Science That Uses Code</title><swanseaauthors><author><sid>c12beb0ab0e333a9a512589d411d17f3</sid><ORCID>0000-0003-2222-4243</ORCID><firstname>Harold</firstname><surname>Thimbleby</surname><name>Harold Thimbleby</name><active>true</active><ethesisStudent>false</ethesisStudent></author></swanseaauthors><date>2023-07-12</date><abstract>As code is now an inextricable part of science it should be supported by competent Software Engineering, analogously to statistical claims being properly supported by competent statistics.If and when code avoids adequate scrutiny, science becomes unreliable and unverifiable because results &#x2014; text, data, graphs, images, etc &#x2014; depend on untrustworthy code.Currently, scientists rarely assure the quality of the code they rely on, and rarely make it accessible for scrutiny. Even when available, scientists rarely provide adequate documentation to understand or use it reliably.This paper proposes and justifies ways to improve science using code:1. Professional Software Engineers can help, particularly in critical fields such as public health, climate change and energy.2. &#x2018;Software Engineering Boards,&#x2019; analogous to Ethics or Institutional Review Boards, should be instigated and used.3. The Reproducible Analytic Pipeline (RAP) methodology can be generalized to cover code and Software Engineering methodologies, in a generalization this paper introduces called RAP+. RAP+ (or comparable interventions) could be supported and or even required in journal, conference and funding body policies.The paper&#x2019;s Supplemental Material provides a summary of Software Engineering best practice relevant to scientific research, including further suggestions for RAP+ workflows.&#x2018;Science is what we understand well enough to explain to a computer.&#x2019; Donald E. Knuth in A=B [ 1]&#x2018;I have to write to discover what I am doing.&#x2019; Flannery O&#x2019;Connor, quoted in Write for your life [ 2]&#x2018;Criticism is the mother of methodology.&#x2019; Robert P. Abelson in Statistics as Principled Argument [ 3]&#x2018;From its earliest times, science has operated by being open and transparent about methods and evidence, regardless of which technology has been in vogue.&#x2019; Editorial in Nature [4]</abstract><type>Journal Article</type><journal>The Computer Journal</journal><volume>67</volume><journalNumber>4</journalNumber><paginationStart>1381</paginationStart><paginationEnd>1404</paginationEnd><publisher>Oxford University Press (OUP)</publisher><placeOfPublication/><isbnPrint/><isbnElectronic/><issnPrint>0010-4620</issnPrint><issnElectronic>1460-2067</issnElectronic><keywords>Computational Science, Software Engineering, reproducibility, scientific scrutiny, reproducible analytic pipeline (RAP &amp;amp; RAP+)</keywords><publishedDay>21</publishedDay><publishedMonth>4</publishedMonth><publishedYear>2024</publishedYear><publishedDate>2024-04-21</publishedDate><doi>10.1093/comjnl/bxad067</doi><url/><notes/><college>COLLEGE NANME</college><CollegeCode>COLLEGE CODE</CollegeCode><institution>Swansea University</institution><apcterm>SU Library paid the OA fee (TA Institutional Deal)</apcterm><funders>This work was jointly supported by See Change (M&amp;RA-P), Scotland (an anonymous funder), by the Engineering and Physical Sciences Research Council [grant EP/M022722/1], by the Royal Academy of Engineering through the Engineering X Pandemic Preparedness Programme [grant EXPP2021\1\186] and by Assuring Autonomy International Programme, Assuring Safe AI in Ambulance Service Triage. The funders had no involvement in the research or in this paper.</funders><projectreference/><lastEdited>2024-08-20T13:18:37.6988063</lastEdited><Created>2023-07-12T12:41:37.2553680</Created><path><level id="1">Faculty of Science and Engineering</level><level id="2">School of Mathematics and Computer Science - Computer Science</level></path><authors><author><firstname>Harold</firstname><surname>Thimbleby</surname><orcid>0000-0003-2222-4243</orcid><order>1</order></author></authors><documents><document><filename>63873__31146__df8eaedee365423789341b71662d61d2.pdf</filename><originalFilename>63873.VoR.pdf</originalFilename><uploaded>2024-08-20T13:14:57.5829437</uploaded><type>Output</type><contentLength>1162645</contentLength><contentType>application/pdf</contentType><version>Version of Record</version><cronfaStatus>true</cronfaStatus><documentNotes>&#xA9;The Author(s) 2023. This is an Open Access article distributed under the terms of the Creative Commons Attribution License.</documentNotes><copyrightCorrect>true</copyrightCorrect><language>eng</language><licence>https://creativecommons.org/licenses/by/4.0/</licence></document></documents><OutputDurs/></rfc1807>
spelling 2024-08-20T13:18:37.6988063 v2 63873 2023-07-12 Improving Science That Uses Code c12beb0ab0e333a9a512589d411d17f3 0000-0003-2222-4243 Harold Thimbleby Harold Thimbleby true false 2023-07-12 As code is now an inextricable part of science it should be supported by competent Software Engineering, analogously to statistical claims being properly supported by competent statistics.If and when code avoids adequate scrutiny, science becomes unreliable and unverifiable because results — text, data, graphs, images, etc — depend on untrustworthy code.Currently, scientists rarely assure the quality of the code they rely on, and rarely make it accessible for scrutiny. Even when available, scientists rarely provide adequate documentation to understand or use it reliably.This paper proposes and justifies ways to improve science using code:1. Professional Software Engineers can help, particularly in critical fields such as public health, climate change and energy.2. ‘Software Engineering Boards,’ analogous to Ethics or Institutional Review Boards, should be instigated and used.3. The Reproducible Analytic Pipeline (RAP) methodology can be generalized to cover code and Software Engineering methodologies, in a generalization this paper introduces called RAP+. RAP+ (or comparable interventions) could be supported and or even required in journal, conference and funding body policies.The paper’s Supplemental Material provides a summary of Software Engineering best practice relevant to scientific research, including further suggestions for RAP+ workflows.‘Science is what we understand well enough to explain to a computer.’ Donald E. Knuth in A=B [ 1]‘I have to write to discover what I am doing.’ Flannery O’Connor, quoted in Write for your life [ 2]‘Criticism is the mother of methodology.’ Robert P. Abelson in Statistics as Principled Argument [ 3]‘From its earliest times, science has operated by being open and transparent about methods and evidence, regardless of which technology has been in vogue.’ Editorial in Nature [4] Journal Article The Computer Journal 67 4 1381 1404 Oxford University Press (OUP) 0010-4620 1460-2067 Computational Science, Software Engineering, reproducibility, scientific scrutiny, reproducible analytic pipeline (RAP &amp; RAP+) 21 4 2024 2024-04-21 10.1093/comjnl/bxad067 COLLEGE NANME COLLEGE CODE Swansea University SU Library paid the OA fee (TA Institutional Deal) This work was jointly supported by See Change (M&RA-P), Scotland (an anonymous funder), by the Engineering and Physical Sciences Research Council [grant EP/M022722/1], by the Royal Academy of Engineering through the Engineering X Pandemic Preparedness Programme [grant EXPP2021\1\186] and by Assuring Autonomy International Programme, Assuring Safe AI in Ambulance Service Triage. The funders had no involvement in the research or in this paper. 2024-08-20T13:18:37.6988063 2023-07-12T12:41:37.2553680 Faculty of Science and Engineering School of Mathematics and Computer Science - Computer Science Harold Thimbleby 0000-0003-2222-4243 1 63873__31146__df8eaedee365423789341b71662d61d2.pdf 63873.VoR.pdf 2024-08-20T13:14:57.5829437 Output 1162645 application/pdf Version of Record true ©The Author(s) 2023. This is an Open Access article distributed under the terms of the Creative Commons Attribution License. true eng https://creativecommons.org/licenses/by/4.0/
title Improving Science That Uses Code
spellingShingle Improving Science That Uses Code
Harold Thimbleby
title_short Improving Science That Uses Code
title_full Improving Science That Uses Code
title_fullStr Improving Science That Uses Code
title_full_unstemmed Improving Science That Uses Code
title_sort Improving Science That Uses Code
author_id_str_mv c12beb0ab0e333a9a512589d411d17f3
author_id_fullname_str_mv c12beb0ab0e333a9a512589d411d17f3_***_Harold Thimbleby
author Harold Thimbleby
author2 Harold Thimbleby
format Journal article
container_title The Computer Journal
container_volume 67
container_issue 4
container_start_page 1381
publishDate 2024
institution Swansea University
issn 0010-4620
1460-2067
doi_str_mv 10.1093/comjnl/bxad067
publisher Oxford University Press (OUP)
college_str Faculty of Science and Engineering
hierarchytype
hierarchy_top_id facultyofscienceandengineering
hierarchy_top_title Faculty of Science and Engineering
hierarchy_parent_id facultyofscienceandengineering
hierarchy_parent_title Faculty of Science and Engineering
department_str School of Mathematics and Computer Science - Computer Science{{{_:::_}}}Faculty of Science and Engineering{{{_:::_}}}School of Mathematics and Computer Science - Computer Science
document_store_str 1
active_str 0
description As code is now an inextricable part of science it should be supported by competent Software Engineering, analogously to statistical claims being properly supported by competent statistics.If and when code avoids adequate scrutiny, science becomes unreliable and unverifiable because results — text, data, graphs, images, etc — depend on untrustworthy code.Currently, scientists rarely assure the quality of the code they rely on, and rarely make it accessible for scrutiny. Even when available, scientists rarely provide adequate documentation to understand or use it reliably.This paper proposes and justifies ways to improve science using code:1. Professional Software Engineers can help, particularly in critical fields such as public health, climate change and energy.2. ‘Software Engineering Boards,’ analogous to Ethics or Institutional Review Boards, should be instigated and used.3. The Reproducible Analytic Pipeline (RAP) methodology can be generalized to cover code and Software Engineering methodologies, in a generalization this paper introduces called RAP+. RAP+ (or comparable interventions) could be supported and or even required in journal, conference and funding body policies.The paper’s Supplemental Material provides a summary of Software Engineering best practice relevant to scientific research, including further suggestions for RAP+ workflows.‘Science is what we understand well enough to explain to a computer.’ Donald E. Knuth in A=B [ 1]‘I have to write to discover what I am doing.’ Flannery O’Connor, quoted in Write for your life [ 2]‘Criticism is the mother of methodology.’ Robert P. Abelson in Statistics as Principled Argument [ 3]‘From its earliest times, science has operated by being open and transparent about methods and evidence, regardless of which technology has been in vogue.’ Editorial in Nature [4]
published_date 2024-04-21T08:07:48Z
_version_ 1828001906802294784
score 11.055715