Journal article 505 views 32 downloads
Improving Science That Uses Code
The Computer Journal, Volume: 67, Issue: 4, Pages: 1381 - 1404
Swansea University Author:
Harold Thimbleby
-
PDF | Version of Record
©The Author(s) 2023. This is an Open Access article distributed under the terms of the Creative Commons Attribution License.
Download (1.11MB)
DOI (Published version): 10.1093/comjnl/bxad067
Abstract
As code is now an inextricable part of science it should be supported by competent Software Engineering, analogously to statistical claims being properly supported by competent statistics.If and when code avoids adequate scrutiny, science becomes unreliable and unverifiable because results — text, d...
Published in: | The Computer Journal |
---|---|
ISSN: | 0010-4620 1460-2067 |
Published: |
Oxford University Press (OUP)
2024
|
Online Access: |
Check full text
|
URI: | https://cronfa.swan.ac.uk/Record/cronfa63873 |
first_indexed |
2023-09-05T15:03:31Z |
---|---|
last_indexed |
2024-11-25T14:13:02Z |
id |
cronfa63873 |
recordtype |
SURis |
fullrecord |
<?xml version="1.0"?><rfc1807><datestamp>2024-08-20T13:18:37.6988063</datestamp><bib-version>v2</bib-version><id>63873</id><entry>2023-07-12</entry><title>Improving Science That Uses Code</title><swanseaauthors><author><sid>c12beb0ab0e333a9a512589d411d17f3</sid><ORCID>0000-0003-2222-4243</ORCID><firstname>Harold</firstname><surname>Thimbleby</surname><name>Harold Thimbleby</name><active>true</active><ethesisStudent>false</ethesisStudent></author></swanseaauthors><date>2023-07-12</date><abstract>As code is now an inextricable part of science it should be supported by competent Software Engineering, analogously to statistical claims being properly supported by competent statistics.If and when code avoids adequate scrutiny, science becomes unreliable and unverifiable because results — text, data, graphs, images, etc — depend on untrustworthy code.Currently, scientists rarely assure the quality of the code they rely on, and rarely make it accessible for scrutiny. Even when available, scientists rarely provide adequate documentation to understand or use it reliably.This paper proposes and justifies ways to improve science using code:1. Professional Software Engineers can help, particularly in critical fields such as public health, climate change and energy.2. ‘Software Engineering Boards,’ analogous to Ethics or Institutional Review Boards, should be instigated and used.3. The Reproducible Analytic Pipeline (RAP) methodology can be generalized to cover code and Software Engineering methodologies, in a generalization this paper introduces called RAP+. RAP+ (or comparable interventions) could be supported and or even required in journal, conference and funding body policies.The paper’s Supplemental Material provides a summary of Software Engineering best practice relevant to scientific research, including further suggestions for RAP+ workflows.‘Science is what we understand well enough to explain to a computer.’ Donald E. Knuth in A=B [ 1]‘I have to write to discover what I am doing.’ Flannery O’Connor, quoted in Write for your life [ 2]‘Criticism is the mother of methodology.’ Robert P. Abelson in Statistics as Principled Argument [ 3]‘From its earliest times, science has operated by being open and transparent about methods and evidence, regardless of which technology has been in vogue.’ Editorial in Nature [4]</abstract><type>Journal Article</type><journal>The Computer Journal</journal><volume>67</volume><journalNumber>4</journalNumber><paginationStart>1381</paginationStart><paginationEnd>1404</paginationEnd><publisher>Oxford University Press (OUP)</publisher><placeOfPublication/><isbnPrint/><isbnElectronic/><issnPrint>0010-4620</issnPrint><issnElectronic>1460-2067</issnElectronic><keywords>Computational Science, Software Engineering, reproducibility, scientific scrutiny, reproducible analytic pipeline (RAP &amp; RAP+)</keywords><publishedDay>21</publishedDay><publishedMonth>4</publishedMonth><publishedYear>2024</publishedYear><publishedDate>2024-04-21</publishedDate><doi>10.1093/comjnl/bxad067</doi><url/><notes/><college>COLLEGE NANME</college><CollegeCode>COLLEGE CODE</CollegeCode><institution>Swansea University</institution><apcterm>SU Library paid the OA fee (TA Institutional Deal)</apcterm><funders>This work was jointly supported by See Change (M&RA-P), Scotland (an anonymous funder), by the Engineering and Physical Sciences Research Council [grant EP/M022722/1], by the Royal
Academy of Engineering through the Engineering X Pandemic Preparedness Programme [grant EXPP2021\1\186] and by Assuring Autonomy International Programme, Assuring Safe AI in Ambulance Service Triage. The funders had no involvement in the research or in this paper.</funders><projectreference/><lastEdited>2024-08-20T13:18:37.6988063</lastEdited><Created>2023-07-12T12:41:37.2553680</Created><path><level id="1">Faculty of Science and Engineering</level><level id="2">School of Mathematics and Computer Science - Computer Science</level></path><authors><author><firstname>Harold</firstname><surname>Thimbleby</surname><orcid>0000-0003-2222-4243</orcid><order>1</order></author></authors><documents><document><filename>63873__31146__df8eaedee365423789341b71662d61d2.pdf</filename><originalFilename>63873.VoR.pdf</originalFilename><uploaded>2024-08-20T13:14:57.5829437</uploaded><type>Output</type><contentLength>1162645</contentLength><contentType>application/pdf</contentType><version>Version of Record</version><cronfaStatus>true</cronfaStatus><documentNotes>©The Author(s) 2023. This is an Open Access article distributed under the terms of the Creative Commons Attribution License.</documentNotes><copyrightCorrect>true</copyrightCorrect><language>eng</language><licence>https://creativecommons.org/licenses/by/4.0/</licence></document></documents><OutputDurs/></rfc1807> |
spelling |
2024-08-20T13:18:37.6988063 v2 63873 2023-07-12 Improving Science That Uses Code c12beb0ab0e333a9a512589d411d17f3 0000-0003-2222-4243 Harold Thimbleby Harold Thimbleby true false 2023-07-12 As code is now an inextricable part of science it should be supported by competent Software Engineering, analogously to statistical claims being properly supported by competent statistics.If and when code avoids adequate scrutiny, science becomes unreliable and unverifiable because results — text, data, graphs, images, etc — depend on untrustworthy code.Currently, scientists rarely assure the quality of the code they rely on, and rarely make it accessible for scrutiny. Even when available, scientists rarely provide adequate documentation to understand or use it reliably.This paper proposes and justifies ways to improve science using code:1. Professional Software Engineers can help, particularly in critical fields such as public health, climate change and energy.2. ‘Software Engineering Boards,’ analogous to Ethics or Institutional Review Boards, should be instigated and used.3. The Reproducible Analytic Pipeline (RAP) methodology can be generalized to cover code and Software Engineering methodologies, in a generalization this paper introduces called RAP+. RAP+ (or comparable interventions) could be supported and or even required in journal, conference and funding body policies.The paper’s Supplemental Material provides a summary of Software Engineering best practice relevant to scientific research, including further suggestions for RAP+ workflows.‘Science is what we understand well enough to explain to a computer.’ Donald E. Knuth in A=B [ 1]‘I have to write to discover what I am doing.’ Flannery O’Connor, quoted in Write for your life [ 2]‘Criticism is the mother of methodology.’ Robert P. Abelson in Statistics as Principled Argument [ 3]‘From its earliest times, science has operated by being open and transparent about methods and evidence, regardless of which technology has been in vogue.’ Editorial in Nature [4] Journal Article The Computer Journal 67 4 1381 1404 Oxford University Press (OUP) 0010-4620 1460-2067 Computational Science, Software Engineering, reproducibility, scientific scrutiny, reproducible analytic pipeline (RAP & RAP+) 21 4 2024 2024-04-21 10.1093/comjnl/bxad067 COLLEGE NANME COLLEGE CODE Swansea University SU Library paid the OA fee (TA Institutional Deal) This work was jointly supported by See Change (M&RA-P), Scotland (an anonymous funder), by the Engineering and Physical Sciences Research Council [grant EP/M022722/1], by the Royal Academy of Engineering through the Engineering X Pandemic Preparedness Programme [grant EXPP2021\1\186] and by Assuring Autonomy International Programme, Assuring Safe AI in Ambulance Service Triage. The funders had no involvement in the research or in this paper. 2024-08-20T13:18:37.6988063 2023-07-12T12:41:37.2553680 Faculty of Science and Engineering School of Mathematics and Computer Science - Computer Science Harold Thimbleby 0000-0003-2222-4243 1 63873__31146__df8eaedee365423789341b71662d61d2.pdf 63873.VoR.pdf 2024-08-20T13:14:57.5829437 Output 1162645 application/pdf Version of Record true ©The Author(s) 2023. This is an Open Access article distributed under the terms of the Creative Commons Attribution License. true eng https://creativecommons.org/licenses/by/4.0/ |
title |
Improving Science That Uses Code |
spellingShingle |
Improving Science That Uses Code Harold Thimbleby |
title_short |
Improving Science That Uses Code |
title_full |
Improving Science That Uses Code |
title_fullStr |
Improving Science That Uses Code |
title_full_unstemmed |
Improving Science That Uses Code |
title_sort |
Improving Science That Uses Code |
author_id_str_mv |
c12beb0ab0e333a9a512589d411d17f3 |
author_id_fullname_str_mv |
c12beb0ab0e333a9a512589d411d17f3_***_Harold Thimbleby |
author |
Harold Thimbleby |
author2 |
Harold Thimbleby |
format |
Journal article |
container_title |
The Computer Journal |
container_volume |
67 |
container_issue |
4 |
container_start_page |
1381 |
publishDate |
2024 |
institution |
Swansea University |
issn |
0010-4620 1460-2067 |
doi_str_mv |
10.1093/comjnl/bxad067 |
publisher |
Oxford University Press (OUP) |
college_str |
Faculty of Science and Engineering |
hierarchytype |
|
hierarchy_top_id |
facultyofscienceandengineering |
hierarchy_top_title |
Faculty of Science and Engineering |
hierarchy_parent_id |
facultyofscienceandengineering |
hierarchy_parent_title |
Faculty of Science and Engineering |
department_str |
School of Mathematics and Computer Science - Computer Science{{{_:::_}}}Faculty of Science and Engineering{{{_:::_}}}School of Mathematics and Computer Science - Computer Science |
document_store_str |
1 |
active_str |
0 |
description |
As code is now an inextricable part of science it should be supported by competent Software Engineering, analogously to statistical claims being properly supported by competent statistics.If and when code avoids adequate scrutiny, science becomes unreliable and unverifiable because results — text, data, graphs, images, etc — depend on untrustworthy code.Currently, scientists rarely assure the quality of the code they rely on, and rarely make it accessible for scrutiny. Even when available, scientists rarely provide adequate documentation to understand or use it reliably.This paper proposes and justifies ways to improve science using code:1. Professional Software Engineers can help, particularly in critical fields such as public health, climate change and energy.2. ‘Software Engineering Boards,’ analogous to Ethics or Institutional Review Boards, should be instigated and used.3. The Reproducible Analytic Pipeline (RAP) methodology can be generalized to cover code and Software Engineering methodologies, in a generalization this paper introduces called RAP+. RAP+ (or comparable interventions) could be supported and or even required in journal, conference and funding body policies.The paper’s Supplemental Material provides a summary of Software Engineering best practice relevant to scientific research, including further suggestions for RAP+ workflows.‘Science is what we understand well enough to explain to a computer.’ Donald E. Knuth in A=B [ 1]‘I have to write to discover what I am doing.’ Flannery O’Connor, quoted in Write for your life [ 2]‘Criticism is the mother of methodology.’ Robert P. Abelson in Statistics as Principled Argument [ 3]‘From its earliest times, science has operated by being open and transparent about methods and evidence, regardless of which technology has been in vogue.’ Editorial in Nature [4] |
published_date |
2024-04-21T08:07:48Z |
_version_ |
1828001906802294784 |
score |
11.055715 |