No Cover Image

Journal article 132 views 51 downloads

Visualization strategies to aid interpretation of high-dimensional genotoxicity data

Stephen D. Dertinger, Erica Briggs, Yusuf Hussien, Steven M. Bryce, Svetlana L. Avlasevich, Adam Conrad, George Johnson Orcid Logo, Andrew Williams, Jeffrey C. Bemis

Environmental and Molecular Mutagenesis, Volume: 65, Issue: 5, Pages: 156 - 178

Swansea University Author: George Johnson Orcid Logo

  • 70447.VoR.pdf

    PDF | Version of Record

    © 2024 His Majesty the King in Right of Canada and The Authors. This is an open access article under the terms of the Creative Commons Attribution-NonCommercial-NoDerivs License.

    Download (5.04MB)

Check full text

DOI (Published version): 10.1002/em.22604

Abstract

This article describes a range of high-dimensional data visualization strategies that we have explored for their ability to complement machine learning algorithm predictions derived from MultiFlow® assay results. For this exercise, we focused on seven biomarker responses resulting from the exposure...

Full description

Published in: Environmental and Molecular Mutagenesis
ISSN: 0893-6692 1098-2280
Published: Wiley 2024
Online Access: Check full text

URI: https://cronfa.swan.ac.uk/Record/cronfa70447
first_indexed 2025-09-22T16:02:05Z
last_indexed 2025-10-11T04:30:25Z
id cronfa70447
recordtype SURis
fullrecord <?xml version="1.0"?><rfc1807><datestamp>2025-10-10T15:22:32.0188529</datestamp><bib-version>v2</bib-version><id>70447</id><entry>2025-09-22</entry><title>Visualization strategies to aid interpretation of high-dimensional genotoxicity data</title><swanseaauthors><author><sid>37d0f121db69fd09f364df89e4405e31</sid><ORCID>0000-0001-5643-9942</ORCID><firstname>George</firstname><surname>Johnson</surname><name>George Johnson</name><active>true</active><ethesisStudent>false</ethesisStudent></author></swanseaauthors><date>2025-09-22</date><deptcode>MEDS</deptcode><abstract>This article describes a range of high-dimensional data visualization strategies that we have explored for their ability to complement machine learning algorithm predictions derived from MultiFlow&#xAE; assay results. For this exercise, we focused on seven biomarker responses resulting from the exposure of TK6 cells to each of 126 diverse chemicals over a range of concentrations. Obviously, challenges associated with visualizing seven biomarker responses were further complicated whenever there was a desire to represent the entire 126 chemical data set as opposed to results from a single chemical. Scatter plots, spider plots, parallel coordinate plots, hierarchical clustering, principal component analysis, toxicological prioritization index, multidimensional scaling, t-distributed stochastic neighbor embedding, and uniform manifold approximation and projection are each considered in turn. Our report provides a comparative analysis of these techniques. In an era where multiplexed assays and machine learning algorithms are becoming the norm, stakeholders should find some of these visualization strategies useful for efficiently and effectively interpreting their high-dimensional data.</abstract><type>Journal Article</type><journal>Environmental and Molecular Mutagenesis</journal><volume>65</volume><journalNumber>5</journalNumber><paginationStart>156</paginationStart><paginationEnd>178</paginationEnd><publisher>Wiley</publisher><placeOfPublication/><isbnPrint/><isbnElectronic/><issnPrint>0893-6692</issnPrint><issnElectronic>1098-2280</issnElectronic><keywords>dimensionality reduction, hierarchical clustering, multidimensional scaling, parallel coordinate plots, principal component analysis, spider plots, ToxPi, t-distributed stochastic neighbor embedding, uniform manifold approximation</keywords><publishedDay>1</publishedDay><publishedMonth>6</publishedMonth><publishedYear>2024</publishedYear><publishedDate>2024-06-01</publishedDate><doi>10.1002/em.22604</doi><url/><notes/><college>COLLEGE NANME</college><department>Medical School</department><CollegeCode>COLLEGE CODE</CollegeCode><DepartmentCode>MEDS</DepartmentCode><institution>Swansea University</institution><apcterm>Another institution paid the OA fee</apcterm><funders>National Institute of Environmental Health Sciences, Grant/Award Number: R44ES033138</funders><projectreference/><lastEdited>2025-10-10T15:22:32.0188529</lastEdited><Created>2025-09-22T11:06:02.6521849</Created><path><level id="1">Faculty of Medicine, Health and Life Sciences</level><level id="2">Swansea University Medical School - Biomedical Science</level></path><authors><author><firstname>Stephen D.</firstname><surname>Dertinger</surname><order>1</order></author><author><firstname>Erica</firstname><surname>Briggs</surname><order>2</order></author><author><firstname>Yusuf</firstname><surname>Hussien</surname><order>3</order></author><author><firstname>Steven M.</firstname><surname>Bryce</surname><order>4</order></author><author><firstname>Svetlana L.</firstname><surname>Avlasevich</surname><order>5</order></author><author><firstname>Adam</firstname><surname>Conrad</surname><order>6</order></author><author><firstname>George</firstname><surname>Johnson</surname><orcid>0000-0001-5643-9942</orcid><order>7</order></author><author><firstname>Andrew</firstname><surname>Williams</surname><order>8</order></author><author><firstname>Jeffrey C.</firstname><surname>Bemis</surname><order>9</order></author></authors><documents><document><filename>70447__35312__51cbe887018142aca10df35905ef9ffd.pdf</filename><originalFilename>70447.VoR.pdf</originalFilename><uploaded>2025-10-10T15:19:39.5775920</uploaded><type>Output</type><contentLength>5280846</contentLength><contentType>application/pdf</contentType><version>Version of Record</version><cronfaStatus>true</cronfaStatus><documentNotes>&#xA9; 2024 His Majesty the King in Right of Canada and The Authors. This is an open access article under the terms of the Creative Commons Attribution-NonCommercial-NoDerivs License.</documentNotes><copyrightCorrect>true</copyrightCorrect><language>eng</language><licence>http://creativecommons.org/licenses/by-nc-nd/4.0/</licence></document></documents><OutputDurs/></rfc1807>
spelling 2025-10-10T15:22:32.0188529 v2 70447 2025-09-22 Visualization strategies to aid interpretation of high-dimensional genotoxicity data 37d0f121db69fd09f364df89e4405e31 0000-0001-5643-9942 George Johnson George Johnson true false 2025-09-22 MEDS This article describes a range of high-dimensional data visualization strategies that we have explored for their ability to complement machine learning algorithm predictions derived from MultiFlow® assay results. For this exercise, we focused on seven biomarker responses resulting from the exposure of TK6 cells to each of 126 diverse chemicals over a range of concentrations. Obviously, challenges associated with visualizing seven biomarker responses were further complicated whenever there was a desire to represent the entire 126 chemical data set as opposed to results from a single chemical. Scatter plots, spider plots, parallel coordinate plots, hierarchical clustering, principal component analysis, toxicological prioritization index, multidimensional scaling, t-distributed stochastic neighbor embedding, and uniform manifold approximation and projection are each considered in turn. Our report provides a comparative analysis of these techniques. In an era where multiplexed assays and machine learning algorithms are becoming the norm, stakeholders should find some of these visualization strategies useful for efficiently and effectively interpreting their high-dimensional data. Journal Article Environmental and Molecular Mutagenesis 65 5 156 178 Wiley 0893-6692 1098-2280 dimensionality reduction, hierarchical clustering, multidimensional scaling, parallel coordinate plots, principal component analysis, spider plots, ToxPi, t-distributed stochastic neighbor embedding, uniform manifold approximation 1 6 2024 2024-06-01 10.1002/em.22604 COLLEGE NANME Medical School COLLEGE CODE MEDS Swansea University Another institution paid the OA fee National Institute of Environmental Health Sciences, Grant/Award Number: R44ES033138 2025-10-10T15:22:32.0188529 2025-09-22T11:06:02.6521849 Faculty of Medicine, Health and Life Sciences Swansea University Medical School - Biomedical Science Stephen D. Dertinger 1 Erica Briggs 2 Yusuf Hussien 3 Steven M. Bryce 4 Svetlana L. Avlasevich 5 Adam Conrad 6 George Johnson 0000-0001-5643-9942 7 Andrew Williams 8 Jeffrey C. Bemis 9 70447__35312__51cbe887018142aca10df35905ef9ffd.pdf 70447.VoR.pdf 2025-10-10T15:19:39.5775920 Output 5280846 application/pdf Version of Record true © 2024 His Majesty the King in Right of Canada and The Authors. This is an open access article under the terms of the Creative Commons Attribution-NonCommercial-NoDerivs License. true eng http://creativecommons.org/licenses/by-nc-nd/4.0/
title Visualization strategies to aid interpretation of high-dimensional genotoxicity data
spellingShingle Visualization strategies to aid interpretation of high-dimensional genotoxicity data
George Johnson
title_short Visualization strategies to aid interpretation of high-dimensional genotoxicity data
title_full Visualization strategies to aid interpretation of high-dimensional genotoxicity data
title_fullStr Visualization strategies to aid interpretation of high-dimensional genotoxicity data
title_full_unstemmed Visualization strategies to aid interpretation of high-dimensional genotoxicity data
title_sort Visualization strategies to aid interpretation of high-dimensional genotoxicity data
author_id_str_mv 37d0f121db69fd09f364df89e4405e31
author_id_fullname_str_mv 37d0f121db69fd09f364df89e4405e31_***_George Johnson
author George Johnson
author2 Stephen D. Dertinger
Erica Briggs
Yusuf Hussien
Steven M. Bryce
Svetlana L. Avlasevich
Adam Conrad
George Johnson
Andrew Williams
Jeffrey C. Bemis
format Journal article
container_title Environmental and Molecular Mutagenesis
container_volume 65
container_issue 5
container_start_page 156
publishDate 2024
institution Swansea University
issn 0893-6692
1098-2280
doi_str_mv 10.1002/em.22604
publisher Wiley
college_str Faculty of Medicine, Health and Life Sciences
hierarchytype
hierarchy_top_id facultyofmedicinehealthandlifesciences
hierarchy_top_title Faculty of Medicine, Health and Life Sciences
hierarchy_parent_id facultyofmedicinehealthandlifesciences
hierarchy_parent_title Faculty of Medicine, Health and Life Sciences
department_str Swansea University Medical School - Biomedical Science{{{_:::_}}}Faculty of Medicine, Health and Life Sciences{{{_:::_}}}Swansea University Medical School - Biomedical Science
document_store_str 1
active_str 0
description This article describes a range of high-dimensional data visualization strategies that we have explored for their ability to complement machine learning algorithm predictions derived from MultiFlow® assay results. For this exercise, we focused on seven biomarker responses resulting from the exposure of TK6 cells to each of 126 diverse chemicals over a range of concentrations. Obviously, challenges associated with visualizing seven biomarker responses were further complicated whenever there was a desire to represent the entire 126 chemical data set as opposed to results from a single chemical. Scatter plots, spider plots, parallel coordinate plots, hierarchical clustering, principal component analysis, toxicological prioritization index, multidimensional scaling, t-distributed stochastic neighbor embedding, and uniform manifold approximation and projection are each considered in turn. Our report provides a comparative analysis of these techniques. In an era where multiplexed assays and machine learning algorithms are becoming the norm, stakeholders should find some of these visualization strategies useful for efficiently and effectively interpreting their high-dimensional data.
published_date 2024-06-01T05:30:54Z
_version_ 1851098036750516224
score 11.089572