Conference Paper/Proceeding/Abstract 263 views 58 downloads
Guided Latent Diffusion for Universal Medical Image Segmentation
International Conference on AI-generated Content
Swansea University Authors:
Chen Hu, Hans Ren, Xianghua Xie
-
PDF | Accepted Manuscript
Author accepted manuscript document released under the terms of a Creative Commons CC-BY licence using the Swansea University Research Publications Policy (rights retention).
Download (1.35MB)
Abstract
Deep learning based medical segmentation still presents a great challenge due to the lack of large-scale datasets in the medical domain. The existing publicly available datasets vary significantly in terms of imaging modalities and target anatomies. This paper presents a novel guided latent diffusio...
Published in: | International Conference on AI-generated Content |
---|---|
Published: |
|
URI: | https://cronfa.swan.ac.uk/Record/cronfa68384 |
first_indexed |
2024-11-29T13:46:46Z |
---|---|
last_indexed |
2025-04-25T05:18:26Z |
id |
cronfa68384 |
recordtype |
SURis |
fullrecord |
<?xml version="1.0"?><rfc1807><datestamp>2025-04-24T12:48:40.3268806</datestamp><bib-version>v2</bib-version><id>68384</id><entry>2024-11-29</entry><title>Guided Latent Diffusion for Universal Medical Image Segmentation</title><swanseaauthors><author><sid>55d3ba5f8378c2e3439d7e3962aee726</sid><firstname>Chen</firstname><surname>Hu</surname><name>Chen Hu</name><active>true</active><ethesisStudent>false</ethesisStudent></author><author><sid>9e043b899a2b786672a28ed4f864ffcc</sid><firstname>Hans</firstname><surname>Ren</surname><name>Hans Ren</name><active>true</active><ethesisStudent>false</ethesisStudent></author><author><sid>b334d40963c7a2f435f06d2c26c74e11</sid><ORCID>0000-0002-2701-8660</ORCID><firstname>Xianghua</firstname><surname>Xie</surname><name>Xianghua Xie</name><active>true</active><ethesisStudent>false</ethesisStudent></author></swanseaauthors><date>2024-11-29</date><deptcode>MACS</deptcode><abstract>Deep learning based medical segmentation still presents a great challenge due to the lack of large-scale datasets in the medical domain. The existing publicly available datasets vary significantly in terms of imaging modalities and target anatomies. This paper presents a novel guided latent diffusion model for universal medical segmentation, capable of segmenting diverse anatomical structures using a single and unified architecture. Given a Contrastive Language-Image Pretraining (CLIP) embedding specifying the target anatomical structure, the proposed model leverages a collection of datasets covering the diverse structures which can segment any anatomical targets that are presented in the aggregated data. By performing diffusion fully in latent space, we achieve comparable results to pixel-space diffusion with significantly lower computational cost. The proposed methods demonstrates competitive performance against existing deep learning-based discriminative approaches on several benchmarks. Furthermore, it shows strong generalization capabilities on unseen datasets.</abstract><type>Conference Paper/Proceeding/Abstract</type><journal>International Conference on AI-generated Content</journal><volume/><journalNumber/><paginationStart/><paginationEnd/><publisher/><placeOfPublication/><isbnPrint/><isbnElectronic/><issnPrint/><issnElectronic/><keywords/><publishedDay>0</publishedDay><publishedMonth>0</publishedMonth><publishedYear>0</publishedYear><publishedDate>0001-01-01</publishedDate><doi/><url/><notes/><college>COLLEGE NANME</college><department>Mathematics and Computer Science School</department><CollegeCode>COLLEGE CODE</CollegeCode><DepartmentCode>MACS</DepartmentCode><institution>Swansea University</institution><apcterm>Not Required</apcterm><funders/><projectreference/><lastEdited>2025-04-24T12:48:40.3268806</lastEdited><Created>2024-11-29T11:08:49.4467788</Created><path><level id="1">Faculty of Science and Engineering</level><level id="2">School of Mathematics and Computer Science - Computer Science</level></path><authors><author><firstname>Mattia</firstname><surname>Salsi</surname><order>1</order></author><author><firstname>Yunying</firstname><surname>Wang</surname><order>2</order></author><author><firstname>Chen</firstname><surname>Hu</surname><order>3</order></author><author><firstname>Hans</firstname><surname>Ren</surname><order>4</order></author><author><firstname>Jingjing</firstname><surname>Deng</surname><order>5</order></author><author><firstname>Xianghua</firstname><surname>Xie</surname><orcid>0000-0002-2701-8660</orcid><order>6</order></author></authors><documents><document><filename>68384__32999__99c56a38d1b94664a21f677fbcd214d3.pdf</filename><originalFilename>Diffusion_Segmentation.pdf</originalFilename><uploaded>2024-11-29T11:13:45.7541122</uploaded><type>Output</type><contentLength>1417819</contentLength><contentType>application/pdf</contentType><version>Accepted Manuscript</version><cronfaStatus>true</cronfaStatus><documentNotes>Author accepted manuscript document released under the terms of a Creative Commons CC-BY licence using the Swansea University Research Publications Policy (rights retention).</documentNotes><copyrightCorrect>true</copyrightCorrect><language>eng</language><licence>https://creativecommons.org/licenses/by/4.0/deed.en</licence></document></documents><OutputDurs/></rfc1807> |
spelling |
2025-04-24T12:48:40.3268806 v2 68384 2024-11-29 Guided Latent Diffusion for Universal Medical Image Segmentation 55d3ba5f8378c2e3439d7e3962aee726 Chen Hu Chen Hu true false 9e043b899a2b786672a28ed4f864ffcc Hans Ren Hans Ren true false b334d40963c7a2f435f06d2c26c74e11 0000-0002-2701-8660 Xianghua Xie Xianghua Xie true false 2024-11-29 MACS Deep learning based medical segmentation still presents a great challenge due to the lack of large-scale datasets in the medical domain. The existing publicly available datasets vary significantly in terms of imaging modalities and target anatomies. This paper presents a novel guided latent diffusion model for universal medical segmentation, capable of segmenting diverse anatomical structures using a single and unified architecture. Given a Contrastive Language-Image Pretraining (CLIP) embedding specifying the target anatomical structure, the proposed model leverages a collection of datasets covering the diverse structures which can segment any anatomical targets that are presented in the aggregated data. By performing diffusion fully in latent space, we achieve comparable results to pixel-space diffusion with significantly lower computational cost. The proposed methods demonstrates competitive performance against existing deep learning-based discriminative approaches on several benchmarks. Furthermore, it shows strong generalization capabilities on unseen datasets. Conference Paper/Proceeding/Abstract International Conference on AI-generated Content 0 0 0 0001-01-01 COLLEGE NANME Mathematics and Computer Science School COLLEGE CODE MACS Swansea University Not Required 2025-04-24T12:48:40.3268806 2024-11-29T11:08:49.4467788 Faculty of Science and Engineering School of Mathematics and Computer Science - Computer Science Mattia Salsi 1 Yunying Wang 2 Chen Hu 3 Hans Ren 4 Jingjing Deng 5 Xianghua Xie 0000-0002-2701-8660 6 68384__32999__99c56a38d1b94664a21f677fbcd214d3.pdf Diffusion_Segmentation.pdf 2024-11-29T11:13:45.7541122 Output 1417819 application/pdf Accepted Manuscript true Author accepted manuscript document released under the terms of a Creative Commons CC-BY licence using the Swansea University Research Publications Policy (rights retention). true eng https://creativecommons.org/licenses/by/4.0/deed.en |
title |
Guided Latent Diffusion for Universal Medical Image Segmentation |
spellingShingle |
Guided Latent Diffusion for Universal Medical Image Segmentation Chen Hu Hans Ren Xianghua Xie |
title_short |
Guided Latent Diffusion for Universal Medical Image Segmentation |
title_full |
Guided Latent Diffusion for Universal Medical Image Segmentation |
title_fullStr |
Guided Latent Diffusion for Universal Medical Image Segmentation |
title_full_unstemmed |
Guided Latent Diffusion for Universal Medical Image Segmentation |
title_sort |
Guided Latent Diffusion for Universal Medical Image Segmentation |
author_id_str_mv |
55d3ba5f8378c2e3439d7e3962aee726 9e043b899a2b786672a28ed4f864ffcc b334d40963c7a2f435f06d2c26c74e11 |
author_id_fullname_str_mv |
55d3ba5f8378c2e3439d7e3962aee726_***_Chen Hu 9e043b899a2b786672a28ed4f864ffcc_***_Hans Ren b334d40963c7a2f435f06d2c26c74e11_***_Xianghua Xie |
author |
Chen Hu Hans Ren Xianghua Xie |
author2 |
Mattia Salsi Yunying Wang Chen Hu Hans Ren Jingjing Deng Xianghua Xie |
format |
Conference Paper/Proceeding/Abstract |
container_title |
International Conference on AI-generated Content |
institution |
Swansea University |
college_str |
Faculty of Science and Engineering |
hierarchytype |
|
hierarchy_top_id |
facultyofscienceandengineering |
hierarchy_top_title |
Faculty of Science and Engineering |
hierarchy_parent_id |
facultyofscienceandengineering |
hierarchy_parent_title |
Faculty of Science and Engineering |
department_str |
School of Mathematics and Computer Science - Computer Science{{{_:::_}}}Faculty of Science and Engineering{{{_:::_}}}School of Mathematics and Computer Science - Computer Science |
document_store_str |
1 |
active_str |
0 |
description |
Deep learning based medical segmentation still presents a great challenge due to the lack of large-scale datasets in the medical domain. The existing publicly available datasets vary significantly in terms of imaging modalities and target anatomies. This paper presents a novel guided latent diffusion model for universal medical segmentation, capable of segmenting diverse anatomical structures using a single and unified architecture. Given a Contrastive Language-Image Pretraining (CLIP) embedding specifying the target anatomical structure, the proposed model leverages a collection of datasets covering the diverse structures which can segment any anatomical targets that are presented in the aggregated data. By performing diffusion fully in latent space, we achieve comparable results to pixel-space diffusion with significantly lower computational cost. The proposed methods demonstrates competitive performance against existing deep learning-based discriminative approaches on several benchmarks. Furthermore, it shows strong generalization capabilities on unseen datasets. |
published_date |
0001-01-01T13:58:08Z |
_version_ |
1831919617634729984 |
score |
11.059359 |