Journal article 198 views 1 download
A2D2C: Adaptive attention-driven dynamic convolution for local feature adaptation
Pattern Recognition, Volume: 180, Issue: Part B, Start page: 113915
Swansea University Author:
Xianghua Xie
-
PDF | Version of Record
This is an open access article distributed under the terms of the Creative Commons CC-BY license.
Download (2.6MB)
DOI (Published version): 10.1016/j.patcog.2026.113915
Abstract
Dynamic convolution is an advanced deep-learning strategy that enables neural networks to adjust their convolutional kernels dynamically in response to varying input data. This adaptability enhances the network’s efficiency in processing diverse features. However, traditional dynamic convolution tec...
| Published in: | Pattern Recognition |
|---|---|
| ISSN: | 0031-3203 1873-5142 |
| Published: |
Elsevier BV
2026
|
| Online Access: |
Check full text
|
| URI: | https://cronfa.swan.ac.uk/Record/cronfa71874 |
| Abstract: |
Dynamic convolution is an advanced deep-learning strategy that enables neural networks to adjust their convolutional kernels dynamically in response to varying input data. This adaptability enhances the network’s efficiency in processing diverse features. However, traditional dynamic convolution techniques often overlook the critical role of local features in image classification, resulting in suboptimal performance in capturing fine details and textures necessary for accurate image analysis. To address this, our research introduces Adaptive Attention-Driven Dynamic Convolution (A2D2C), an innovative adaptive adjustment mechanism that focuses on local image features, significantly improving the network’s ability to capture fine details and overall performance. Moreover, our paper proposes a novel dynamic convolution that enhances the network’s feature learning ability by combining the input feature map with multiple convolution kernels to generate the attention weights. Additionally, we develop a streamlined version of our model, named A2D2C+, which significantly increases operational efficiency and reduces computational costs. Experimental evaluations on the ImageNet, CIFAR-100 and COCO datasets demonstrate substantial performance enhancements, underscoring the efficacy and applicability of our approach. |
|---|---|
| Keywords: |
Attention; Dynamic convolution; Local features |
| College: |
Faculty of Science and Engineering |
| Funders: |
This work was supported by the Royal Society International Exchanges Scheme—Towards Collaborative Cloud-Edge Deep Learning Deployment under Grant IEC/NSFC/223523; the National Edge AI Hub for Real Data: Edge Intelligence for Cyber-disturbances and Data Quality under Grant EP/Y028813/1; and the UK Medical Research Council (MRC) Innovation Fellowship under Grant MR/S003916/2. |
| Issue: |
Part B |
| Start Page: |
113915 |

