No Cover Image

Journal article 198 views 1 download

A2D2C: Adaptive attention-driven dynamic convolution for local feature adaptation

Tianyu Zhang Orcid Logo, Fan Wan, Xingyu Miao Orcid Logo, Jingjing Deng Orcid Logo, Xianghua Xie Orcid Logo, Yang Long Orcid Logo

Pattern Recognition, Volume: 180, Issue: Part B, Start page: 113915

Swansea University Author: Xianghua Xie Orcid Logo

  • 71874.VOR.pdf

    PDF | Version of Record

    This is an open access article distributed under the terms of the Creative Commons CC-BY license.

    Download (2.6MB)

Abstract

Dynamic convolution is an advanced deep-learning strategy that enables neural networks to adjust their convolutional kernels dynamically in response to varying input data. This adaptability enhances the network’s efficiency in processing diverse features. However, traditional dynamic convolution tec...

Full description

Published in: Pattern Recognition
ISSN: 0031-3203 1873-5142
Published: Elsevier BV 2026
Online Access: Check full text

URI: https://cronfa.swan.ac.uk/Record/cronfa71874
Abstract: Dynamic convolution is an advanced deep-learning strategy that enables neural networks to adjust their convolutional kernels dynamically in response to varying input data. This adaptability enhances the network’s efficiency in processing diverse features. However, traditional dynamic convolution techniques often overlook the critical role of local features in image classification, resulting in suboptimal performance in capturing fine details and textures necessary for accurate image analysis. To address this, our research introduces Adaptive Attention-Driven Dynamic Convolution (A2D2C), an innovative adaptive adjustment mechanism that focuses on local image features, significantly improving the network’s ability to capture fine details and overall performance. Moreover, our paper proposes a novel dynamic convolution that enhances the network’s feature learning ability by combining the input feature map with multiple convolution kernels to generate the attention weights. Additionally, we develop a streamlined version of our model, named A2D2C+, which significantly increases operational efficiency and reduces computational costs. Experimental evaluations on the ImageNet, CIFAR-100 and COCO datasets demonstrate substantial performance enhancements, underscoring the efficacy and applicability of our approach.
Keywords: Attention; Dynamic convolution; Local features
College: Faculty of Science and Engineering
Funders: This work was supported by the Royal Society International Exchanges Scheme—Towards Collaborative Cloud-Edge Deep Learning Deployment under Grant IEC/NSFC/223523; the National Edge AI Hub for Real Data: Edge Intelligence for Cyber-disturbances and Data Quality under Grant EP/Y028813/1; and the UK Medical Research Council (MRC) Innovation Fellowship under Grant MR/S003916/2.
Issue: Part B
Start Page: 113915