DOI码:10.1016/j.jag.2024.103661
所属单位:School of Remote Sensing and Information Engineering, Wuhan University, China
发表刊物:INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION
关键字:Category attention,Semantic segmentation,Remote sensing images,CNN,Transformer
摘要:The semantic segmentation task is an essential issue in various fields, including land cover classification and cultural heritage investigation. The CNN and Transformer have been widely utilized in semantic segmentation tasks due to notable advancements in deep learning technologies. However, these methodologies may not fully account for remote sensing images' distinctive attributes, including the large intra-class variation and the small inter-class variation. Driven by it, we propose a category attention guided network (CAGNet). Initially, a local feature extraction module is devised to cater to striped objects and features at different scales. Then, we propose a novel concept of category attention for remote sensing images as a feature representation of category differences between pixels. Meanwhile, we designed the Transformer-based and CNN-based category attention guided modules to integrate the proposed category attention into the global scoring functions and local category feature weights, respectively. The network is designed to give more attention to the category features by updating these weights during the training process. Finally, a feature fusion module is developed to integrate global, local, and category multi-scale features and contextual information. A series of extensive experiments along with ablation studies on the UAVid, Vaihingen, and Potsdam datasets indicate that our network outperforms existing methods, including those based on CNN and Transformer.
合写作者:Li, Pengcheng,Zhao, Shaohua, Qingwu, Jiayuan,Wang,Hu, Mingyao,Ai, Shunli
论文类型:期刊论文
学科门类:工学
文献类型:J
卷号:127
ISSN号:1569-8432
是否译文:否
CN号:Scopus:2-s2.0-85183550122,WOS:001171572600001
发表时间:2024-03-01