Abstract
Accurate monitoring of aquatic vegetation from unmanned aerial vehicle (UAV) imagery remains challenging due to complex water backgrounds, severe inter-class similarity, and the lack of balanced, dual-annotated datasets. Existing studies primarily address segmentation or classification independently, limiting their effectiveness for integrated species-level analysis. To address these gaps, this study proposes a clearly defined attention-enhanced multi-task learning framework that simultaneously performs binary segmentation and 14-class species classification, enabling unified structural and semantic understanding. The model employs a shared encoder with attention-guided skip connections and a joint optimization strategy to enhance feature discrimination while reducing redundancy. Comprehensive ablation analysis demonstrates that attention improves both segmentation and classification performance, while joint learning with Gaussian blur achieves the best overall balance, confirming the complementary role of spatial and semantic features. On a newly collected UAV dataset from diverse wetlands in Bangladesh, the proposed model achieves a Dice coefficient of 0.7344, mIoU of 0.6904, and pixel accuracy of 0.8757 for segmentation, along with 98.77% classification accuracy and an F1-score of 0.9874, indicating strong performance across both tasks. In addition, computational complexity analysis shows that the proposed framework reduces parameters by (sim)50% (31.10M vs. 62.09M), lowers FLOPs (54.66 vs. 96.31 GFLOPs), and improves inference speed by (sim)48.6% compared to deploying separate single-task models for segmentation and classification, demonstrating its suitability for real-time UAV deployment. Furthermore, Gradient-weighted Class Activation Mapping (Grad-CAM) and Grad-CAM++ are employed to provide visual explanations of model predictions, improving interpretability and reliability. The results demonstrate robust performance in complex aquatic environments and highlight the framework’s suitability for large-scale biodiversity monitoring, invasive species detection, and data-driven freshwater ecosystem management.
Similar content being viewed by others
AqUavplant Dataset: A High-Resolution Aquatic Plant Classification and Segmentation Image Dataset Using UAV
Utilizing active learning and attention-CNN to classify vegetation based on UAV multispectral data
DSIA U-Net: deep shallow interaction with attention mechanism UNet for remote sensing satellite images
Funding
Open access funding provided by OsloMet – Oslo Metropolitan University. The authors received no specific funding for this work. This research did not receive any grant from funding agencies in the public, commercial, or not-for-profit sectors.
Author information
Authors and Affiliations
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
Reprints and permissions
About this article
Cite this article
Rahman, A., Syeed, M.M.M., Khan, R.H. et al. Attention-enhanced multi-task learning for binary segmentation and fine-grained aquatic plant classification in UAV imagery.
Sci Rep (2026). https://doi.org/10.1038/s41598-026-51881-5
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-026-51881-5
Source: Ecology - nature.com
