32nd International Conference on Electrical Engineering
High-Resolution Remote Sensing Image Captioning Based on Structured Attention and SAM Network
Authors:
Yassin Riyazi (1), Seyyed Mostafa Sadjadi (2), Abbas Zohrevand (3), Reshad Hosseini (4)
1- University of Tehran
2- University of Tehran
3- University of Tehran
4- University of Tehran
Keywords:
image captioning, image segmentation, remote sensing image, structured attention
Abstract:
Remote sensing image captioning (RSIC) has gained popularity in recent years because of its broad applications. It remains challenging, however, because remote sensing images are often low in resolution while their semantic content is highly structured. This work extends the RSIC framework of Zhao et al. [1] by incorporating image labeling and segmentation. The method uses a structured attention module that highlights important semantic components while preserving their geometric and structured shape. Images from the UCM-captions dataset are upsampled to 512×512 pixels to improve their quality and edge emphasis. The Segment Anything Model (SAM) produces better region proposals than traditional techniques, which leads to higher accuracy, and its promptability facilitates a balanced output of large- and small-object masks. By using the model's spatial structure to provide an all-encompassing attention map, the decoder can more easily learn a suitable statistical model. The work also investigates the effects of several hyperparameters, including teacher forcing, the number of region proposals, and the DSR and AVR loss factors. Overall, combining image labeling and segmentation improves remote sensing image captioning, and the results show that the structured attention module and SAM work well together to improve accuracy across the hyperparameter settings examined.
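The abstract does not give the equations of the structured attention module, so the following is only a minimal sketch of the general idea: attention is computed per segmented region and then spread back over that region's pixels, so the attention map keeps the shape of the regions. The function name, the projection matrices `w_feat` and `w_hid`, the additive scoring rule, and the synthetic masks are all assumptions made for illustration; they are not taken from the paper, and in practice the masks would come from a segmenter such as SAM.

```python
import numpy as np

def structured_attention(features, masks, hidden, w_feat, w_hid):
    """Region-level attention spread back over pixels (illustrative sketch).

    features: (H, W, C) image feature map
    masks:    (K, H, W) binary region masks (assumed given, e.g. by a segmenter)
    hidden:   (D,) decoder hidden state
    w_feat:   (C, A) and w_hid: (D, A) are hypothetical projection matrices
    """
    masks = masks.astype(np.float64)
    # Pixel count per region; clamp to 1 so empty masks do not divide by zero.
    areas = np.maximum(masks.sum(axis=(1, 2)), 1.0)            # (K,)
    # Mask-average pooling: one feature vector per region.
    region_feats = np.einsum("khw,hwc->kc", masks, features) / areas[:, None]
    # Assumed additive-style score for each region given the decoder state.
    scores = np.tanh(region_feats @ w_feat + hidden @ w_hid).sum(axis=1)
    scores -= scores.max()                                     # numerical stability
    weights = np.exp(scores) / np.exp(scores).sum()            # softmax over regions
    # Each region's weight is spread uniformly over its own pixels.
    pixel_map = np.einsum("k,khw->hw", weights / areas, masks)
    # Context vector passed to the decoder: weighted sum of region features.
    context = weights @ region_feats
    return weights, pixel_map, context
```

With non-empty masks, the region weights and the resulting pixel-level map each sum to one, and the map is constant inside every region, which is one simple way to keep the attention aligned with object boundaries.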