0% Complete
صفحه اصلی
/
سی و دومین کنفرانس بین المللی مهندسی برق
High-Resolution Remote Sensing Image Captioning Based on Structured Attention and SAM Network
نویسندگان :
Yassin Riyazi
1
Seyyed Mostafa Sadjadi
2
Abbas Zohrevand
3
Reshad Hosseini
4
1- دانشگاه تهران
2- دانشگاه تهران
3- دانشگاه تهران
4- دانشگاه تهران
کلمات کلیدی :
image captioning،image segmentation،remote sensing image،structured attention
چکیده :
Due to its broad applications, remote sensing image captioning (RSIC) has gained popularity in recent years. However, it poses extra challenges for containing low-resolution images with highly structured semantic content. By incorporating image labeling and segmentation, this work expands on the RSIC framework developed by Zhao et al. [1]. The method presents a structured attention module that highlights important semantic components to maintain a geometric and structured shape. The quality and edge emphasis of UCM-captioned photographs is improved by upsampling them to 512×512 pixels. Using the Segment Anything Model (SAM) produces better image proposals, leading to higher accuracy than traditional techniques. A balanced output of large- and small-object masks is facilitated by SAM's promptability. The decoder can more easily learn a suitable statistical model using the model's spatial structure to provide an all-encompassing attention map. The effects of multiple hyperparameters, such as teacher forcing, the number of region proposals, and the effects of DSR and AVR loss factors, are investigated in this work. Overall, by combining image labeling and segmentation, this research improves remote sensing capabilities. It also shows how well the structured attention module and SAM work together to improve accuracy and consider different hyperparameter issues.
لیست مقالات
لیست مقالات بایگانی شده
Application of Metaheurestic Optimization Algorithms for Feature Selection in Text Classification
Elham Nazari - Nafise Haghshenas - Alireza Basiri - Mohammad Reza Ahmadzadeh
Medical Ultrasound Image Restoration in Presence of Defective Transducer Elements
Mohammad Saeed Zare Dehabadi - Mehran Jahed
T-type L-2L De-Embedding Method for On-Wafer T-model Transmission Line Network
Milad Seyedi - Nasser Masoumi - Samad Sheikhaei
طراحی کنترلکننده مد لغزشی دینامیک برای سیستم تعلیق فعال غیر خطی با عملگر غیرایدهآل
مونا عظیمی - الهه مرادی
Recurrence Quantification and Machine Learning: A Novel Approach for Parkinson’s Disease Diagnosis from EEG Signals
Asghar Zarei - Alireza Talesh Jafadideh
Brain Effective Connectivity Comparision in Different States of Familiarity and Desiring Brands Confrontation: a Neuromarketing Study
Mahdi Taghaddossi - Mohammad Hasan Moradi
On the Correction of the Boundary Deformation Errors in Microwave Imaging With Spatial Priors
Seyyed Mohammad Hosseini - Amir Ahmad Shishegar
بهبود پردازش وفقی فضا-زمان (STAP) در سیستمهای رادار هوابرد با استفاده از الگوریتمهای آگاه به تنک بودن (Sparsity) سیستم
علی شیخیان - سارا میهن دوست - نعمت الله عزتی - احسان مصطفی پور
SAR Images Clustering Based on Modified Nonlinear Orthogonal non-Negative Matrix Factorization (NMF)
Mahdi Jowkar dehouei - Soolmaz Khazandi - Yaser Norouzi
Area-Efficient Partially-Pipelined Architecture for Fast-SSC Decoding of Polar Codes
Mehdi Saeidi - Matin Hashemi
ثمین همایش، سامانه مدیریت کنفرانس ها و جشنواره ها - نگارش 40.3.1