32nd International Conference on Electrical Engineering
High-Resolution Remote Sensing Image Captioning Based on Structured Attention and SAM Network
Authors:
Yassin Riyazi 1
Seyyed Mostafa Sadjadi 2
Abbas Zohrevand 3
Reshad Hosseini 4
1- University of Tehran
2- University of Tehran
3- University of Tehran
4- University of Tehran
Keywords:
image captioning, image segmentation, remote sensing image, structured attention
Abstract:
Remote sensing image captioning (RSIC) has gained popularity in recent years because of its broad applications. However, it poses extra challenges because the images are low-resolution yet carry highly structured semantic content. This work extends the RSIC framework developed by Zhao et al. [1] by incorporating image labeling and segmentation. The method uses a structured attention module that highlights important semantic components while preserving their geometric, structured shape. UCM-Captions images are upsampled to 512×512 pixels to improve their quality and edge emphasis. Using the Segment Anything Model (SAM) produces better image proposals, leading to higher accuracy than traditional techniques, and SAM's promptability yields a balanced output of large- and small-object masks. Using the spatial structure to provide an all-encompassing attention map makes it easier for the decoder to learn a suitable statistical model. The work also investigates the effects of several hyperparameters: teacher forcing, the number of region proposals, and the DSR and AVR loss factors. Overall, the results show how the structured attention module and SAM work together to improve captioning accuracy.
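The idea of attending to whole segmented regions rather than to individual pixels can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `region_attention_map`, the mean pooling, the dot-product scoring, and the toy masks are all assumptions chosen for illustration. In the described pipeline the masks would come from SAM; here they are hand-written.

```python
import math


def region_attention_map(features, masks, query):
    """Spread attention over whole regions instead of individual pixels.

    features: H x W grid of C-dimensional feature vectors (nested lists).
    masks:    list of H x W binary grids, one per region proposal.
    query:    C-dimensional vector standing in for the decoder state
              (a hypothetical stand-in, not taken from the paper).
    Returns an H x W attention map whose entries sum to 1.
    """
    height, width = len(features), len(features[0])
    scores, areas = [], []
    for mask in masks:
        pixels = [(r, c) for r in range(height) for c in range(width) if mask[r][c]]
        if not pixels:
            raise ValueError("each region mask must cover at least one pixel")
        # Mean-pool the features inside the region.
        pooled = [
            sum(features[r][c][k] for r, c in pixels) / len(pixels)
            for k in range(len(query))
        ]
        # Score the region by its dot product with the query.
        scores.append(sum(p * q for p, q in zip(pooled, query)))
        areas.append(len(pixels))

    # Softmax over regions (shifted by the max for numerical stability).
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    weights = [e / total for e in exps]

    # Each region's weight is shared uniformly by the pixels it covers,
    # so the attention map keeps the shape of the region masks.
    attention = [[0.0] * width for _ in range(height)]
    for mask, weight, area in zip(masks, weights, areas):
        for r in range(height):
            for c in range(width):
                if mask[r][c]:
                    attention[r][c] += weight / area
    return attention


# Toy 2 x 2 image with 2-dimensional features and two hand-written masks.
features = [[[1.0, 0.0], [1.0, 0.0]],
            [[0.0, 1.0], [0.0, 1.0]]]
masks = [[[1, 1], [0, 0]],   # region covering the top row
         [[0, 0], [1, 1]]]   # region covering the bottom row
attention = region_attention_map(features, masks, query=[2.0, 0.0])
```

In this toy case the query matches the top-row features, so the top region receives the larger share of attention, and every pixel inside a region receives the same value.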