0% Complete
صفحه اصلی
/
سی و دومین کنفرانس بین المللی مهندسی برق
High-Resolution Remote Sensing Image Captioning Based on Structured Attention and SAM Network
نویسندگان :
Yassin Riyazi
1
Seyyed Mostafa Sadjadi
2
Abbas Zohrevand
3
Reshad Hosseini
4
1- دانشگاه تهران
2- دانشگاه تهران
3- دانشگاه تهران
4- دانشگاه تهران
کلمات کلیدی :
image captioning،image segmentation،remote sensing image،structured attention
چکیده :
Due to its broad applications, remote sensing image captioning (RSIC) has gained popularity in recent years. However, it poses extra challenges for containing low-resolution images with highly structured semantic content. By incorporating image labeling and segmentation, this work expands on the RSIC framework developed by Zhao et al. [1]. The method presents a structured attention module that highlights important semantic components to maintain a geometric and structured shape. The quality and edge emphasis of UCM-captioned photographs is improved by upsampling them to 512×512 pixels. Using the Segment Anything Model (SAM) produces better image proposals, leading to higher accuracy than traditional techniques. A balanced output of large- and small-object masks is facilitated by SAM's promptability. The decoder can more easily learn a suitable statistical model using the model's spatial structure to provide an all-encompassing attention map. The effects of multiple hyperparameters, such as teacher forcing, the number of region proposals, and the effects of DSR and AVR loss factors, are investigated in this work. Overall, by combining image labeling and segmentation, this research improves remote sensing capabilities. It also shows how well the structured attention module and SAM work together to improve accuracy and consider different hyperparameter issues.
لیست مقالات
لیست مقالات بایگانی شده
Effects of Derating Factor and Minimum Short Circuit Current on the BOP Cable Sizing of a Power Plant
Hossein Zamanpour abyaneh
Stable Target Tracking in Wireless Sensor Networks Under Malicious Cyber Attacks
Jafar Akhondali - Mohammad Taheri
مدل سازی سینگولار گسسته زمان یک سیستم الکتریکی و کنترل آن به روش الگوریتم یادگیری تکرارشونده
علی غلامی بنادکوکی - طاهره بینازاده
طراحی کنترل کننده مقاوم برای مدل غیرخطی بیماری کووید-19
آرمان مرزبان - الهام امینی بروجنی
Conversion of Linear Polarized Light-to-Orbital Angular Momentum with Variable Topological Charges, Using the Surface Plasmons of Elliptical Holes Etched in a Gold Layer
Amir Mohammad Ghanei - Abolfazl Aghili - Sara Darbari
Design of a plasmonic MIM filter based on ring resonator incorporating circular air holes
Sara Gholinezhad Shafagh - Hassan Kaatuzian - Mohammad Danaie
Design and Practical Implementation of Internal Model Controller for Temperature Regulation of Thermoelectric Cell
Parastoo Kamali - Sanaz Iman Shayan - Mahshid Mousapour - Fatemeh Abdolsamadi - Salar Zeinali - Sadra Rafatnia
FGM Copula based Analysis of Outage Probability for Wireless Three-User Multiple Access Channel with Correlated Channel Coefficients
Mona Sadat Mohsenzadeh - Ghosheh Abed Hodtani
Smartly, reduce the latency of high-priority vehicles using IoT technology
Mahdi Talebi - Masoud Sabaei
Enhancing SCGAN’s Disentangled Representation Learning with Contrastive SSIM Similarity Constraints
Iman Yazdanpanah - Ali Eslamian
بیشتر
ثمین همایش، سامانه مدیریت کنفرانس ها و جشنواره ها - نگارش 42.0.4