0% Complete
صفحه اصلی
/
سی و دومین کنفرانس بین المللی مهندسی برق
High-Resolution Remote Sensing Image Captioning Based on Structured Attention and SAM Network
نویسندگان :
Yassin Riyazi
1
Seyyed Mostafa Sadjadi
2
Abbas Zohrevand
3
Reshad Hosseini
4
1- دانشگاه تهران
2- دانشگاه تهران
3- دانشگاه تهران
4- دانشگاه تهران
کلمات کلیدی :
image captioning،image segmentation،remote sensing image،structured attention
چکیده :
Due to its broad applications, remote sensing image captioning (RSIC) has gained popularity in recent years. However, it poses extra challenges for containing low-resolution images with highly structured semantic content. By incorporating image labeling and segmentation, this work expands on the RSIC framework developed by Zhao et al. [1]. The method presents a structured attention module that highlights important semantic components to maintain a geometric and structured shape. The quality and edge emphasis of UCM-captioned photographs is improved by upsampling them to 512×512 pixels. Using the Segment Anything Model (SAM) produces better image proposals, leading to higher accuracy than traditional techniques. A balanced output of large- and small-object masks is facilitated by SAM's promptability. The decoder can more easily learn a suitable statistical model using the model's spatial structure to provide an all-encompassing attention map. The effects of multiple hyperparameters, such as teacher forcing, the number of region proposals, and the effects of DSR and AVR loss factors, are investigated in this work. Overall, by combining image labeling and segmentation, this research improves remote sensing capabilities. It also shows how well the structured attention module and SAM work together to improve accuracy and consider different hyperparameter issues.
لیست مقالات
لیست مقالات بایگانی شده
Design and Application of a Five-Level Cross-Switched Inverter in Low-Voltage Distribution System Voltage Compensation
Mohammad Farhadi-kangarlu - Yousef Neyshabouri - Asra Sotudeh
A Comprehensive Analysis Method to Improve the Operation of Transmission Networks from the Perspective of Resonance and Ferroresonance phenomena
MohamadAli Amini - Mehdi SALAY NADERI - Ali Asghar Farrokhi Raad - Gevork B. Gharehpetian
Study of Performance Characteristics of a Line-Start Synchronous Reluctance Motor Over its Synchronization Region
Ali Jamali-Fard - Mojtaba Mirsalim
امنیت سایبری در مواجه با تزریق اطلاعات نادرست به سیستم قدرت هوشمند و ارائه راهکار مقابله
مهدی جمشیدی آفارانی - مهرداد عابدی
Sensitive RSNs to Schizophrenia; A graph parameter approach
Shirin Karimian - Farzaneh Keyvanfard - Abbas Nasiraei Moghaddam
Temporary Goal Method: A Solution for the Problem of Getting Stuck in Motion Planning Algorithms
Danial Khan mohamad zade - Samaneh Hosseini Semnani
طراحی ماتریس باتلر 8×4 در ساختارSIW با کاهش سطح گلبرگ جانبی در باند فرکانسی 60GHz
زهرا مهرزاد - غلامرضا مرادی - ایاز قربانی
Designing Music Recommendation System based on music Genre by using Bi-LSTM
Saman Mesghali - Javad Askari
Design and fabrication of a microstrip phase shifter based on liquid crystal
Sadegh Rajabi Doulataabadi - Seyed Hossein Hosseini Biuki - Farid Khoshkhati - Seyed Abbas Jazayeri Moghadas - Mohammad Masoudi Mohammadi - Mehdi Ahmadi-Boroujeni
Investigating Validity and Reliability of The Features Extracted by a 5R Vertical Robot for Arm Motion and Learning Assessment
Sarvenaz Bourbour - Fariba Bahrami Boodelalou - Ghorban Taghizadeh
بیشتر
ثمین همایش، سامانه مدیریت کنفرانس ها و جشنواره ها - نگارش 43.6.0