0% Complete
صفحه اصلی
/
سی و دومین کنفرانس بین المللی مهندسی برق
High-Resolution Remote Sensing Image Captioning Based on Structured Attention and SAM Network
نویسندگان :
Yassin Riyazi
1
Seyyed Mostafa Sadjadi
2
Abbas Zohrevand
3
Reshad Hosseini
4
1- دانشگاه تهران
2- دانشگاه تهران
3- دانشگاه تهران
4- دانشگاه تهران
کلمات کلیدی :
image captioning،image segmentation،remote sensing image،structured attention
چکیده :
Due to its broad applications, remote sensing image captioning (RSIC) has gained popularity in recent years. However, it poses extra challenges for containing low-resolution images with highly structured semantic content. By incorporating image labeling and segmentation, this work expands on the RSIC framework developed by Zhao et al. [1]. The method presents a structured attention module that highlights important semantic components to maintain a geometric and structured shape. The quality and edge emphasis of UCM-captioned photographs is improved by upsampling them to 512×512 pixels. Using the Segment Anything Model (SAM) produces better image proposals, leading to higher accuracy than traditional techniques. A balanced output of large- and small-object masks is facilitated by SAM's promptability. The decoder can more easily learn a suitable statistical model using the model's spatial structure to provide an all-encompassing attention map. The effects of multiple hyperparameters, such as teacher forcing, the number of region proposals, and the effects of DSR and AVR loss factors, are investigated in this work. Overall, by combining image labeling and segmentation, this research improves remote sensing capabilities. It also shows how well the structured attention module and SAM work together to improve accuracy and consider different hyperparameter issues.
لیست مقالات
لیست مقالات بایگانی شده
Stable Target Tracking in Wireless Sensor Networks Under Malicious Cyber Attacks
Jafar Akhondali - Mohammad Taheri
Performance analysis under the Independent Fluctuating Two-Ray (IFTR) Fading in RIS-Assisted Millimeter Wave Communications
Maryam Olyaee - Hadi Hashemi - Juan Manuel Romero Jerez
A Novel Model for Student's Mental Health Monitoring Based on Hard and Soft Data Fusion
Mohammad Fatahi - Masoud Alizadeh - Behzad Moshiri
Flexibility Assessment of Virtual Power Plant with Considering Dispatchable Wind Turbine
Mahdi Rahimi - Fatemeh Jahanbani Ardakani - Ali Reza Rahimi
An Improved Nonlinear Observer-Based Integrated Guidance and Control for Hypersonic Flight Vehicle with Angle Constraints
Seyedeh Mahsa Zakipour Bahambari - Saeed Khankalantary
Enhancing Disaster Communication: Multi-UAV Optimization for Efficient Coverage
Amirhossein Solati - Javad Zeraatkar Moghaddam - Mehrdad Ardebilipour
A Technical-Managerial Framework for Determining Periodic Performance Indices and Operating Ranges of Power Grid Frequency
Hamed Delkhosh - Hossein Seifi - Sajjad Gholamnejad - Morteza Yousefian
LPV Controller Design for Trajectory Tracking of Nonholonomic Wheeled Mobile Robots in the Presence of Slip
Mohammad Sabouri - Mohammad Hassan Asemani
Numerical investigation of gain switching in Fano semiconductor lasers
Arash Hodaie - Hassan Kaatuzian - Aref Rasoulzadeh Zali
Fully Soft-Switched Quadratic High Step-Up DC-DC Converter with a Single Switch and Low Input Current Ripple for Renewable Energy Applications
Ali Nadermohammadi - Hamed Abdi - Pouya Abolhassani - Seyed Hossein Hosseini - Mehran Sabahi - Naghi Rostami
بیشتر
ثمین همایش، سامانه مدیریت کنفرانس ها و جشنواره ها - نگارش 43.6.0