32nd International Conference on Electrical Engineering
High-Resolution Remote Sensing Image Captioning Based on Structured Attention and SAM Network
Authors:
Yassin Riyazi (1), Seyyed Mostafa Sadjadi (2), Abbas Zohrevand (3), Reshad Hosseini (4)
1- University of Tehran
2- University of Tehran
3- University of Tehran
4- University of Tehran
Keywords:
image captioning, image segmentation, remote sensing image, structured attention
Abstract:
Remote sensing image captioning (RSIC) has gained popularity in recent years because of its broad applications. It poses extra challenges, however, because the images are low-resolution and their semantic content is highly structured. This work extends the RSIC framework of Zhao et al. [1] by incorporating image labeling and segmentation. The method uses a structured attention module that highlights important semantic components while preserving their geometric, structured shape. Images from the UCM-captions dataset are upsampled to 512×512 pixels to improve their quality and emphasize edges. The Segment Anything Model (SAM) produces better image proposals, leading to higher accuracy than traditional techniques, and its promptability facilitates a balanced output of large- and small-object masks. By using the model's spatial structure to provide an all-encompassing attention map, the method makes it easier for the decoder to learn a suitable statistical model. The work also investigates the effects of several hyperparameters: teacher forcing, the number of region proposals, and the DSR and AVR loss factors. Overall, combining image labeling and segmentation improves remote sensing image captioning, and the results show that the structured attention module and SAM work together to improve accuracy across the hyperparameter settings considered.
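The idea of attending over segmentation regions rather than individual pixels can be illustrated with a small sketch. This is not the authors' implementation: the function names (`region_pool`, `structured_attention`), the dot-product scoring, and the toy data are all assumptions made for illustration, and the binary masks stand in for region proposals such as those a segmentation model might return.

```python
import math

def region_pool(features, mask):
    """Average the feature vectors of the pixels covered by a binary mask."""
    dim = len(features[0][0])
    total = [0.0] * dim
    count = 0
    for r, row in enumerate(mask):
        for c, inside in enumerate(row):
            if inside:
                count += 1
                for k in range(dim):
                    total[k] += features[r][c][k]
    if count == 0:
        return total  # empty mask: return the zero vector
    return [t / count for t in total]

def softmax(scores):
    """Numerically stable softmax over a list of scores."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    z = sum(exps)
    return [e / z for e in exps]

def structured_attention(features, masks, query):
    """Hypothetical region-level attention (assumed dot-product scoring).

    features: H x W x D nested lists, masks: list of H x W binary masks,
    query: length-D vector (for example, a decoder state).
    Returns the per-region weights and an H x W attention map in which
    every pixel of a region shares that region's weight.
    """
    pooled = [region_pool(features, m) for m in masks]
    scores = [sum(p * q for p, q in zip(vec, query)) for vec in pooled]
    weights = softmax(scores)
    h, w = len(features), len(features[0])
    attn = [[0.0] * w for _ in range(h)]
    for wt, mask in zip(weights, masks):
        for r in range(h):
            for c in range(w):
                if mask[r][c]:
                    attn[r][c] += wt
    return weights, attn

# Toy example: a 2 x 2 feature map split into a top and a bottom region.
features = [[[1.0, 0.0], [1.0, 0.0]],
            [[0.0, 1.0], [0.0, 1.0]]]
masks = [[[1, 1], [0, 0]],
         [[0, 0], [1, 1]]]
weights, attn = structured_attention(features, masks, query=[1.0, 0.0])
```

Because each weight is spread over a whole mask, the resulting map follows region boundaries instead of a fixed grid, which is the property the abstract attributes to structured attention.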