32nd International Conference on Electrical Engineering
High-Resolution Remote Sensing Image Captioning Based on Structured Attention and SAM Network
Authors:
Yassin Riyazi (1), Seyyed Mostafa Sadjadi (2), Abbas Zohrevand (3), Reshad Hosseini (4)
1- University of Tehran
2- University of Tehran
3- University of Tehran
4- University of Tehran
Keywords:
image captioning, image segmentation, remote sensing image, structured attention
Abstract:
Remote sensing image captioning (RSIC) has gained popularity in recent years because of its broad applications. It remains challenging, however, because the images are low-resolution and their semantic content is highly structured. This work extends the RSIC framework of Zhao et al. [1] by incorporating image labeling and segmentation. The method uses a structured attention module that highlights the important semantic components of an image while preserving their geometric, structured shape. Images from the UCM-captions dataset are upsampled to 512×512 pixels, which improves their quality and edge emphasis. Region proposals are generated with the Segment Anything Model (SAM), which yields better proposals and higher accuracy than traditional techniques, and SAM's promptability makes it possible to obtain a balanced set of large- and small-object masks. Using the model's spatial structure to provide an all-encompassing attention map makes it easier for the decoder to learn a suitable statistical model. The work investigates the effects of several hyperparameters, including teacher forcing, the number of region proposals, and the DSR and AVR loss factors. Overall, combining image labeling and segmentation improves remote sensing image captioning, and the results show that the structured attention module and SAM work well together to improve accuracy.
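The idea of attending over segmented regions rather than a fixed pixel grid can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: it assumes that binary region masks are already available (for example, produced by SAM), and the function name `structured_attention` and the projection matrices `w_feat` and `w_query` are hypothetical names introduced only for illustration.

```python
import numpy as np

def structured_attention(features, masks, query, w_feat, w_query):
    """Region-level attention sketch (illustrative, not the paper's code).

    features: (H, W, C) feature map of the image
    masks:    (R, H, W) binary region masks, assumed given (e.g. from SAM)
    query:    (D,) decoder hidden state
    w_feat:   (C, K) hypothetical projection for region features
    w_query:  (D, K) hypothetical projection for the decoder state
    """
    masks = masks.astype(features.dtype)
    area = np.maximum(masks.sum(axis=(1, 2)), 1.0)  # guard against empty masks

    # Average-pool the feature map inside each mask: one vector per region.
    region_feats = np.einsum("rhw,hwc->rc", masks, features) / area[:, None]

    # Dot-product score between each projected region and the projected query.
    scores = (region_feats @ w_feat) @ (query @ w_query)

    # Numerically stable softmax over regions.
    weights = np.exp(scores - scores.max())
    weights /= weights.sum()

    # Context vector for the decoder and a pixel-level attention map that
    # keeps the shape of each region instead of a rectangular grid cell.
    context = weights @ region_feats
    attention_map = np.einsum("r,rhw->hw", weights, masks)
    return weights, context, attention_map


# Toy usage: a 4x4 feature map split into a left and a right region.
rng = np.random.default_rng(0)
feats = rng.normal(size=(4, 4, 3))
regions = np.zeros((2, 4, 4))
regions[0, :, :2] = 1
regions[1, :, 2:] = 1
w, ctx, amap = structured_attention(
    feats, regions, rng.normal(size=5),
    rng.normal(size=(3, 6)), rng.normal(size=(5, 6)),
)
print(w.shape, ctx.shape, amap.shape)
```

Because each region contributes its whole mask to the attention map, the weights follow object boundaries, which is the property the abstract describes as maintaining a geometric and structured shape.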