32nd International Conference on Electrical Engineering
High-Resolution Remote Sensing Image Captioning Based on Structured Attention and SAM Network
Authors:
Yassin Riyazi (1), Seyyed Mostafa Sadjadi (2), Abbas Zohrevand (3), Reshad Hosseini (4)
1- University of Tehran
2- University of Tehran
3- University of Tehran
4- University of Tehran
Keywords:
image captioning, image segmentation, remote sensing image, structured attention
Abstract:
Remote sensing image captioning (RSIC) has gained popularity in recent years because of its broad applications. However, it poses extra challenges because the images are low-resolution and carry highly structured semantic content. This work extends the RSIC framework developed by Zhao et al. [1] by incorporating image labeling and segmentation. The method uses a structured attention module that highlights important semantic components while preserving their geometric, structured shape. Images from the UCM-captions dataset are upsampled to 512×512 pixels to improve their quality and edge emphasis. The Segment Anything Model (SAM) produces better image proposals than traditional techniques, leading to higher accuracy, and its promptability facilitates a balanced output of large- and small-object masks. By using the model's spatial structure to provide an all-encompassing attention map, the decoder can more easily learn a suitable statistical model. The work investigates the effects of several hyperparameters, including teacher forcing, the number of region proposals, and the DSR and AVR loss factors. Overall, combining image labeling and segmentation improves remote sensing image captioning, and the results show that the structured attention module and SAM work together to improve accuracy.
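The abstract describes attention computed over segmentation regions instead of a fixed pixel grid. The sketch below illustrates that general idea and is not the authors' implementation. It assumes the region masks have already been produced (for example by SAM) and uses plain additive attention. The function name `structured_attention`, the projection parameters `w_feat`, `w_query`, `v`, and all shapes are hypothetical, chosen only for illustration.

```python
import numpy as np

def structured_attention(features, masks, query, w_feat, w_query, v):
    """Illustrative region-level attention (assumed design, not the paper's code).

    features: (H, W, C) feature map
    masks:    (R, H, W) binary region masks, e.g. segmentation proposals
    query:    (D,) decoder hidden state
    w_feat:   (C, A), w_query: (D, A), v: (A,) hypothetical projection weights
    """
    masks = masks.astype(features.dtype)
    # Mask-average pooling: one feature vector per region.
    areas = masks.reshape(len(masks), -1).sum(axis=1)
    region_feats = np.einsum("rhw,hwc->rc", masks, features)
    region_feats = region_feats / np.maximum(areas, 1.0)[:, None]
    # Additive attention score for each region, then a stable softmax.
    scores = np.tanh(region_feats @ w_feat + query @ w_query) @ v
    scores = scores - scores.max()
    weights = np.exp(scores) / np.exp(scores).sum()
    # Spread each region's weight over its mask so the map follows region shapes.
    attn_map = np.einsum("r,rhw->hw", weights, masks)
    # Context vector passed to the decoder.
    context = weights @ region_feats
    return weights, attn_map, context
```

Because each region's weight is spread over its own mask, the resulting attention map follows object boundaries, which is one way to read the abstract's point about keeping a geometric, structured shape.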