0% Complete
صفحه اصلی
/
سی و دومین کنفرانس بین المللی مهندسی برق
High-Resolution Remote Sensing Image Captioning Based on Structured Attention and SAM Network
نویسندگان :
Yassin Riyazi
1
Seyyed Mostafa Sadjadi
2
Abbas Zohrevand
3
Reshad Hosseini
4
1- دانشگاه تهران
2- دانشگاه تهران
3- دانشگاه تهران
4- دانشگاه تهران
کلمات کلیدی :
image captioning،image segmentation،remote sensing image،structured attention
چکیده :
Due to its broad applications, remote sensing image captioning (RSIC) has gained popularity in recent years. However, it poses extra challenges for containing low-resolution images with highly structured semantic content. By incorporating image labeling and segmentation, this work expands on the RSIC framework developed by Zhao et al. [1]. The method presents a structured attention module that highlights important semantic components to maintain a geometric and structured shape. The quality and edge emphasis of UCM-captioned photographs is improved by upsampling them to 512×512 pixels. Using the Segment Anything Model (SAM) produces better image proposals, leading to higher accuracy than traditional techniques. A balanced output of large- and small-object masks is facilitated by SAM's promptability. The decoder can more easily learn a suitable statistical model using the model's spatial structure to provide an all-encompassing attention map. The effects of multiple hyperparameters, such as teacher forcing, the number of region proposals, and the effects of DSR and AVR loss factors, are investigated in this work. Overall, by combining image labeling and segmentation, this research improves remote sensing capabilities. It also shows how well the structured attention module and SAM work together to improve accuracy and consider different hyperparameter issues.
لیست مقالات
لیست مقالات بایگانی شده
Shielding factor enhancement method for Bi-stage active shield in SQUID-based Magnetocardiography system
Zeynab Alipour - Fatemeh Esmaili - Faezeh Shanehsazzadeh - Mehdi Fardmanesh
Strategic Offering of a Virtual Power Plant in Energy Markets Under Contingency Conditions: A Hybrid Stochastic Robust Optimization Approach
Elahe Ghanaee - Morteza Rahimiyan
Low-Leakage 6T SRAM Cell for In-Memory Computing with High Stability
Deniz Najafi - Behzad Ebrahimi
Microgrid Damping Improvement Using High-Pass Filter-Based Virtual Synchronous Generator
Shayan Zaimi - Ashkan Moradi Naserkhani - Sharara Rehimi - Amin Karimi - Rahmatollah Mirzaei - Hassan Bevrani
Electricity Tariff Volatility Mitigation Using Uncertainty-Diminution and Hedge Contracts along with Risk Management Policies
Majid Moazzami - Hossein Shahinzadeh - Majid Najafi - Zohreh Azani - Shohreh Azani - Gevork B. Gharehpetian
Fusion of Multi-Level CNN With LBP Features For Facial Emotion Recognition
Ehsan Bahmanabady - Maryam Imani - Hassan Ghassemian
Electrical Properties of Dielectric Barrier Discharge Plasma Actuator In Argon With 13.56MHz RF Power Supply
Sepideh Bashiry - Nayyereh Zahednia - Mehdi Bakhshzad Mahmoudi
طراحی و شبیه سازی یک فراسطح بازتابی با قابلیت تحقق الگوی تشعشعی هم شار با قطبش های خطی و دایروی در باند X مناسب برای ماهواره سنجشی
مجید کریمی پور - ایمان آریانیان
طراحی بهینه ی آرایه ی تُنُک بی افزونگی با فاصله ی ناصحیح میان عناصر
سید محمد حسینی - محمود کریمی
طراحی و ساخت چرخاننده سهدرگاهی صفحه E در موجبر باند X
زهرا عابدان - محمد حسین حسینی
بیشتر
ثمین همایش، سامانه مدیریت کنفرانس ها و جشنواره ها - نگارش 42.8.0