32nd International Conference on Electrical Engineering
A Transformer-Based Model for Similar Fashion Image Retrieval with Image and Text Features
Authors:
Zahra Sheykhvand (1), Milad Farzalizadeh (2), Majid Meghdadi (3)
1- University of Zanjan
2- University of Zanjan
3- University of Zanjan
Keywords:
Fashion Image Retrieval, Transformer, Multimodal Fusion, Image Similarity, Computer Vision
Abstract:
Similar-fashion retrieval systems have various applications in online shops and image-based recommender systems. Online shops rely on textual product metadata to let customers search for products. However, customer satisfaction is not guaranteed, because inaccurate metadata and incorrect product categorization limit such searches. These issues cause confusion and make it harder for customers to find the products they want. A similar-image retrieval system can improve user satisfaction by speeding up image search and helping customers find the desired products. This paper proposes a Transformer-based architecture, designed specifically for searching and retrieving similar images in the fashion domain, that uses both visual and descriptive product features for more efficient retrieval. In this architecture, feature extraction and the image and text vector embeddings are crucial for establishing similarity. Therefore, the DeiT, BLIP, and BERT transformer models are employed. Since previous research relied solely on image features to determine similarity, this paper incorporates textual features, in the form of additional product descriptions, to obtain more accurate matches. Evaluation of the proposed architecture on the DeepFashion dataset shows a 41.5% improvement in recall over the baseline paper and several previous studies.
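The abstract does not say how the image and text embeddings are combined or how similarity is scored. As a rough illustration only, the sketch below assumes the two vectors are L2-normalized and concatenated, that items are ranked by cosine similarity, and that recall is the fraction of relevant items found in the top k. The toy vectors stand in for real DeiT, BLIP, or BERT outputs, and all function names are hypothetical rather than taken from the paper.

```python
import math

def l2_normalize(vec):
    """Scale a vector to unit length (zero vectors are returned unchanged)."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm > 0 else list(vec)

def fuse(image_vec, text_vec):
    """Assumed fusion: concatenate the normalized image and text vectors,
    then renormalize so the fused vector has unit length."""
    return l2_normalize(l2_normalize(image_vec) + l2_normalize(text_vec))

def cosine(a, b):
    """Cosine similarity of two unit-length vectors (a plain dot product)."""
    return sum(x * y for x, y in zip(a, b))

def retrieve(query, gallery, k):
    """Return the ids of the k gallery items most similar to the query."""
    ranked = sorted(gallery, key=lambda item_id: cosine(query, gallery[item_id]),
                    reverse=True)
    return ranked[:k]

def recall_at_k(retrieved_ids, relevant_ids):
    """Assumed metric: fraction of relevant items present in the retrieved list."""
    if not relevant_ids:
        return 0.0
    hits = len(set(retrieved_ids) & set(relevant_ids))
    return hits / len(relevant_ids)

# Toy 2-D "embeddings"; real model outputs have hundreds of dimensions.
gallery = {
    "dress_a": fuse([0.9, 0.1], [0.8, 0.2]),
    "dress_b": fuse([0.8, 0.2], [0.9, 0.1]),
    "shoe_c": fuse([0.1, 0.9], [0.2, 0.8]),
}
query = fuse([0.85, 0.15], [0.85, 0.15])

top2 = retrieve(query, gallery, k=2)
recall = recall_at_k(top2, {"dress_a", "dress_b"})
```

Concatenation gives the image and text parts equal weight after normalization. A weighted sum or a learned fusion layer would be equally plausible readings of "multimodal fusion", and the abstract does not say which one the authors use.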