0% Complete
صفحه اصلی
/
سی و دومین کنفرانس بین المللی مهندسی برق
A Transformer-Based Model for Similar Fashion Image Retrieval with Image and Text Features
نویسندگان :
Zahra Sheykhvand
1
Milad Farzalizadeh
2
Majid Meghdadi
3
1- دانشگاه زنجان
2- دانشگاه زنجان
3- دانشگاه زنجان
کلمات کلیدی :
Fashion Image Retrieval،Transformer،Multimodal Fusion،Image Similarity،Computer Vision
چکیده :
Similar fashion retrieval system has various applications in online shops and image-based recommender systems. Online shops employ textual metadata of products to search for products by customers. However, customer satisfaction is not fully guaranteed due to limitations caused by inaccuracies in input metadata and incorrect product categorization. These issues lead to confusion and hinder the attainment of desired products. This system significantly enhances user satisfaction by expediting image searches and finding desired products. This paper proposes a Transformer-based architecture specifically designed for searching and retrieving similar images in the fashion domain that uses both visual and descriptive product features for more efficient retrieval. In this architecture, the features extracting and image and text vector embeddings are crucial for establishing similarity. Therefore, DeiT, BLIP, and BERT transformer models have been employed. Since previous research focused solely on image features to determine similarity, this paper incorporates textual features as additional product descriptions to achieve the most accurate matches. The Evaluation of the proposed architecture on the DeepFashion data set demonstrates a remarkable 41.5% improvement in recall compared to the baseline paper and several previous research.
لیست مقالات
لیست مقالات بایگانی شده
A Novel Step-up Converter Based on Active Network and Coupled-Inductor Technique with Soft Switching Operation
Mohammadreza Zeynalhosseyni - Reza Beiranvand
تجزیه وابستگی با استفاده از Q-Learning محافظه کار
امیر زارعی - علیرضا خیاطیان - پیمان ستوده
A Two-Step Stochastic Market-Oriented Approach for Optimal Operation of Commercial VPPs under Uncertainty
Jalal Moradi - Hossein Shahinzadeh - Ahmad Hafezimagham - Gevork B. Gharehpetian - S.M. Muyeen - Mohamed Benbouzid
Multi-Objective Concurrent Kernel Scheduling for Multi-GPU Systems
Negar Baradar Alizadeh - Mahmoud Momtazpour
تولید ریزداپلر راداری بدن انسان با استفاده از آموزش شبکه مولد متقابل کانولوشنال عمیق
مهدی استوان - صادق صمدی - علیرضا کاظمی
A New High Voltage Gain Z-Source Based DC-DC Converter for High-Power DG Applications
Sakina Bakhshi - Reza Beiranvand
High efficiency Continuous class J/B power amplifier design with 130% Fractional Bandwidth
Sara Aghajani - Mahmoud Kamarei - Marzieh Chegini
Design of a 2MW Medium Voltage Conventional Hybrid DC Circuit Breaker for Railway Application
Seyed Hamid Khalkhali - Mohsen Taghizadeh Kejani - Ali Asghar Razi Kazemi
یک روش جدید در تشخیص اختلال طیف اوتیسم از تصاویر چهره کودکان با استفاده از معماری چندمقیاسی MS-ViT و پردازش لبهای
خسرو رضائی - طیبه شمولی جوانمردی - امیر محمد حیدری
Φ-OTDR Event Classification Using Machine Learning and Optical Signal Processing
Amir Babaoughli - Tohid Alizadeh - Seyed Sadra Kashef
بیشتر
ثمین همایش، سامانه مدیریت کنفرانس ها و جشنواره ها - نگارش 42.3.2