0% Complete
صفحه اصلی
/
سی و دومین کنفرانس بین المللی مهندسی برق
A Transformer-Based Model for Similar Fashion Image Retrieval with Image and Text Features
نویسندگان :
Zahra Sheykhvand
1
Milad Farzalizadeh
2
Majid Meghdadi
3
1- دانشگاه زنجان
2- دانشگاه زنجان
3- دانشگاه زنجان
کلمات کلیدی :
Fashion Image Retrieval،Transformer،Multimodal Fusion،Image Similarity،Computer Vision
چکیده :
Similar fashion retrieval system has various applications in online shops and image-based recommender systems. Online shops employ textual metadata of products to search for products by customers. However, customer satisfaction is not fully guaranteed due to limitations caused by inaccuracies in input metadata and incorrect product categorization. These issues lead to confusion and hinder the attainment of desired products. This system significantly enhances user satisfaction by expediting image searches and finding desired products. This paper proposes a Transformer-based architecture specifically designed for searching and retrieving similar images in the fashion domain that uses both visual and descriptive product features for more efficient retrieval. In this architecture, the features extracting and image and text vector embeddings are crucial for establishing similarity. Therefore, DeiT, BLIP, and BERT transformer models have been employed. Since previous research focused solely on image features to determine similarity, this paper incorporates textual features as additional product descriptions to achieve the most accurate matches. The Evaluation of the proposed architecture on the DeepFashion data set demonstrates a remarkable 41.5% improvement in recall compared to the baseline paper and several previous research.
لیست مقالات
لیست مقالات بایگانی شده
یک روش اقتصادی برای تعیین مکان بهینه ریکلوزرها در فیدرهای توزیع شعاعی با هدف بهبود قابلیت اطمینان
محمودرضا شاکرمی - میثم دوستی زاده - هومن بسطامی - مهران امیری - ابراهیم شریفی پور - شمس الدین کمالوند
Infrared Small Target Detection Based on Directional Mean Difference and Compactness
Mohammad Rahbari Dust - MASOUMEH AZGHANI
بازسازی تصاویر رادار دهانه مصنوعی با استفاده از نمایش تنک مبتنی بر گروه
محبوبه خدرزاده - صادق صمدی
Design and Simulation of a Novel High Sensitive MEMS Microphone Based On a Spring-Supported Circular Diaphragm
Mehdi Pazhooh - Ebrahim Abbaspour-Sani
Implementation of a 14-Channel Real-time Compact Data Logger for Structure and Mechanical Engineering Laboratories
Keivan Sadeghinezhad - Esmaeil Najafiaghdam - Sara Dezhakam - Ali Sadeghinezhad
LoRa-based Intelligent Helmet for Coal Miner Safety: Neural Network Prediction and BLE Location Tracking
Saba Pirahmadian - Sorin Yousefnia - Soheil Ganjefar
Average Secrecy Capacity Performance Analysis for SWIPT-Based SIMO Underlay Cognitive Radio
Mohammad Javad Saber1 - Seyedeh Maryam Mazloum - Seyed Mohammad Sajad Sadough
A New Unsupervised Feature Learning Method for Object Recognition using Prior-Knowledge Data
Ashkan Farrokhi - Hadi Seyedarabi
Adaptive fault tolerant neural control of heterogeneous second-order multi-agent systems
Mohammad Hadi Rezaei - Ali Abooee
A Novel Estimation Law for Impedance-Controlled Bilateral Teleoperation to Enhance Human-Environment Interaction
Mobina Kameli - Mohammad Motaharifar - Negin Sayyaf
بیشتر
ثمین همایش، سامانه مدیریت کنفرانس ها و جشنواره ها - نگارش 42.0.4